Particle Physics 


Chapter 1 
Introduction to Accelerator Physics 


The development of charged particle accelerators and it’s underlying principles 
has its basis on the theoretical and experimental progress in fundamental physical 
phenomena. While active particle accelerator experimentation started seriously 
only in the twentieth century, it depended on the basic physical understanding of 
electromagnetic phenomena as investigated both theoretically and experimentally 
mainly during the nineteenth and beginning twentieth century. In this introduction 
we will recall briefly the history leading to particle accelerator development, 
applications and introduce basic definitions and formulas governing particle beam 
dynamics. 


1.1 Short Historical Overview 


The history and development of particle accelerators is intimately connected to 
the discoveries and understanding of electrical phenomena and the realization that 
the electrical charge comes in lumps carried as a specific property by individual 
particles. It is reported that the Greek philosopher and mathematician Thales of 
Milet, who was born in 625 BC first observed electrostatic forces on amber. The 
Greek word for amber is electron or nAextpov and has become the origin for 
all designations of electrical phenomena and related sciences. For more than 2000 
years this observation was hardly more than a curiosity. In the eighteenth century, 
however, electrostatic phenomena became quite popular in scientific circles and 
since have been developed into a technology which by now completely embraces 
and dominates modern civilization as we know it. 

It took another 100 years before the carriers of electric charges could be isolated. 
Many systematic experiments were conducted and theories developed to formulate 
the observed electrical phenomena mathematically. It was Coulomb, who in 1785 
first succeeded to quantify the forces between electrical charges which we now call 
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Coulomb forces. As more powerful sources for electrical charges became available, 
glow discharge phenomena were observed and initiated an intensive effort on 
experimental observations during most of the second half of the nineteenth century. 
It was the observations of these electrical glow discharge phenomena that led the 
scientific community to the discovery of elementary particles and electromagnetic 
radiation which are both basic ingredients for particle acceleration. 

Research leading to the discovery of elementary particles and to ideas for the 
acceleration of such particles is dotted with particularly important milestones which 
from time to time set the directions for further experimental and theoretical research. 
It is obviously somewhat subjective to choose which discoveries might have been 
the most influential. Major historical discoveries leading to present day particle 
accelerator physics started to happen more than a 150 years ago: 


1815 The physician and chemist W. Proust postulates, initially anonymous, that 
all atoms are composed of hydrogen atoms and that therefore all atomic weights 
come in multiples of the weight of a hydrogen atom. 

1839 M. Faraday [1] publishes his experimental investigations of electricity and 
described various phenomena of glow discharge. 

1858 J. Pliicker [2] reports on the observation of cathode rays and their deflection 
by magnetic fields. He found the light to become deflected in the same spiraling 
direction as Ampere’s current flows in the electromagnet and therefore postulated 
that the electric light, as he calls it, under the circumstances of the experiment 
must be magnetic. 

1867  L. Lorenz working in parallel with J.C. Maxwell on the theory of electro- 
magnetic fields formulates the concept of retarded potentials although not yet for 
moving point charges. 

1869 J.W. Hittorf [3], a student of Pliicker, started his thesis paper with the 
statement (translated from german): “The undisputed darkest part of recent 
theory of electricity is the process by which in gaseous volumes the propagation 
of electrical current is effected”. Obviously observations with glow discharge 
tubes displaying an abundance of beautiful colors and complicated reactions 
to magnetic fields kept a number of researchers fascinated. Hittorf conducted 
systematic experiments on the deflection of the light in glow discharges by 
magnetic fields and corrected some erroneous interpretations by Pliicker. 

1871 C.F Varley postulates that cathode rays are particle rays. 

1874  H. von Helmholtz postulates atomistic structure of electricity. 

1883 J.C. Maxwell publishes his Treatise on Electricity and Magnetism. 

1883 _T.A. Edison discovers thermionic emission. 

1886 E. Goldstein [4] observed positively charged rays which he was able to 
isolate from a glow discharge tube through channels in the cathode. He therefore 
calls these rays Kanalstrahlen. 

1887  H. Hertz discoveries transmission of electromagnetic waves and photoelec- 
tric effect. 

1891 GJ. Stoney introduces the name electron. 
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1895 H.A. Lorentz formulates electron theory, the Lorentz force equation and 
Lorentz contraction. 

1894 P. Lenard builts a discharge tube that allows cathode rays to exit to 
atmospheric air. 

1895 W. Rontgen discovers x-rays. 

1895 E. Wiedemann [5] reports on a new kind of radiation studying electrical 
sparks. 

1897 J.J. Thomson measures the e/m-ratio for kanal and cathode rays with 
electromagnetic spectrometer and found the e/m ratio for cathode rays to be 
larger by a factor of 1,700 compared to the e/m ratio for kanal rays. He concluded 
that cathode rays consist of free electricity giving evidence to free electrons. 

1897 J. Larmor formulates concept of Larmor precession. 

1898 A. Liénard calculates the electric and magnetic field in the vicinity of 
a moving point charge and evaluated the energy loss due to electromagnetic 
radiation from a charged particle travelling on a circular orbit. 

1900 EE. Wiechert derives expression for retarded potentials of moving point 
charges. 

1901  W. Kaufmann, first alone, and in 1907 together with A.H. Bucherer measure 
increase of electron mass with energy. First experiment in support of theory of 
special relativity. 

1905 A. Einstein publishes theory of special relativity. 

1906 J.J. Thomson [6] explains the emission of this radiation as being caused by 
acceleration occurring during the collision of charged particles with other atoms 
and calculated the energy emitted per unit time to be (2e?f *)/(3V), where e is 
the charge of the emitting particle, f the acceleration and V the velocity of light. 

1907. G.A. Schott [7, 8] formulated the first theory of synchrotron radiation in an 
attempt to explain atomic spectra. 

1909 _ R.A. Millikan starts measuring electric charge of electron. 

1913 First experiment by J. Franck and G. Hertz to excite atoms by accelerated 
electrons. 

1914 E. Marsden produces first proton beam irradiating paraffin with alpha 
particles. 

1920 _ H. Greinacher [9] builts first cascade generator. 

1922 R. Wideroe as a graduate student sketches ray transformer (betatron). 

1924 G._ Ising [10] invents as a student the electron linac with drift tubes and spark 
gap excitation. 

1928 R. Wideroe [11] reports first operation of linear accelerator with potassium 
and sodium ions. Discusses operation of betatron and failure to get beam for lack 
of focusing. 

1928 P.A.M. Dirac predicts existence of positrons. 

1931 RJ. Van de Graaff [12] builts first high voltage generator. 

1932 Lawrence and Livingston [13] accelerate first proton beam from 1.2 MeV 
cyclotron employing weak focusing. 
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1932 J.D. Cockcroft and E.T.S. Walton [14] use technically improved cascade 
generator to accelerate protons and initiate first artificial atomic reaction: Li + 
p— 2He. 

1932 inthe same year, C.D. Andersen discovers positrons, neutrons were discov- 
ered by J. Chadwick, and H.C. Urey discoveries deuterons. 

1939 W.W. Hansen, R. Varian and his brother S. Varian invent klystron microwave 
tube at Stanford. 

1941 D.W. Kerst and R. Serber [15] complete first functioning betatron. 

1941 B. Touschek and R. Wideroe formulate storage ring principle. 

1944 _D. Ivanenko and I.Ya. Pomeranchuk [16] and J. Schwinger [17] point out 
independently an energy limit in circular electron accelerators due to synchrotron 
radiation losses. 

1945 VI. Veksler [18] and E.M. McMillan [19] independently discover the 
principle of phase focusing. 

1945 J.P. Blewett [20] experimentally discovers synchrotron radiation by measur- 
ing the energy loss of electrons. 

1947 L.W. Alvarez [21] designs first proton linear accelerator at Berkeley. 

1948 E.L. Ginzton et al. [22] accelerate electrons to 6MeV with Mark I at 
Stanford. 

1949 McMillan et al. commissioned 320 MeV electron synchrotron. 

1950 N. Christofilos [23] formulates concept of strong focusing. 

1952 M.S. Livingston et al. [24] describe design for 2.2GeV Cosmotron in 
Brookhaven. 

1951 H. Motz [25] builds first wiggler magnet to produce quasi monochromatic 
synchrotron radiation. 

1952 EE. Courant et al. [26] publish first paper on strong focusing. 

1954 R.R. Wilson et al. operate first AG electron synchrotron in Cornell at 
1.1 GeV. 

1954 Lofgren et al. accelerate protons to 5.7 GeV in Bevatron. 

1955 M. Chodorow et al. [27] complete 600 MeV MARK IIT electron linac. 

1955. M. Sands [28] define limits of phase focusing due to quantum excitation. 

1959 EE. Courant and Snyder [29] publish their paper on the Theory of the 
Alternating-Gradient Synchrotron. 


Research and development in accelerator physics blossomed significantly during 
the 1950s supported by the development of high power radio frequency sources 
and the increased availability of government funding for accelerator projects. 
Parallel with the progress in accelerator technology, we also observe advances in 
theoretical understanding, documented in an increasing number of publications. 
It is beyond the scope of this text to only try to give proper credit to all major 
advances in the past 60 years and refer the interested reader to more detailed 
references. 


1.2 Particle Accelerator Systems 7 
1.2 Particle Accelerator Systems 


Particle accelerators come in many forms applying a variety of technical principles. 
All are based on the interaction of the electric charge with static and dynamic 
electromagnetic fields and it is the technical realization of these interactions that 
leads to the different types of particle accelerators. Electromagnetic fields are used 
over most of the available frequency range from static electric fieldsto ac magnetic 
fields in betatrons oscillating at 50 or 60 Hz, to radio frequency fields in the MHz 
to GHz range and ideas are being explored to use laser beams to generate high field 
particle acceleration. 

In this text, we will not discuss the different technical realization of particle 
acceleration but rather concentrate on basic principles which are designed to help the 
reader to develop technical solutions for specific applications meeting basic beam 
stability requirements. For particular technical solutions we refer to the literature. 
Further down we will discuss briefly basic accelerator types and their theoretical 
back ground. Furthermore, to discuss basic principles of particle acceleration and 
beam dynamics it is desirable to stay in contact with technical reality and reference 
practical and working solutions. We will therefore repeatedly refer to certain types 
of accelerators and apply theoretical beam dynamics solutions to exhibit the salient 
features and importance of the theoretical ideas under discussion. In these references 
we use mostly such types of accelerators which are commonly used and are 
extensively publicized. 


1.2.1 Main Components of Accelerator Facilities 


In the following paragraphs we describe components of particle accelerators in 
a rather cursory way to introduce the terminology and overall features. Particle 
accelerators consist of two basic units, the particle source or injector and the main 
accelerator. The particle source comprises all components to generate a beam of 
desired particles. 

Generally glow discharge columns are used to produce proton or ion beams, 
which then are first accelerated in electrostatic accelerators like a Van de Graaff 
or Cockcroft-Walton accelerator and then in an Alvarez-type linear accelerator. To 
increase the energy of heavy ion beams the initially singly charged ions are, after 
some acceleration, guided through a thin metal foil to strip more electrons off the 
ions. More than one stripping stage may be used at different energies to reach the 
maximum ionization for most efficient acceleration. 

Much more elaborate measures must be used to produce antiprotons. Generally 
a high energy proton beam is aimed at a heavy metal target, where, through 
hadronic interactions with the target material, among other particles antiprotons 
are generated. Emerging from the target, these antiprotons are collected by strong 
focusing devices and further accelerated. 
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Electrons are commonly generated from a heated cathode, also called a 
thermionic gun, which is covered on the surface by specific alkali oxides or any 
other substance with a low work function to emit electrons at technically practical 
temperatures. Another method to create a large number of electrons within a short 
pulse uses a strong laser pulse directed at the surface of a photo cathode. Systems 
where the cathode is inserted directly into an accelerating rf field are called rf 
guns. Positrons are created the same way as antiprotons by aiming high energy 
electrons on a heavy metal target where, through an electromagnetic shower and 
pair production, positrons are generated. These positrons are again collected by 
strong magnetic fields and further accelerated. 

Whatever the method of generating particles may be, in general they do not have 
the time structure desired for further acceleration or special application. Efficient 
acceleration by rf fields occurs only during a very short period per oscillation cycle 
and most particles would be lost without proper preparation. For high beam densities 
it is desirable to compress the continuous stream of particles from a thermionic 
gun or a glow discharge column into a shorter pulse with the help of a chopper 
device and/or a prebuncher. The chopper may be a mechanical device or a deflecting 
magnetic or rf field moving the continuous beam across the opening of a slit. At the 
exit of the chopper we observe a series of beam pulses, called bunches, to be further 
processed by the prebuncher. Here early particles within a bunch are decelerated 
and late particles accelerated. After a well defined drift space, the bunch length 
becomes reduced due to the energy dependence of the particle velocity. Obviously 
this compression works only as long as the particles are not relativistic while the 
particle velocity can be modulated by acceleration or deceleration. 

No such compression is required for antiparticles, since they are produced by 
high energetic particles having the appropriate time structure. Antiparticle beams 
emerging from a target have, however, a large beam size and beam divergence. 
To make them suitable for further acceleration they are generally stored for some 
time in a cooling or damping ring. Such cooling rings are circular “accelerators” 
where particles are not accelerated but spend just some time circulating. Positrons 
circulating in such storage rings quickly lose their transverse momenta and large 
beam divergence through the emission of synchrotron radiation. In the case of 
antiprotons, external fields are applied to damp the transverse beam size or 
they circulate against a strong counterrotating electron beam loosing transverse 
momentum through scattering. 

Antiparticles are not always generated in large quantities. On the other hand, the 
accelerator ahead of the conversion target can often be pulsed at a much higher rate 
than the main accelerator can accept injection. In such cases, the antiparticles are 
collected from the rapid cycling injector in an accumulator ring and then transferred 
to the main accelerator when required. 

Particle beams prepared in such a manner may now be further accelerated in 
linear or circular accelerators. A linear accelerator consists of a linear sequence 
of many accelerating units where accelerating fields are generated and timed such 
that particles absorb and accumulate energy from each acceleration unit. Most 
commonly used linear accelerators consist of a series of cavities excited by 
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radio frequency sources to high accelerating fields. In the induction accelerator , 
each accelerating unit consists of a transformer which generates from an external 
electrical pulse a field on the transformer secondary which is formed such as to allow 
the particle beam to be accelerated. Such induction accelerators can be optimized to 
accelerate very high beam currents to medium beam energies. 

For very high beam energies linear accelerators become very long and costly. 
Such practical problems can be avoided in circular accelerators where the beam is 
held on a circular path by magnetic fields in bending magnets and passing repeatedly 
every turn through accelerating sections, similar to those in a linear accelerator. This 
way, the particles gain energy from the accelerating cavities at each turn and reach 
higher energies while the fields in the bending magnets are raised. 

The basic principles to accelerate particles of different kind are similar and 
we do not need to distinguish between protons, ions, and electrons. Technically, 
individual accelerator components differ more or less to adjust to the particular 
beam parameters which have mostly to do with the particle velocities. For highly 
relativistic particles the differences in beam dynamics vanish. Protons and ions 
are more likely to be nonrelativistic and therefore vary the velocity as the kinetic 
energy is increased, thus generating problems of synchronism with the oscillating 
accelerating fields which must be solved by technical means. 

After acceleration in a linear or circular accelerator the beam can be directed onto 
a target, mostly a target of liquid hydrogen, to study high energy interactions with 
the target protons. Such fixed target experimentation dominated nuclear and high 
energy particle physics from the first applications of artificially accelerated particle 
beams far into the 1970s and is still a valuable means of basic research. Obviously, 
it is also the method in conjunction with a heavy metal target to produce secondary 
particles like antiparticles for use in colliding beam facilities and mesons for basic 
research. 

To increase the center-of-mass energy for basic research, particle beams are 
aimed not at fixed targets but to collide head on with another beam. This is one 
main goal for the construction of colliding beam facilities or storage rings. In 
such a ring, particle and antiparticle beams are injected in opposing directions and 
made to collide in specifically designed interaction regions. Because the interactions 
between counter orbiting particles is very rare, storage rings are designed to allow 
the beams to circulate for many turns with beam life times of several hours to give 
the particles ample opportunity to collide with other counter rotating particles. Of 
course, beams can counter rotate in the same magnetic fields only if one beam is 
made of the antiparticles of the other beam while two intersecting storage rings 
must be employed to allow the collision of unequal particles. 

The circulating beam in an electron storage ring emits synchrotron radiation 
due to the transverse acceleration during deflection in the bending magnets. This 
radiation is highly collimated in the forward direction, of high brightness and 
therefore of great interest for basic and applied research, technology, and medicine. 

Basically the design of a storage ring is the same as that for a synchrotron 
allowing some adjustment in the technical realization to optimize the desired 
features of acceleration and long beam lifetime, respectively. Beam intensities are 
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generally very different in a synchrotron from that in a storage ring. In a synchrotron, 
the particle intensity is determined by the injector and this intensity is much smaller 
than desired in a storage ring. The injection system into a storage ring is therefore 
designed such that many beam pulses from a linear accelerator, an accumulator ring 
or a synchrotron can be accumulated. A synchrotron serving to accelerate beam 
from a low energy preinjector to the injection energy of the main facility, which 
may be a larger synchrotron or a storage ring, is also called a booster synchrotron 
or short a booster. 

Although a storage ring is not used for particle acceleration it often occurs that a 
storage ring is constructed long after and for a higher beam energy than the injector 
system. In this case, the beam is accumulated at the maximum available injection 
energy. After accumulation the beam energy is slowly raised in the storage ring to 
the design energy by merely increasing the strength of the bending and focusing 
magnets. 

Electron positron storage rings have played a great role in basic high-energy 
research. For still higher collision energies, however, the energy loss due to 
synchrotron radiation has become a practical and economic limitation. To avoid 
this limit, beams from two opposing linear accelerators are brought into head 
on collision at energies much higher than is possible to produce in circular 
accelerators. To match the research capabilities in colliding beam storage rings, 
such linear colliders must employ sophisticated beam dynamics controls, focusing 
arrangements and technologies similar to X-ray laser systems now operating. 


1.2.2. Applications of Particle Accelerators 


Particle accelerators are mainly known for their application as research tools in 
nuclear and high energy particle physics requiring the biggest and most energetic 
facilities. Smaller accelerators, however, have found broad applications in a wide 
variety of basic research and technology, as well as medicine. In this text, we will 
not discuss the details of all these applications but try to concentrate only on the 
basic principles of particle accelerators and the theoretical treatment of particle 
beam dynamics and instabilities. An arbitrary and incomplete listing of applications 
for charged particle beams and their accelerators is given for reference to the 
interested reader: 


Nuclear physics 
Electron/proton accelerators 
Ton accelerators/colliders 
Continuous beam facility 
High-energy physics 

Fixed target accelerator 
Colliding beam storage rings 
Linear colliders 


Power generation 

Inertial fusion 

Reactor fuel breeding 
Industry 

Radiography by x-rays 

Ton implantation 

Isotope production/separation 
Materials testing/modification 
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Food sterilization Coherent radiation 

X-ray lithography Free electron lasers, X-FEL 
Synchrotron radiation Microprobe 

Basic atomic and molecular physics Holography 

Condensed matter physics Medicine 

Earth sciences Radiotherapy 

Material sciences Health physics 

Chemistry Microsurgery with tunable FEL 
Molecular and cell biology Sterilization 


Surface/interface physics 


This list is by no means exhaustive and additions must be made at an impressive 
pace as the quality and characteristics of particle beams become more and more 
sophisticated, predictable and controllable. Improvements in any parameter of 
particle beams create opportunities for new experiments and applications which 
were not possible before. More detailed information on specific uses of particle 
accelerators as well as an extensive catalogue of references has been compiled by 
Scharf [30]. 


1.3 Definitions and Formulas 


Particle beam dynamics can be formulated in a variety units and it is therefore 
prudent to define the units used in this text to avoid confusion. In addition, we 
recall fundamental relations of electromagnetic fields and forces as well as some 
laws of special relativity to the extend that will be required in the course of 
discussions. 


1.3.1 Units and Dimensions 


A set of special physical units, selected primarily for convenience, are most 
commonly used to quantify physical constants in accelerator physics. The use of 
many such units is often determined more by historical developments than based on 
the choice of a consistent set of quantities useful for accelerator physics. 

Generally, accelerator physics theory is formulated in the metric mks-system 
of units or SJ-units which we follow also in this text. For readers used to cgs 
units, we include here conversion tables for convenience. To measure the energy 
of charged particles the unit Joule is actually used very rarely. The basic unit 
of energy in particle accelerator physics is the electron Volt (eV), which is the 
kinetic energy a particle with one basic unit of electrical charge e would gain while 
being accelerated between two conducting plates at a potential difference of 1 V. 
Therefore, 1 eV is equivalent to 1.60217733 x 10~!° J. Specifically, we will often 
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use derivatives of the basic units to express actual particle energies in a convenient 
form: 


1 keV = 1000 eV; 1 MeV = 10° eV; 1 GeV = 10° eV; 1 TeV = 10% eV 


To describe particle dynamics we find it necessary to sometimes use the particle’s 
momentum and sometimes the particle’s energy. The effect of the Lorentz force 
from electric or magnetic fields is inversely proportional to the momentum of the 
particle. Acceleration in rf fields, on the other hand, is most conveniently measured 
by the increase in kinetic or total energy. 

In an effort to simplify the technical jargon used in accelerator physics the term 
energy is used for all three quantities although mathematically the momentum is 
then multiplied by the velocity of light for dimensional consistency. There are still 
numerical differences which must be considered for all but very highly relativistic 
particles. Where we need to mention the pure particle momentum and quote a 
numerical value, we generally use the total energy divided by the velocity of light 
with the unit eV/c. With this definition a particle of energy cp = 1 eV would have a 
momentum of p = 1 eV/c. 

An additional complication arises in the case of composite particles like heavy 
ions, consisting of protons and neutrons. In this case, the particle energy is not 
quoted for the whole ion but in terms of the energy per nucleon. 

The particle beam current is measured generally in Amperes, no matter what 
general system of units is used but also occasionally in terms of the total charge or 
number of particles. The current is then the total charge Q passing a point during 
the time ¢. Depending on the time duration one gets an instantaneous current or 
some average current. Therefore a quotation of the particle current requires also the 
definition of the time structure of the beam. In circular accelerators, for example, 
the average beam current / relates directly to the beam intensity or the number 
of circulating particles N. If Bc is the velocity of the particle and Z the charge 
multiplicity, we get for the relation of beam current and beam intensity 


1 =eZfeN, (1.1) 


where the revolution frequency frey = {c/C and C is the circumference of the 
circular accelerator. This is the average circulating current to be distinguished from 
the bunch current or peak bunch current, which is the charge per bunch q divided by 
the duration of the bunch. 

For a linear accelerator or beam transport line where particles come by only 
once, the definition of the beam current is more subtle. We still have a simple case 
if the particles come by in a continuous stream in which case the beam current is 
proportional to the particle flux N or I = eZN. This case, however, occurs very 
rarely since particle beams are generally accelerated by rf fields. As a consequence 
there is no continuous flux of particles reflecting the time varying acceleration of 
the rf field. The particle flux therefore is better described by a series of equidistant 
particle bunches separated by an integral number of wavelengths of the accelerating 
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Table 1.1. Numerical 


: Quantity Replace cgs parameter by practical units 
conversion factors : 
Potential lesu | 300V 
Electrical field | 1esu | 310*V/m 
Current lesu |0.1-cA 
Charge lesu | 0.333310~°C 
Force 1 dyn |10-°N 
Energy leV_ | 1.602107!°J 


leV_ | 1.60210~" erg 


rf field. Furthermore, the acceleration often occurs only in bursts or pulses producing 
either a single bunch of particles or a string of many bunches. In these cases 
we distinguish between different current definitions. The peak current is the peak 
instantaneous beam current for a single bunch, while the average current is defined 
as the particle flux averaged over the duration of the beam pulse or any other given 
time period, e.g. 1s. 

Magnetic fields are quoted either in Tesla or Gauss.! Similarly, field gradients 
and higher derivatives are expressed in Tesla per meter or Gauss per centimeter. 
Frequently we find the need to perform numerical calculations with parameters 
given in different units. Some helpful numerical conversions from cgs to mks-units 
are compiled in Table 1.1. 

Similar conversion factors can be derived for electromagnetic quantities in 
formulas by comparisons of similar equations in the MKS and cgs-system. Table 1.2 
includes some of the most frequently used conversions. The absolute dielectric 
constant is 


10’ C C 
&) = —~—— = 8.854x 10°? — (1.2) 
4rnc* Vm Vm 
and the absolute is 
4x10? = 1.2566 x 10 (1.3) 
— IT a s. ——~ . 
“a Am Am 
Both constants are related by c?éo/4o = 1. Using these conversion factors it is 


possible to convert formulas in cgs units into the equivalent form for mks-units. 


1.3.2. Maxwell’s Equations 


Predictable control of charged particles is effected only by electric and magnetic 
fields and beam dynamics is the result of such interaction. We try to design and 


‘Because of its wide use, we use in rare cases the unit Gauss even though it is not a SI unit (1 
Gauss = 0.0001 Tesla = 0.1 mT). 
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Table 1.2 Conversion Replace cgs-parameter 

factors for equations Quantity by mks-parameter 
Potential Vegs V4 0 Vinks 
Electric field Ecgs V 41 €0E mks 
Current Togs Tes Tiles 
Current density Jegs Tea liaks 
Charge Aces Saat mks 
Charge density Pegs a Pmks 
Conductivity Oces Tes Omks 
Inductance Legs Ar eq Links 
Capacitance Cogs in Cinks 

“Magnetic field | Heys | Eats 

Magnetic induction Begs = Binks 


formulate electromagnetic fields in a way that can be used to accurately predict the 
behavior of charged particles. To describe the general interaction of fields based on 
electric currents in specific devices and free charged particles which we want to 
preserve, guide and focus, we use as a starting point Maxwell’s equations: 


V (cE) = a Coulomb’s law, 

VB = 0, 

V x E=— +B, Faraday’s law, (1.4) 
vx (18) =Moj + a (cE) . Ampére’s law, 


consistent with the SI-system of units by inclusion of the unit scale factors € 
and j49. The quantities « and py are the relative dielectric constant and magnetic 
permeability of the surrounding materials, respectively. Integration of one or the 
other of Maxwell’s equations results, for example, in the fields from singly charged 
particles or those of an assembly of particles travelling along a common path and 
forming a beam. Applying Maxwell’s equations, we will make generous use of 
algebraic relations which have been collected in Appendix A. 


1.4 Primer in Special Relativity 


In accelerator physics the dynamics of particle motion is formulated for a large 
variety of energies from nonrelativistic to highly relativistic values and the equations 
of motion obviously must reflect this. Relativistic mechanics is therefore a funda- 
mental ingredient of accelerator physics and we will recall a few basic relations 
of relativistic particle mechanics from a variety of more detailed derivations in 
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generally available textbooks. Beam dynamics is expressed in a laboratory by a 
fixed system of coordinates but some specific problems are better discussed in the 
moving coordinate system of particles. Transformation between the two systems is 
effected through a Lorentz transformation. 


1.4.1 Lorentz Transformation 


Physical phenomena can appear different for observers in different systems of 
reference. Yet, the laws of nature must be independent of the reference sys- 
tem. In classical mechanics, we transform physical laws from one to another 
system of reference by way of the Galileo transformation z* = z— vt assum- 
ing that one system moves with velocity v along the z-axis of the other sys- 
tem. 

As the velocities of bodies under study became faster, it became necessary 
to reconsider this simple transformation leading to Einstein’s special theory of 
relativity. Maxwell’s equations result in electromagnetic waves expanding at a finite 
velocity and do not contain any reference to a specific system of reference. Any 
attempt to find a variation of the “velocity of light” with respect to moving reference 
systems failed, most notably in Michelson’s experiments. The expansion velocity 
of electromagnetic waves is therefore independent of the reference system and is 
finite. 

Any new transformation laws must include the observation that no element of 
energy can travel faster than the speed of light. The new transformation formulae 
combine space and time and are for a reference system £L* moving with velocity 
v, = cB, along the z-axis with respect to the stationary system L. 


ba it 
y=y", 

1.5 
2 =yle+ 6, ct*) , (1.5) 
ct=y (B,2* + ct*), 


where the relativistic factor is 


1 
—— (1.6) 


with 
Bz = v-/c (1.7) 


and where all quantities designated with * are defined in the moving system L*. 
Of course, either system is moving relative to the other and we will use this 
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relativity in various circumstances depending on whether quantities are known in 
the laboratory or moving system. The Lorentz transformations can be expressed in 
matrix formulation by 


Xx 10 0 O a x 
y 01 0 =O y* y* 

= =M 1.8 
z 00 y +fy || z | # cL) 
ct 00+By y ct* ct 


and the inverse transformation is the same except that the velocity or 6 changes sign 
(v > —v). 
Lorentz Transformation of Fields 


Without proof, electromagnetic fields transform between reference systems in 
relative motion like 


E, y 0 0 0 +yf.0\ (ES 
Ey 0 y O-ypB, 0 O|] E 
E.{_} 0 0 1 0 0 Oj] & 6 
cBy 0 -yB,0 y 0 0 cB* 
cBy +yB, 0 0 O y 0 cBY 
cB. 0 000 0 1) \cB* 


Again, for the inverse transformation only the sign of the relative velocity must 
be changed, 8, — — 6,. According to this transformation of fields, a pure static 
magnetic field in the laboratory system L, for example, becomes an electromagnetic 
field in the moving system £*. An undulator field, therefore, looks to an electron like 
a virtual photon with an electromagnetic field like a laser field and both interactions 
can be described by Compton scattering. 


Lorentz Contraction 


Characteristic for relativistic mechanics is the Lorentz contraction and time dilata- 
tion, both of which become significant in the description of particle dynamics. To 
describe the Lorentz contraction, we consider a rod at rest in the stationary system 
£ along the z-coordinate with a length € = z) — z,. In the system £*, which 
is moving with the velocity v, in the positive z-direction with respect to L, the 
rod appears to have the length £* = z} — z}. By a Lorentz transformation we 
can relate that to the length in the £-system. Observing both ends of the rod at 
the same time the lengths of the rod as observed from both systems relate like 
=H -2 = (Gt vety) — v@q + vet) = ye or 


C= yl*. (1.10) 
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A rod at rest in system £ appears shorter in the moving particle system £L* by a 
factor y and is always longest in it’s own rest system. For example, the periodicity 
of an undulator A, becomes Lorentz contracted to Ap/y as seen by relativistic 
electrons. Because of the Lorentz contraction, the volume of a body at rest in the 
system £ appears also reduced in the moving system £* and we have for the volume 
of a body in three dimensional space 


V=yv*. (1.11) 


Only one dimension of this body is Lorentz contracted and therefore the volume 
scales only linearly with y. As a consequence, the charge density p of a particle 
bunch with the volume V is lower in the laboratory system £ compared to the density 
in the system moving with this bunch and becomes 


==. (1.12) 


Y 


Time Dilatation 
Similarly, we may derive the time dilatation or the elapsed time between two events 


occurring at the same point in both coordinate systems. Applying the Lorentz 
transformations we get from (1.5) with z} = zjf 


)-y(a+ Pet) (1.13) 


At = yAt*. (1.14) 


At=n-n=y(g+ 
c 


or 


For a particle at rest in the moving system £* the time f* varies slower than the 
time in the laboratory system. This is the mathematical expression for the famous 
twin paradox where one of the brothers moving in a space capsule at relativistic 
speed would age slower than his twin brother staying back. This phenomenon 
gains practical importance for unstable particles. For example, high-energy pions, 
observed in the laboratory system, have a longer lifetime by the factor y compared 
to low-energy pions with y = |. As a consequence, high energy unstable particles, 
like pions and muons, live longer and can travel farther as measured in the 
laboratory system, because the particle decay time is a particle property and is 
therefore measured in its own moving system. This is important. For example, in 
medical applications when a beam of pions has to be transported from the highly 
radioactive target area to a radiation free environment for the patient for cancer 
treatment. 
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1.4.2. Lorentz Invariance 


Briefly, we have to introduce 4-vectors, because they will make later discussions 
much easier and illuminate fundamental properties of synchrotron radiation which 
is emitted in the particle system, but observed in the laboratory system as we will 
see later in this section. Four-vectors have a special significance in physics. As their 
name implies, four physical quantities can form a 4-vector which has convenient 
properties when viewed in different reference systems. The components of space- 
time, for example, form a 4-vector § = (x, y, z, ict). To identify 4-vectors, we add a 
tilde § to the symbols. All true 4-vectors transform like the space-time coordinates 
through Lorentz transformations. 


a=M,a*. (1.15) 


Invariance to Lorentz Transformations 


The length of 4-vectors is the same in all reference systems and is therefore open to 
measurements and comparisons independent of the location of the experimenter. 
In fact, it can be shown (exercise) that even the product of two arbitrary 4- 
vectors is Lorentz invariant. Take two 4-vectors in an arbitrary frame of reference 
a* = (at a3 ,a3, iat) and b= (7, b3,b5, iby) and form the product a*b in com- 
ponent form. A Lorentz transformation on both 4-vectors gives a*b = ab, which 
is the same in any reference system and is therefore Lorentz invariant. Specifically, 
the length of any 4-vector is Lorentz invariant. 


Space-Time 


Imagine a light flash to originate at the origin of the coordinate system L(x, y, z). At 
the time ¢, the edge of this expanding light flash has expanded with the velocity of 
light to 


rYt+y42=c'P. (1.16) 


Observing the same light flash from a moving system, we apply a Lorentz 
transformation from the laboratory system £ to the moving system L* and get 


2 ea ang = er? (1.17) 


demonstrating the invariance of the velocity of light c as has been experimentally 
verified by Michelson and Morley in 1887. The velocity of light is the same in all 
reference systems and its value is 


c = 299,792,458 m/s. (1.18) 
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The components of the space-time 4-vector are 
§ = (X1,X2,%3,%4) = (x,y,z, ict) , (1.19) 


where the time component has been multiplied by c to give all components the same 
dimension. From the Lorentz invariant world time tT, defined as 


ct = V—-s" (1.20) 


we get 


cdt = yc? (dt)? — (dx) — (dy)? — (dz)? = je — (v2 + v2 + v2)dt 
= V2 — v2dt = V1 — B2cdr, (1.21) 


a relation, we know from the Lorentz transformation as time dilatation dt = + dr. 

Other 4-vectors can be formulated and often become relevant in accelerator 
physics as, for example, those listed below. More 4-vectors are listed in Appendix 
B. 


Four-Velocity 


A velocity 4-vector can be derived from the space-time 4-vector by simple 
differentiation 


ds ds 
d= — =y—=7(%3,zZ ic). 1.22 
BS eee) (1.22) 
Evaluating the square of the velocity 4-vector we find 3” = yv? — yc? = —c? in 


the rest frame and in any other reference frame. The velocity of light is the same in 
any reference system as experimentally verified by Michelson and Morley. 


Four-Acceleration 


From the velocity 4-vector, we derive the 4-acceleration 


en a8 d (ds sa5 yp 505 
— —_— —_— = Vv = 
oi a a ae a 


3 
+54 (va) (1.23) 
c 
or in component form @ = (a,., dy, Az, id,) , we get a, = y2ay + y*B, (Ba),.., a; = 
y* (Ba) where a = (X,},2) is the ordinary acceleration. The Lorentz invariance 
of a becomes important to describe the emission of synchrotron radiation from a 
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relativistic charged particle and observation in a laboratory reference frame. Con- 
versely, experimental verification of the theory of synchrotron radiation validates 
the invariance of a”. 


Momentum-Energy 4-Vector 


An important 4-vector is the 4-momentum or momentum-energy 4-vector defined 
by the canonical momentum cp and total energy E 


CP = (CPx, CPy, Cpz, iE). (1.24) 


The length of the energy-momentum 4-vector cp = (cp, cpy,cpz,iE) can be 
determined by going into the rest frame where the momentum is zero and we get 


2=2 


cp = cr + ce, + cp. — FP = —A’m', (1.25) 


where we have set Ey = Amc’ for a particle with atomic mass A. From this the total 
energy is 


B= p+ Amc, (1.26) 


demonstrating the experimentally verifiable fact that the particle mass is Lorentz 
invariant. 

We look now for an expression of (1.26) without the use of velocities and derive 
from the product of the velocity and momentum-energy 4-vectors 


(yv,iyc) (cp,iE) = yvep—cyE = —cAmc* (1.27) 
an expression for the momentum cp = ve tne since p||B . Inserting this into (1.26), 
2\2 
we get E2 = (a) + A2m?c*,and with By? = y? -—1 
a (1.28) 
y= Ame 


defining the relativistic factor y in terms of energies. Sometimes, authors attach 
this relativistic factor to the mass and assume thereby an increasing moving mass. 
Einstein’s point of view is expressed in the following quote: “It is not good to 
introduce the concept of the mass of a moving body M = ym for which no clear 
definition can be given. It is better to introduce no mass concept other than the ‘rest 
mass’ mo. Instead of introducing M it is better to mention the expression for the 
momentum and energy of a body in motion.” In this book, we take the rest mass mo 
as an invariant. 
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The total energy of a particle is given by 
E = yEy = yAmc’, (1.29) 


where Ey = Amc? is the rest energy of the particle and A the atomic mass. For 
electrons we assume that A = 1 and m = me. Since in this text we concentrate 
mainly on electrons and protons, we assume A = |. The kinetic energy is defined 
as the total energy minus the rest energy 


Exin = E — Ey = (y — 1)imc’. (1.30) 


The change in kinetic energy during acceleration is equal to the product of the 
accelerating force and the path length over which the force acts on the particle. 
Since the force may vary along the path we use the integral 


ABixs -| Fds (1.31) 


aCe 


to define the energy increase. The length L,,, is the path length through the 
accelerating field. In discussions of energy gain through acceleration, we consider 
only energy differences and need therefore not to distinguish between total and 
kinetic energy. The particle momentum finally is defined by 


cp’ = FE — Be (1.32) 


or 


cp = \/ E2 — E2 = mc? y/y? — 1 = yBmc? = BE, (1.33) 


where 6 = v/c. The simultaneous use of the terms energy and momentum might 
seem sometimes to be misleading as we discussed earlier. In this text, however, 
we will always use physically correct quantities in mathematical formulations even 
though we sometimes use the term energy for the quantity cp. In electron accelera- 
tors the numerical distinction between energy and momentum is insignificant since 
we consider in most cases highly relativistic particles. For proton accelerators and 
even more so for heavy ion accelerators the difference in both quantities becomes, 
however, significant. 

Often we need differential expressions or expressions for relative variations of 
a quantity in terms of variations of another quantity. Such relations can be derived 
from the definitions in this section. By variation of (1.33), for example, we get 


me> dE  dExin 
dy = = 
B B B 


dcp = (1.34) 
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and 


dcp _ 1 dy 


cp pty” 


Varying (1.32) and replacing dy from (1.33) we get 


dcp = y*mc? dB (1.35) 
and 
dep? i SE 
cp po 


Photon 4-Vector 


An analogous 4-vector can be formulated for photons using deBroglie’s relations 
p = hk and E = ho for ck = (cky, cky, ck,,i@) . Since the energy-momentum 
4-vector is derived from the canonical momentum, we will have to modify this 4- 
vector when electromagnetic fields are present. 


Force 4-Vector 
The force 4-vector is the time derivative of the energy-momentum 4-vector 


(cp, iE) , which is consistent with the observation (so far) that the rest mass does 
not change with time. 


Electro-magnetic 4-Vector 


The electromagnetic-potential 4-vector is (cA, i) . 


1.4.3 Spatial and Spectral Distribution of Radiation 


Of great importance in accelerator and synchrotron radiation physics is the Lorentz 
invariance of the product of two 4-vectors. Electromagnetic fields emanating from 
relativistic charges can be described by plane waves E* = Ej e'®” where @* = 
o*t* — k*n*r* is the phase of the wave in the particle system and is Lorentz 
invariant. This invariance stems from the fact that the phase can be formulated as 
the product of the photon and space-time 4-vectors 


cp +8 = [ckn,io] [z,ict] , (1.36) 
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where we have set k = nk with n being the unit vector in the direction of wave 
propagation. Using k = w/c the phase as measured in the laboratory £ is the same 
as that in the particle frame of reference £* 


wo” [(ntx* + ON i nzz*) —ct*| = [(nx + ny + nz) — ct] = invariant. 


To derive the relationships between similar quantities in both systems, we use 
the Lorentz transformation (1.8), noting that the particle reference frame is the 
frame, where the particle or radiation source is at rest, and replace the coordinates 
(x*, y*, z*, ct*) by those in the laboratory system for 


w* | (nix* + nty* + n¥z*) — ct*| 
= o* [ntx+ nye a, (e= Pye) =e Ppyek yct)| (1.37) 
=O [ (nx + nyy + n-Z) — ct] F 

from which one can isolate, for example, a relation between w* and w. Since 


the space-time coordinates are independent from each other, we may equate their 
coefficients on either side of the equation separately. 


Spectral Distribution 
In so doing, the ct-coefficients define the transformation of the oscillation frequency 
wo*y (1+ B.n*) =o, (1.38) 


which expresses the relativistic Doppler effect. Looking parallel and opposite to the 
direction of particle motion n*¥ = 1, the observed oscillation frequency is increased 
by the factor (1 + 6.) y * 2y for highly relativistic particles. The Doppler effect 
is reduced (red shifted) if the radiation is viewed at some finite angle © with 
respect to the direction of motion of the source. In these cases n* = cos0* 
and the frequency shift can be very large for highly relativistic particles with 
y >. 


Spatial Distribution 


Similarly, we obtain the transformation of spatial directions from 


fee ee ge (1.39) 
“ y(lt+ Barty)" y+ Baty (1+ Bent)” . 
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These transformations define the spatial distribution of radiation in the laboratory 
system. In case of transverse acceleration the radiation in the particle rest frame is 
distributed like cos* ©* about the direction of motion. This distribution becomes 
greatly collimated into the forward direction in the laboratory system. With n*? + 
n*? = sin’ @* and n2 + n? = sin? © ~ ©? and n* = cos O*, we find 


sin @* 


0 x ———_—___ 
y(1 + Bcos O*) 


(1.40) 


In other words, radiation from relativistic particles, emitted in the particle system 
into an angle —z/2 < ©* < z/2 appears in the laboratory system highly 
collimated in the forward direction within an angle of 


1 
AO +-. (1.41) 
Y 


This angle is very small for highly relativistic electrons like those in a storage 
ring, where y is of the order of 10°—10*. 


1.4.4 Particle Collisions at High Energies 


The most common use of high-energy particle accelerators has been for basic 
research in elementary particle physics. Here, accelerated particles are aimed at a 
target, which incidentally may be just another particle beam, and the researchers 
try to analyze the reaction of high-energy particles colliding with target particles. 
The available energy from the collision depends on the kinematic parameters of the 
colliding particles. We define a center of mass coordinate system which is the system 
that moves with the center of mass of the colliding particles. In this system the vector 
sum of all momenta is zero and is preserved through the collision. Similarly, the total 
energy is conserved and we may define a center of mass energy the same way the 
rest energy of a single particle is defined by 


2 


Een = (<8) -()o,<) » (1.42) 


where the summation is taken over all particles forming the center of mass system. 
The center of mass energy includes all old particle masses but also new masses 
of new particles which have not been there before. We apply this to two colliding 
particles with masses m, and my and velocities vy and v2, respectively, 


(m,v1) —> <—_ (m,Vv2). 
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The center of mass energy for this system of two colliding particles is then 


2 2 2 2 
Eom = bs (Exin + me) - bp | (1.43) 


i=l i=l 
= (yim + y2m)? cf (vi Bim + y2B2m2)° cA 


We apply these kinematic relations to a proton (m, = Mp) of energy y colliding with 
a proton at rest in a target. For a target proton at rest with yy = 1, mz = mp, B2 = 0 


and By = ./y? — 1, the center of mass energy is 
Eom = (y + mpc! = (7? = Dmge 
or after some manipulations 


Eon = ¥2(y + Ipc’. (1.44) 


The available energy for high-energy reactions after conservation of energy and 
momentum for the whole particle system is the center of mass energy minus 
the rest energy of the particles that need to be conserved. If, for example, two 
protons collide, high-energy physics conservation laws tell us that the hadron 
number must be conserved and therefore the reaction products must include two 
units of the hadron number. In the most simple case the reaction will produce just 
two protons and some other particles with a total energy equal to the available 
energy 


Enysit = Eom — 2iyc? = | v2 +h 2 Mpc? . (1.45) 


The energy available from such reactions increases only like the square root of the 
energy of the accelerated particle which makes such stationary target physics an 
increasingly inefficient use of high-energy particles. A significantly more efficient 
way of using the energy of colliding particles can be obtained by head on collision 
of two equal particles of equal energy. In this case y) = y2 = y, the mass 
of the colliding particles is my) = mz, = mp, and Bj = —f. = 8B. In 
this case, the center of mass energy is simply twice the energy of each of the 
particles 


Bon = Qyne = 25. (1.46) 


In colliding beam facilities, where particles collide with their antiparticles no parti- 
cle type conservation laws must be obeyed and therefore the total energy of both par- 
ticles becomes available for the production of new particles at the collision point. In 
a similar way we may calculate the available energy for a variety of collision scenar- 
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ios like the collision of an accelerated electron with a stationary proton, the head on 
collision of electrons with protons or collisions involving high-energy heavy ions. 


1.5 Principles of Particle-Beam Dynamics 


Accelerator physics relates primarily to the interaction of charged particles with 
electromagnetic fields. Detailed knowledge of the functionality of this interaction 
allows the design of accelerators meeting specific goals and the prediction of 
charged particle beam behavior in those accelerators. The interplay between 
particles and fields is called beam dynamics. In this section, we recall briefly 
some features of electromagnetic fields and fundamental processes of classical and 
relativistic mechanics as they relate to particle beam dynamics. 


1.5.1 Electromagnetic Fields of Charged Particles 


Predictable control of charged particles is effected only by electric and magnetic 
fields and beam dynamics is the result of such interaction. We try to design and 
formulate electromagnetic fields in a way that can be used to accurately predict the 
behavior of charged particles. To describe the general interaction of fields based on 
electric currents in specific devices and free charged particles which we want to 
preserve, guide and focus, we use as a starting point Maxwell’s equations (1.4). 


Electric Field of a Point Charge 


First, we apply Gauss’ theorem to a point charge g at rest. The natural coordinate 
system is the polar system because the fields of a point charge depend only on the 
radial distance from the charge. We integrate Coulomb’s law (1.4) over a spherical 


volume containing the charge q at its center. With dV = 4ar7dr the integral 
becomes { VEdV= i 42 (r°E,)dV = 42R’E, (R), where R is the radial distance 


from the charge. On the r.h.s. of Coulomb’s law (1.4), an integration over all the 
charge q gives f zee" = ee and the electric field of a point charge at distance 
Ris 


1 q 


E, (R) = =. 
Se 4meye R? 


(1.47) 


The electric field is proportional to the charge and decays quadratically with 
distance R. 
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Fields of a Charged Particle Beam 


Many charged particles, travelling along the same path form a beam. This particle 
beam generates an electric as well as a magnetic field. The proper coordinates are 
now cylindrical and Coulomb’s law is 


lo 1 0E dE, lo 
VE = -—(rE,) + e+ = (rE) =, (1.48) 
r or r 0p Oz r or €0€ 
——’ —_— 
=0 =0 


where (op is the charge density in the particle beam. We assume a uniform continuous 
beam and expect therefore no azimuthal or longitudinal dependence, leaving only 
the radial dependence. Radial integration over a cylindrical volume of unit length 
collinear with the beam gives with the volume element dV = 2zrdr, on the 1.h.s. 
|rE,|9 27. The r.h.s. is aur and the electric field for a uniformly charged particle 
beam with radius R is 


E,(r) oer forr<R (1.49) 
r= 2 . . 
pee forr>R 


The magnetic field for the same beam can be derived by applying Stoke’s theorem 
on Ampere’s law to give after integration 


5 Loljor forr<R 


By (r) = | (1.50) 


5 Hopjo® forr>R 


The fields increase linearly within the beam and decay again like 1/r outside 
the beam. Real particle beams do not have a uniform distribution and therefore a 
form function must be included in the integration. In most cases, the radial particle 
distribution is bell shaped or Gaussian. Both distributions differ little in the core of 
the beam and therefore a convenient assumption is that of a Gaussian distribution 
for which the fields will be derived in Problem 1.3. 


1.5.2. Vector and Scalar Potential 


By virtue of Maxwell’s equation VB = 0 onecan derive the magnetic field from 
a vector potential A defined by B = V x A.Faraday’s law can be used to derive 
also the electric field from potentials. The equation V x E = —2B a —# (V x A) 
can be written like V x (E + A) = 0, and solved by E = — iA — VV, where we 


added the gradient of a scalar potential function V which does not alter the validity 
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of Maxwell’s equations for all fields so defined. To summarize, both, electric and 
magnetic fields can be derived from a scalar V and vector A potential 


B=VxA-VV, (1.51) 
0A 

E=-—-—VV. (1.52) 
ot 


These definitions of the magnetic and electric fields from potentials will not alter 
the validity of Maxwell’s equations as can be verified by backinsertion. 


1.5.3 Wave Equation 


From Ampere’s law both the vector and scalar potentials can be derived. Replacing 
in Ampére’s law V x B = pow j + SE the fields with their expressions in terms of 


potentials, we get V x (V x A) = Lowy + + (-4 a vv) and with Vx (VxA) = 
V (VA) —V7A 


A ELL - 
Via = =f + V(VA+5V). (1.53) 
ee 
=0 


At this point, we specify the potential function V such that it meets the condition 


va+ “v=o (1.54) 
a 


thereby simplifying greatly (1.53) and separating both potentials. This condition is 
called the Lorenz gauge and the resulting wave equation is 


‘A 
V7A — —— —— = —popj. (1.55) 


The vector potential is clearly defined by the placement of electrical currents j. 
We will use this property later in the design of, for example, magnets for particle 
beam guidance. Similarly, the wave equation for the scalar potential is 


os ee (1.56) 
Ce 


Knowledge of the placement of electrical charges defines uniquely the scalar 
potential function. However, because the velocity of electro-magnetic waves is 
finite, the potentials at the observation point depend on the charges and currents 
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at the retarded time, e.g. the location when the electro-magnetic waves have been 
emitted. The second order differential equations (1.55), (1.56) can be integrated 
readily and the potentials are 


Mok [ JQ y.2) 
A(P,j)=2" | Le | dxdydz 1.57 
(P, t) ax J RGy.D|, ys (1.57) 
and 
1 P(x, y, 2) 
V(P,t) = dxdydz. 1.58 
Pp Amee / R(X, Y, 2) |p, ? ee 


Integration over all currents and charges at the retarded distance R,, from the 
observation point P results in the definition of the vector and scalar potential at 
the point P. Both electric and magnetic fields may be derived as discussed in the last 
section. 

The wave equation just derived has special relevance for static fields where the 
Lorenz gauge reduces to the Coulomb gauge 


VA =0 (1.59) 


and (1.55) and (1.56) reduce in a charge and current free environment to the Laplace 
equation being equal to zero 


AA = 0, (1.60) 
AV = 0. 


Static magnetic and electric fields used in beam dynamics will be derived from 
theses potentials being solutions of the Laplace equation. 


Lienard-Wiechert Potentials 


For a point charge e at rest, we can integrate (1.57) readily to get A(R, t) = 0 and 
V(R,t) = moor On the other hand, in case of a moving point charge the potentials 
cannot be obtained by simply integrating over the “volume” of the point charge. The 
motion of the charge must be taken into account and the result of a proper integration 
(see Chap. 25) are the Liénard-Wiechert potentials for moving charges [31, 32] 


Hone q_ B 
A(R,t) = = 1.61 
Ge) 4n R1+nB é ( ) 
and 
ee (1.62) 
, 4re9 R1+nB|,- , 
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These potentials describe the radiation fields of synchrotron radiation being emitted 
from relativistic electrons. 


1.5.4 Induction 


Applying Stokes’ theorem to Faraday’s law (1.4), we get on the L.h.s. a line integral 
along the boundaries of the surface area S, which is equivalent to a voltage. On the 
r.h.s. the magnetic flux passing through the surface S is integrated and 


OB 0® 


[tv x tlaa = Bas = - eae (1.63) 


By virtue of Faraday’s law, the time varying magnetic flux ® through the area S 
generates an electromotive force along the boundaries of S. In accelerator physics 
this principle is applied in the design of a betatron. Similarly, from the second term 
on the right hand side in Ampére’s law (1.4), we get a magnetic induction from a 
time varying electric field. Both phenomena play together to form the principle of 
induction or, in a particular example, that of a transformer. 


1.5.5 Lorentz Force 


The trajectories of charged particles can be influenced only by electric and magnetic 
fields through the Lorentz force 


F, = qE+q(vxB). (1.64) 


Guiding particles through appropriate electric or magnetic fields is called particle 
beam optics or beam dynamics. Knowledge of the location and amplitudes of 
electric and magnetic fields allows us to predict the path of charged particles. Closer 
inspection of (1.64) shows that the same force from electric or magnetic fields can be 
obtained if E = vB, where we have assumed that the particle velocity is orthogonal 
to the magnetic field, v LB. For relativistic particles v ~ c and to get the same force 
from an electric field as from, say a 1 Tesla magnetic field, one would have to have 
an unrealistic high field strength of E ~ 300 MV/m. For this reason, magnetic fields 
are used to deflect or focus relativistic charged particles. For sub-relativistic particles 
like ion beams with velocities v < c, on the other hand, electric fields may be more 
efficient. 
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1.5.6 Equation of Motion 


Accelerator physics is to a large extend the description of charged particle dynamics 
in the presence of external electromagnetic fields or of fields generated by other 
charged particles. We use the Lorentz force to formulate particle dynamics under 
the influence of electromagnetic fields. Whatever the interaction of charged particles 
with electromagnetic fields and whatever the reference system may be, we depend in 
accelerator physics on the invariance of the Lorentz force equation under coordinate 
transformations. All acceleration and beam guidance in accelerator physics will be 
derived from the Lorentz force. For simplicity, we use throughout this text particles 
with one unit of electrical charge e like electrons and protons. In case of multiply 
charged ions the single charge e must be replaced by eZ where Z is the charge 
multiplicity of the ion. Both components of the Lorentz force are used in accelerator 
physics where the force due to the electrical field is mostly used to actually increase 
the particle energy while magnetic fields are used to guide particle beams along 
desired beam transport lines. This separation of functions, however, is not exclusive 
as the example of the betatron accelerator shows where particles are accelerated by 
time dependent magnetic fields. Similarly electrical fields are used in specific cases 
to guide or separate particle beams. 

Relating the Lorentz force to particle momentum or kinetic energy, we know 
from definitions in classical mechanics that 


Ap = f Fat 
Acp = AEkin , 1.65 
AEkin = [ Fuds Pmt pa@ ise?) 
where B = v/c. The Lorentz force can be expressed in terms of fields and the 
change of kinetic energy becomes 
Abin = [ Pas =4 f E+ x B)| ds (1.66) 


= 4 | Bas +4 f wx B) ver, 
—S E—=_ 


=0 


which indicates that an electric field component in the direction of particle motion 
does increase the particle’s kinetic energy, while the magnetic field does not 
contribute any acceleration. Magnetic fields are used only to deflect a particle’s path 
by changing the direction of its momentum vector. 

It becomes obvious that the kinetic energy of a particle changes whenever it 
travels in an accelerating electric field E and the acceleration occurs in the direction 
of the electric field. This acceleration is independent of the particle velocity and 
acts even on a particle at rest v = 0. The second component of the Lorentz force 
in contrast depends on the particle velocity and is directed normal to the direction 
of propagation and normal to the magnetic field direction. We find therefore from 
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(1.66) the result that the kinetic energy is not changed by the presence of magnetic 
fields since the scalar product (v x B) v = O vanishes. The magnetic field causes 
only a deflection of the particle trajectory. 

The Lorentz force (1.64) in conjunction with (1.65) is used to derive the equation 
of motion for charged particles in the presence of electromagnetic fields 


d 
° = _ (Amy) = eZE + eZ(v x B), (1.67) 


where Z is the charge multiplicity of the charged particle and A the atomic mass. For 
simplicity we drop from here on the factors A and Z since they are different from 
unity only for ion beams. For ion accelerators we note therefore that the particle 
charge e must be replaced by eZ and the mass by Am. 

Both relations in (1.65) can be used to describe the effect of the Lorentz force on 
particles. However, ease of mathematics makes us use one or the other. We use the 
first equation for dynamics in magnetic fields and the second for that in accelerating 
fields. Since the energy or the particle velocity does not change in a magnetic field it 
is straightforward to calculate Ap. On the other hand, accelerating fields do change 
the particle’s velocity which must be included in the time integration to get Ap. 
Calculating AE\in, we need to know only the spatial extend and magnitude of the 
accelerating fields to perform the integration. 

The particle momentum p = ymv and it’s time derivative 


dp dv dy 
= ; iE 
dpa ae eas 
With 
d d, d d 
Y_ B -_ ae U (1.69) 
dt dB dt c dt 
we get from (1.68) the equation of motion 
dp dv 3B dv 
F=—= : 1.70 
ae ae” oe ae 


For a force parallel to the particle propagation v, we have vv = vv and (1.70) 
becomes 


oP 
dt 


dv 
er B 
dt dt 


v 
= my (1+ 776") (1.71) 
c 
On the other hand, if the force is directed normal to the particle propagation, we 
have du/dt = 0 and (1.70) reduces to 


Pi _ ,,, 84 


—. 1.72. 
dt - dt ( ) 
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It is obvious from (1.71) and (1.72) how differently the dynamics of particle 
motion is affected by the direction of the Lorentz force. Specifically the dynamics 
of highly relativistic particles under the influence of electromagnetic fields depends 
greatly on the direction of the force with respect to the direction of particle 
propagation. The difference between parallel and perpendicular acceleration will 
have a great impact on the design of electron accelerators. As we will see later, the 
acceleration of electrons is limited due to the emission of synchrotron radiation. 
This limitation, however, is much more severe for electrons in circular accelerators 
where the magnetic forces act perpendicularly to the propagation compared to the 
acceleration in linear accelerators where the accelerating fields are parallel to the 
particle propagation. This argument is also true for protons or for that matter, 
any charged particle, but because of the much larger particle mass the amount of 
synchrotron radiation is generally negligibly small. 


1.5.7 Charged Particles in an Electromagnetic Field 


An electromagnetic field exerts a force on a charged particle. A magnetic field 
or transverse electric field can deflect the beam and we use magnets as guiding 
and focusing elements for particle beam dynamics. This dynamics guides the 
particles on a path which is in equilibrium between the Lorentz force and the 
centrifugal force. A charged particle in a magnetic field follows a path defined by 
the equilibrium between centrifugal and Lorentz force 


ymv> 


n+e[v xB] =0, (1.73) 


where n is the unit vector in the direction of the centrifugal force, 1/p the local 
curvature and m the mass of the particle with charge e. For a magnetic field 
orthogonal to the velocity vector of the particle the vector product is always parallel 
and opposite to m and (1.73) reduces to 

ymv 


p 


=—evB,, (1.74) 


with the local bending radius 


1 B B 
Sea (1.75) 
p BE vr cp 


The plane of the particle path is orthogonal to the transverse magnetic field. In a 
uniform magnetic field the particle follows the path of an arc with radius 


Pa) gg 2 ll 


a" = ee icy] © 8 Sievi 


(1.76) 


Dlr 
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in more practical units.We have a similar situation with respect to a transverse 
electrical field. Here, the centrifugal force is now 


2) 
+1 =0 (177) 


or 


i E E 
is eR es (1.78) 
p yc? B? B? Evot 


or in more practical units 


1, 4, E. [Vim] 
Fs i= Bop [GV] (1.79) 


Here, some caution is appropriate, because during the deflection the unit vector 
n is changing direction while the electric field may not change direction as in the 
case of a field between parallel straight plates. However, if the electrodes are bend 
along the expected particle path, the direction of the electric field is changing with 
n or the deflection of the beam. 


1.5.8 Linear Equation of Motion 


We have now all ingrediences to formulate an equation of motion in linear 
approximation. Analytical geometry tells us that the curvature is given in cartesian 
coordinates by 


— x" 


kK = —_.. (1.80) 
V1 + x2 


This equation can be simplified if we assume that x’ ~ 0. We recognize this from 
light optics as the paraxial approximation where all trajectories or rays are assumed 
to be close to the optical axis. This approximation suits beam dynamics very well 
since we try hard to keep all particles within a rather narrow vacuum chamber. 
Therefore (1.80) reduces with (1.75) to 


Ke —x" =, (1.81) 


The magnetic fields will have two main components, the guiding field for bending 
and a focusing field. Both fields together can be expressed by B, = Boy + gx, where 
Boy is the bending field and g the field gradient g = 0B,/0x. The particle beam is not 
perfectly monochromatic and we account for this by expanding the particle energy 
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1 1 , 7 , 

to first order = & 5 1 - 6), where 6 = Ap/po.With this we get the equation of 

motion 


ecBy ec 
p23 = 18) (By ee) 
cp cPo 


or keeping only linear terms in x and 6 


1 1 
x" + kx =-— + —6. (1.82) 
Po Po 
Here we have introduced the quadrupole focusing strength k = an BX and the 


bending radius is taken to be in the horizontal plane. The solution of this equation 
of motion will be very complicated due to the arbitrary layout of the beam transport 
line or fo (z). we are not interested in a mathematical formulation of this layout, 
but are interested only on the deviation of a particle from the desired transport line 
layout as defined by the location of magnets. We may transform away the beam line 
layout by merely dropping the = term from (1.82) to get finally the linear equation 
of motion for particle dynamics 


re eae 9 (1.83) 
po (z) 


Later we will introduce this coordinate system rigorously. This looks basically 
like the differential equation of a harmonic oscillator if it were not for the fact 
that the magnet strengths are functions of z.However, the solutions will be of 
oscillatory nature describing the particle motion in the restoring fields of the 
focusing devices. Actual analytical solutions will be discussed in great detail later 
in this text. 


1.5.9 Energy Conservation 


The rate of work done in a charged particle-field environment is defined by the 
Lorentz force and the particle velocity FLy = eHv + e(v x B)v. Noting that 
(v x B)v = 0, we set eEv = jE, and the total rate of work done by all particles 
and fields can be obtained by integrating Ampére’s law(1.4) over all currents and 
fields 


/ jEdV = ee / ‘G (V x B) -#) Edv. (1.84) 
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With the vector relation V(a x b) = b(V x a) —a(V xb) 


| sea = ae [ [2B VxE —c*V (E x B) — EE] dv (1.85) 


=-B 
d 
= -| E + e9ec2V (E <B)| dV, 
where an energy density has been defined by 
u = te9 (E+ cB’). (1.86) 


Applying Gauss’s theorem to the vector product in (1.85), we get an expression 
for the energy conservation of the complete particle-field system 


< / udV + | jEdV + p Snda =0. (1.87) 
ate ents patna! Seca poieane” 

change of particle energy radiation loss through 

field energy loss or gain closed surface a 


This equation expresses the conservation of energy relating the change in field 
energy and particle acceleration with a new quantity describing energy loss or gain 
through radiation. 


Poynting Vector 


The third integral in (1.87) is performed over a surface enclosing all charges and 
currents considered. The Poynting vector S is the energy loss/gain through a unit 
surface element in the direction of the unit vector n normal to the surface defined by 


1 
S= — [Ex B)]. (1.88) 
Mop 


Equation (1.88) exhibits characteristic features of electromagnetic radiation. 
Both, electric and magnetic radiation fields are orthogonal to each other (E LB), 
orthogonal to the direction of propagation (ELn, Bn), and the vectors E, B, S 
form a right handed orthogonal system. For plane waves n x E = cB and 


1 
S= E’n. (1.89) 
CHolL 


Knowing the electric fields we may determine the Poynting vector describing 
electro-magnetic waves or synchrotron radiation. 
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1.5.10 Stability of a Charged-Particle Beam 


Individual particles in an intense beam are under the influence of strong repelling 
electrostatic forces creating the possibility of severe stability problems. Particle 
beam transport over long distances could be greatly restricted unless these space- 
charge forces can be kept under control. First, it is interesting to calculate the 
magnitude of the problem. 

If all particles would be at rest within a small volume, we would clearly expect 
the particles to quickly diverge from the center of charge under the influence of 
the repelling space charge forces from the other particles. This situation may be 
significantly different in a particle beam where all particles propagate in the same 
direction. We will therefore calculate the fields generated by charged particles in 
a beam and derive the corresponding Lorentz force due to these fields. Since the 
Lorentz force equation is invariant with respect to coordinate transformations, we 
may derive this force either in the laboratory system or in the moving system of the 
particle bunch. 

From (1.49) and (1.50) we determine the Lorentz force due to electro-magnetic 
fields generated by the beam itself and acting on a particle within that beam. From 
(1.49) and (1.50) we get 


e po 
F, = e(E,— vBy) = Dene v2” (1.90) 


Only the radial component of the Lorentz force is finite. The Lorentz force remains 
repelling but due to a relativistic effect we find that the repelling electrostatic 
force at higher energies is increasingly compensated by the magnetic field. The 
total Lorentz force due to space charges therefore vanishes like y~? for higher 
energies. Obviously this repelling space charge force is much stronger for proton 
and especially for ion beams because of the smaller value for y and, in the case 
of ions, because of the larger charge multiplicity which increases the space-charge 
force by a factor of Z. 

We find the same result if we derive the Lorentz force in the moving system L* 
of the particle beam and then transform to the laboratory system. In this moving 
system we have obviously only the repelling electrostatic force since the particles 
are at rest and the only field component is the radial electrical field which is from 
(1.49) 


F* = eE* = ——ppr*. (1.91) 


Transforming this equation back into the laboratory system we note that the force 
is purely radial and therefore acts only on the radial momentum. With F, = dp,/dt 
and p, = p* we find F* = yF, since dt = yd¢*. The charge densities in both 
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systems are related by p* = p/y, the radii by r* = r, and the Lorentz force in the 
laboratory system becomes thereby 


__& 
" Qene y? 


(1.92) 


in agreement with (1.90). 

We obtained the encouraging result that at least relativistic particle beams 
become stable under the influence of their own fields. For lower particle energies, 
however, significant diverging forces must be expected and adequate focusing 
measures must be applied. The physics of such space charge dominated beams is 
beyond the scope of this book and is treated elsewhere, for example in considerable 
detail in [33]. 


Problems 


1.1 (S). Use the definition for 6, the momentum, the total and kinetic energy and 
derive expressions p (f, Fxin), Pp (Ekin) and Exin (vy). Simplify the expressions for 
very large energies, y >> 1. Derive from these relativistic expressions the classical 
nonrelativistic formulas. 


1.2 (S). Prove the validity of the field equations E, = in por and By = 5 LLoBpor 
for a uniform cylindrical particle beam with constant charge density oo within a 
radius r < R. Derive the field expressions for r > R. 


1.3 (S). Derive the electric and magnetic fields of a beam with a radial charge 
distribution p(r,~,z) = p(r). Derive the field equations for a Gaussian charge 
distribution with standard deviation o given by p(r) = po exp[—r’/ (207) |. What 
are the fields for r = 0 andr = 0? 


1.4 (S). A circular accelerator with a circumference of 300m contains a uniform 
distribution of singly charged particles orbiting with the speed of light. If the 
circulating current is 1 amp, how many particles are orbiting? We instantly turn on 
an ejection magnet so that all particles leave the accelerator during the time of one 
revolution. What is the peak current at the ejection point? How long is the current 
pulse duration? If the accelerator is a synchrotron accelerating particles at a rate of 
10 acceleration cycles per second, what is the average ejected particle current? 


1.5 (S). A proton with a kinetic energy of 1 eV is emitted parallel to the surface 
of the earth. What is the bending radius due to gravitational forces? What are the 
required transverse electrical and magnetic fields to obtain the same bending radius? 
What is the ratio of electrical to magnetic field? Is this ratio different for a proton 
energy of say 10 TeV? Why? (gravitational constant 6.67259 x 10~!! m3kg™!s~). 
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1.6 (S). Consider a highly relativistic electron bunch of n = 10!° uniformly 
distributed electrons. The bunch has the form of a cylindrical slug, £ = 1 mm long 
and a radius of R = 0.1 zm. What is the electrical and magnetic field strength at 
the surface of the beam. Calculate the peak electrical current of the bunch. If two 
such beams in a linear collider with an energy of 500 GeV pass by each other at a 
distance of 101m (center to center), what is the deflection angle of each beam due 
to the field of the other beam? 


1.7 (S). Show that for plane wavesn x E = cB. 
1.8 (S). Show that the product of two 4-vectors is Lorentz invariant. 
1.9 (S). Prove that the 4-acceleration is indeed given by (1.23). 


1.10 (S). Using 4-vectors, derive the frequency of an outgoing photon from a head- 
on Compton scattering process of an electron with a photon of frequency wr. 


1.11 (S). Using 4-vectors, derive the frequency of an outgoing photon from a head- 
on Compton scattering process of an electron with the field of an undulator with 
period Ay. 


1.12. Protons are accelerated to a kinetic energy of 200 MeV at the end of the 
Fermilab Alvarez linear accelerator. Calculate their total energy, their momentum 
and their velocity in units of the velocity of light (mpc? = 938.27 MeV). 


1.13. Consider electrons to be accelerated in the L = 3km long SLAC linear 
accelerator with a uniform gradient of 20 MeV/m. The electrons have a velocity 
v= $c at the beginning of the linac. What is the length of the linac in the rest frame 
of the electron? Assume the particles at the end of the 3 km long linac would enter 
another 3 km long tube and coast through it. How long would this tube appear to be 
to the electron? 


1.14 (S). A positron beam of energy E accelerated in a linac hits a fixed hydrogen 
target. What is the available energy from a collision with a target electron assumed 
to be at rest? Compare this available energy with that obtained in a linear collider 
where electrons and positrons from two similar linacs collide head on at the same 
energy. 


1.15 (S). The SPEAR colliding beam storage ring has been constructed originally 
for electron and positron beams to collide head on with an energy of up to 3.5 GeV. 
At 1.55 GeV per beam a new particle, the w/J-particle, was created. In a concurrent 
fixed target experiment at BNL, such w/J-particle have been produced by protons 
hitting a hydrogen target. What proton energy was required to produce the new 
particle? Determine the positron energy needed to create w/J-particles by collisions 
with electrons in a fixed target. 


1.16. A charged pion meson has a rest energy of 139.568 MeV and a mean life time 
of to, = 26.029ns in its rest frame. What are the life times t,,, if accelerated to a 
kinetic energy of 20 MeV? and 100 MeV? A pion beam decays exponentially like 
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e—'/™ At what distance from the source will the pion beam intensity have fallen to 
50 %, if the kinetic energy is 20 MeV? or 100 MeV? 


1.17 (S). Assume you want to produce antiprotons by accelerating protons and 
letting them collide with other protons in a stationary hydrogen target. What is the 
minimum kinetic energy the accelerated protons must have to produce antiprotons? 
Use the reactionp +p—>p+p+pt+p. 


1.18. Use the results of Problem 1.3 and consider a parallel beam at the beginning 
of a long magnet free drift space. Follow a particle under the influence of the beam 
self fields starting at a distance r9 = o from the axis. Derive the radial particle 
distance from the axis as a function of z. 


1.19. Show that (1.57) is indeed a solution of (1.55). 


1.20. Express the equation of motion (1.67) for Z = 1 in terms of particle 
acceleration, velocity and fields only. Verify from this result the validity of (1.71) 
and (1.72). 


1.21. Plot on log-log scale the velocity 6, total energy as a function of the kinetic 
energy for electrons, protons, and gold ions Aut!*. Vary the total energy from 
0.01mc? to 10+mc?. Why does the total energy barely change at low kinetic energies. 


1.22. The design for a Relativistic Heavy Ion Collider calls for the acceleration of 
completely ionized gold atoms in a circular accelerator with a bending radius of 
p = 242.78 m and superconducting magnets reaching a maximum field of 34.5 kg. 
What is the maximum achievable kinetic energy per nucleon for gold ions Aut”” 
compared to protons? Calculate the total energy, momentum, and velocity of the 
gold atoms (A,y=197). 


1.23. Gold ions Aut! are injected into the Brookhaven Alternating Gradient 
Synchrotron AGS at a kinetic energy per nucleon of 72 MeV/u. What is the velocity 
of the gold ions? The AGS was designed to accelerate protons to a kinetic energy 
of 28.1 GeV. What is the corresponding maximum kinetic energy per nucleon for 
these gold ions that can be achieved in the AGS? The circulating beam is expected 
to contain 6- 10° gold ions. Calculate the beam current at injection and at maximum 
energy assuming there are no losses during acceleration. The circumference of 
the AGS is Cags = 807.1m. Why does the beam current increase although the 
circulating charge stays constant during acceleration? 


1.24. Particles undergo elastic collisions with gas atoms. The rms multiple scatter- 
ing angle is given by og + Z ee Je , where Z is the charge multiplicity of the 


beam particles, s the distance travelled and ¢, the radiation length of the scattering 
material (for air the radiation length at atmospheric pressure is £- = 500m or 
60.2 g/cm”). Derive an approximate expression for the beam radius as a function of s 
due to scattering. What is the approximate tolerable gas pressure in a proton storage 
ring if a particle beam is supposed to orbit for 20h and the elastic gas scattering 
shall not increase the beam size by more than a factor of two during that time? 


References 41 


R 


Te 
8. 
9. 
10. 


11 


12. 
13. 
14. 
15. 
16. 
I. 
18. 
19. 
20. 
21. 
22. 


23 


24. 
25. 
26. 
27. 


28. 
29. 


30 
31 
32 
33 


eferences 


. M. Faraday, Poggendorf Ann. 48, 430 (1839) 

. J. Pluecker, Ann. Phys. Chem. 13, 88 (1858) 

. J.W. Hittorf, Ann. Phys. Chem. 5 Reihe 16, | (1869) 

. E. Goldstein, Monatsberichte der KGniglich Preussischen Akademie der Wissenschaften, p. 
284 (1876) 

. E. Wiedemann, Z. Elektrochem. 8, 155 (1895) 

. J.J. Thomson, Conduction of Electricity Through Gases, vol. 161 (Cambridge University Press, 

Cambridge, 1906), p. 602 

G.A. Schott, Ann. Phys. 24, 635 (1907) 

G.A. Schott, Phil. Mag. [6] 13, 194 (1907) 

H. Greinacher, Z. Phys. 4, 195 (1921) 

G. Ising, Ark. Mat. Astron. Fys. 18, 1 (1924) 

. R. Wideroe, Arch. Elektrotech. 21, 387 (1928) 

R.J. Van de Graaff, pr 38, 1919 (1931) 

E.O. Lawrence, M.S. Livingston, pr 40, 19 (1932) 

J.D. Cockcroft, E.T.S. Walton, Proc. Roy. Soc A 136, 619 (1932) 

D.W. Kerst, R. Serber, Phys. Rev. 60, 47 (1941) 

D. Ivanenko, A.A. Sokolov, DAN(USSR) 59, 1551 (1972) 

J.S. Schwinger, On the classical radiation of accelerated electrons. Phys. Rev. 75, 1912 (1949) 

V.I. Veksler, DAN(USSR) 44, 393 (1944) 

E.M. McMillan, Phys. Rev. 68, 143 (1945) 

J.P. Blewett, Phys. Rev. 69, 87 (1946) 

L.W. Alvarez, Phys. Rev. 70, 799 (1946) 

E.L. Ginzton, W.W. Hansen, W.R. Kennedy, Rev. Sci. Inst. 19, 89 (1948) 

. N. Christofilos, US Patent No 2,736,766 (1950) 

M.S. Livingston, J.P. Blewett, G.K. Green, L.J. Haworth, Rev. Sci. Instrum. 21, 7 (1950) 

H. Motz, J. Appl. Phys. 22, 527 (1951) 

E.D. Courant, M.S. Livingston, H.S. Snyder, Phys. Rev. 88, 1190 (1952) 

M. Chodorow, E.L. Ginzton, W.W. Hansen, R.L. Kyhl, R. Neal, W.H.K. Panofsky, Rev. Sci. 

Instrum. 26, 134 (1955) 

M. Sands, Phys. Rev. 97, 470 (1955) 

E.D. Courant, H.S. Snyder, Appl. Phys. 3, 1 (1959) 

. W. Scharf, Particle Accelerators and Their Uses (Harwood Academic, New York, 1985) 

. A. Liénard, L’Eclaire Electrique 16, 5 (1898) 

. E. Wiechert, Arch. Neerl. 5, 546 (1900) 

. M. Reiser, Theory and Design of Charged Particle Beams (Wiley, New York, 1994) 


Chapter 2 
Linear Accelerators 


Before we address the physics of beam dynamics in accelerators it seems appropri- 
ate to discuss briefly various methods of particle acceleration as they have been 
developed over the years. It would, however, exceed the purpose of this text to 
discuss all variations in detail. Fortunately, extensive literature is available on a 
large variety of different accelerators and therefore only fundamental principles of 
particle acceleration shall be discussed here. A valuable source of information for 
more detailed information on the historical development of particle accelerators is 
Livingston’s collection of early publications on accelerator developments [8]. 

The development of charged-particle accelerators has progressed along double 
paths which by the appearance of particle trajectories are distinguished as linear 
accelerators and circular accelerators. Particles in linear accelerators travel on 
a straight line and pass only once through the accelerator structure while in a 
circular accelerator they follow a closed orbit periodically for many revolutions 
accumulating energy at every passage of the accelerating structure. 


2.1 Principles of Linear Accelerators 


No fundamental advantage or disadvantage can be claimed for one or the other 
class of accelerators. It is mostly the particular application and sometimes the 
available technology that determines the choice between both classes. Both types 
have been invented and developed throughout the twentieth century, and continue 
to be improved and optimized as associated technologies advance. In this chapter 
we will concentrate on linear accelerators and postpone the discussion on circular 
accelerators to the next chapter. In linear accelerators the particles are accelerated 
by definition along a straight path by either electrostatic fields or microwave fields. 
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2.1.1 Charged Particles in Electric Fields 


In accelerator physics all forces on charged particles originate from electromagnetic 
fields. For particle acceleration we consider only the electric-field term of the 
Lorentz force. The nature of the electric field can be static, pulsed, generated 
by a time varying magnetic field or a microwave field. Both the electric and 
magnetic fields are connected by Maxwell’s equations. Such fields are generated 
by appropriate sources hooked up to an accelerating section which, in the case of 
electro-static fields, consists of just two electrodes with the particle source at the 
potential of one electrode and a hole in the center of the other electrode to let the 
accelerated particles pass through. Special resonant cavities are used as accelerating 
sections with two holes on the axis of the cavity to let the beam pass through. Either 
field can be represented by the plane wave equation 


E(W) = Ege’) = Ege”, (2.1) 


where w is the frequency and k the wave number including the case of static fields 
with m = O and k = 0. The Lorentz force acting on an electric charge is 


d 
FL = [ney B = eE(y) (2.2) 


and the equation of motion for particles under this force is 


<meyB = ecE(). (2.3) 


Integration of (2.3) results in an expression for the momentum gain of the particle 


Ap = me(y8 ~ wBs) =e i: E(y) dt, (2.4) 


where mcyofo is the initial momentum of the particle. Generally, it is somewhat 
complicated to perform a time integration which requires the tracking of particles 
though the accelerating cavity. To simplify the calculation, we look for the gain in 
kinetic energy which reduces (2.4) to a spatial integration of the electric field in the 
accelerating cavity. This integral is a property of the cavity and is independent of 
particle motion. With 6 Acp = AE, the energy gain for particles passing through 
the accelerating section is 


AEkin = | E(y) ds, (2.5) 
L 


cy 


where L,y is the length of the accelerating section. 
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The effectiveness of acceleration in a microwave field depends greatly on the 
phase relationship of the field with the particle motion. For successful particle 
acceleration we expect therefore the need to meet specific synchronicity conditions 
to ensure acceleration. 


2.1.2 Electrostatic Accelerators 


In electrostatic accelerators the potential difference between two electrodes is used 
for particle acceleration as shown in Fig. 2.1. The most simple such arrangement 
has been used now for almost two centuries in glow discharge tubes for fundamental 
research on the nature of plasmas, as light sources or as objects of aesthetic interest 
due to colorful phenomena in such tubes. In another, more modern application 
electrons are accelerated in an x-ray tube by high electrostatic fields and produce 
after striking a metal target intense x-rays used in medicine and industry. 

The voltages that can be achieved by straight voltage transformation and rectifi- 
cation are quite limited by electrical breakdown effects to a few 10,000 V/cm. More 
sophisticated methods of producing high voltages therefore have been developed to 
reach potential differences of up to several million volts. 

To distribute evenly the electric fields of high potential differences a series of 
irises are distributed along the acceleration column and separated by appropriate 
resistors to break down the high voltage into smaller steps between the irises. As an 
added benefit we also gain focusing of the particle beam as will be discussed later 
in this chapter. 

A variety of techniques to obtain high voltages have been developed and applied 
to particle acceleration with more or less success. We will discus briefly a few of 
theses techniques because they are still used. 


Fig. 2.1 Principle of 


electrostatic accelerators cathode 


a aS 


electron beam 


high voltage 
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Cascade Generators 


The basic method implemented in the cascade generator is that of a voltage 
multiplier circuit which has been proposed by Greinacher [5] in 1914 and Schenkel 
[10] in 1919 which allows to achieve a multiplication of the voltage across the 
plates of a capacitor. A set of capacitors are charged through appropriately placed 
diodes from an alternating current source (Fig. 2.2) in such a way that during the 
positive half wave, half the capacitors are charged to a positive voltage and during 
the negative half wave, the other half of the capacitors are charged to a negative 
voltage thus providing twice the maximum ac voltage. By arranging 2N capacitors 
in this way the charging voltage can be multiplied by the factor N. 

While there is no fundamental limit to the total voltage, high voltage break down 
will impose a technical limit on the maximum achievable voltage. Based on this 
method Cockcroft and Walton [3] developed appropriate high-voltage techniques 
and built the first high energy particle accelerator reaching voltages as high as 
several million Volt. Applying the high voltage to a beam of protons they were 
able for the first time to initiate through artificially accelerated protons a nuclear 
reaction. In this case it was the conversion of a Lithium nucleus into two helium 
nuclei, in the reaction 


p+Li — 2He. (2.6) 


Such Cockcroft-Walton accelerators turned out to be very efficient and are still 
used as the first step in modern proton accelerator systems. Obviously with this 
kind of voltage generation it is not possible anymore to produce a continuous 
stream of particles. Because of the switching process, there is a time to charge the 
capacitors followed by a time to apply the multiplied voltage to particle acceleration. 
As a consequence, we observe a pulsed particle beam from a Cockroft-Walton 
accelerator. 


U= U, sinot en J diodes 


Fig. 2.2. Cascade generator (schematic) 
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Van de Graaff Accelerator 


Much higher voltages can be reached with a Van de Graaff accelerator [12]. Here, 
electric charge is extracted by field emission from a pointed metal electrode and 
sprayed onto an isolated endless belt. This belt is moved by motor action to carry 
the charge to the inside of a hollow sphere, where the charge is stripped off again 
by reverse field emission onto a pointed metal electrode which is connected to 
the inside of the sphere. The principle of this electrostatic generator is shown in 
Fig. 2.3. Electrical charges in a metallic conductor collect on the outside and it is 
therefore possible to continuously accumulate electrical charge by deposition to the 
inside surface of a hollow metallic sphere. If the whole system is placed into a high 
pressure vessel filled with an electrically inert gas like Freon or SF¢, voltages as 
high as 20 million volts can be reached. 

The high voltage can be used to accelerate electrons as well as protons or ions. 
In the latter two cases more than double the accelerating voltage can be achieved 
in a Tandem Van de Graaff accelerator. If a proton beam must be accelerated, the 
accelerating process would start with negatively charged hydrogen ions H™ from 
a plasma discharge tube which are then accelerated say from ground potential 
to the full Van de Graaff voltage +V. At that point the two electrons of the 
negative hydrogen ion are stripped away by a thin foil or gas curtain resulting by 
charge exchange in positively charged protons which can be further accelerated 
between the potential + V and ground potential to a total kinetic energy Exin = e2V. 
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Fig. 2.3. Van de Graaff accelerator (schematic) 
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High electrostatic voltages from a Van de Graaff generator cannot be applied 
directly to just two electrodes as shown in Fig. 2.1. Because of the great distance 
between the electrodes necessary to avoid voltage break down the fields would not 
be distributed uniformly along the axis of the acceleration column. Therefore, the 
voltage is applied to a series of resistors connected to iris electrodes which allow a 
uniform distribution of the electrical field along the acceleration column as shown 
in Fig. 2.3. A more detailed review of the development of electro-static high voltage 
generators can be found in [13]. 


2.2 Electric Field Components 


In our discussion we concentrate first on the interaction of particles with electro- 
static fields. Such field components function both as focusing and accelera- 
tion devices. Electro-static fields are commonly employed for low energy, non- 
relativistic particles. As was discussed earlier, magnetic devices are most effective 
at high energies when particle velocities are close to the speed of light. At lower 
velocities, magnetic fields loose their efficiency and are often replaced by more 
economic electric field devices and at very low energies electric fields are used 
almost exclusively. 


2.2.1 Electrostatic Deflectors 


The electric field E between two parallel metallic electrodes is uniform and can be 
used to deflect a particle beam. To get a uniform field, we generate equipotential 
surfaces by placing metallic electrodes at, for example, x = +G =const. and 
applying a voltage difference V between the electrodes. The Lorentz force of the 
electric field on a charged particle is by virtue of d’ Alembert’s principle equal to the 
centrifugal force and is for a horizontal deflection 


2, 
eZE, = (2.7) 


Here we have assumed that the electric field is parallel to the vector from the 
particle to the center of curvature. That is true for parallel plates which are curved to 
follow the curvature or almost true for straight parallel plates if the deflection angle 
is very small. Solving for the curvature, we get 


_ eZE, — eZV 1 y (2.8) 
pp yAmc? p2 2G Exiny +17 
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where 2G is the distance and V the voltage between the electrodes. We kept here the 
relativistic notation to cover the rare use of electrostatic fields on high energy beams 
for small deflections which cannot be done by magnetic fields. For nonrelativistic 


particles, (2.8) reduces to : = “©: or in case of an ion beam with charge 


Amv? 
multiplicity Z and kinetic energy per nucleon Exin = smv> 


1 (iu) = eZE,. (Vim) 


pm) = FAB eV)” oe 


where E,, = V/ (2G)is the electric field between the electrodes. 


2.2.2 Electrostatic Focusing Devices 


The most simple electro-static device with focusing properties is an iris electrode 
on some potential and coaxial with the path of a charged particle beam as shown in 
Fig. 2.4. 

To determine the field configuration and focusing properties, we note that 
the electric potential distribution V(r, z) in the vicinity of the iris is rotationally 
symmetric and expanding into a Taylor series about r = 0 this symmetry requires 
all odd terms of the expansion to vanish. 


at d*Vo (2) 
24 ort 


1.. 
V(r, z) = Vo(z) + 50 @rt+ a (2.10) 
Derivatives with respect to r are indicated with a dot and derivatives with respect 
to z with a prime. To be a real potential solution (2.10) must also be a solution of 
the Laplace equation 


OV Ia a _ 


AV 
or r or dz? 


(2.11) 
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Fig. 2.4 Iris electrode 
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Inserting (2.10) into (2.11) results in 


oa? Veg chs 1 0*Vo Ley 9 
0=VW+=-7.7r + OV aa VIS aVor +: (2.12) 
2 ar4 2 art 2 


where the coefficients of each term r” must Pe equal to zero eSepetaiely to give 2Vo + 


Vi = 0, Vo =Oand 22% 4 Li" — aM = 32% 
and the potential function is 
1 La Vo (z) 
Viz =V(2)--Wwr+ a 243 
(7,2) = Vo@)—qVo @r + a ae (2.13) 
The on-axis (r = 0) field component is 

E, = —Vo(2) , (2.14) 

and from VE = 0 or “he a -1 Zé (rE,) , we get by integration 

r OE, 1 

E, = =5 oe = 5Vo (z) r. (2.15) 


Knowing the field components, we can derive the focusing properties by 
integrating the radial equation of motion m7 = mvr" = qE,, where v and q are the 
particle velocity and charge, respectively. We use Fig. 2.5 to define the integration 


q f° q_ [° 9Er 
Palle Baa / 
TL ny? [ on amv J, "Re 


and solve in thin lens approximation (r = cost, v = cost) 


(Ey — E}) . (2.16) 
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Fig. 2.5 Focusing by an iris electrode 
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With }mv? = qVo and setting E = —V’, (2.16) becomes r, — r, = 4 eal and the 
focal length of the iris electrode is 
1 VW-V; 
~=— 2 1 (2.17) 
f 4Vo 
and the transformation matrix is finally 
1 0 
Miris = V35—-Vi . (2.18) 
Vo 


From the transformation matrix or focal length it is obvious that there is no 
focusing for a symmetric iris electrode where V, = V{. On the other hand, an 
asymmetric potential is not possible without additional electrodes. We investigate 
therefore the properties of an iris doublet. 


2.2.3 Iris Doublet 


We now investigate the particle dynamics for an iris doublet as shown in Fig. 2.6. 
Between both electrodes a distance d apart, the potential varies linearly from V; 
to V2. The doublet has three active parts, two iris electrodes and the drift space 
between them. The transformation matrices for both iris electrodes are 


1 0 1 0 
M, = (ws ‘ and Mo = (ws 4 : (2.19) 


4dV 4dV2 


The transformation matrix for the drift space between the electrodes can be 
derived from the particle trajectory 


4 Z 7 Zz r pi _ 
ra=n+ fr@a=n+ fF a, 2.20 
(=r : (Z) ee Serer ar ere (2.20) 


Fig. 2.6 Focusing by an iris doublet 
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The particle momentum varies between the electrodes from py = /2mE in to py + 
Ap(2) = sf 2m (Exin + 4 ae z) and the integral in (2.20) becomes 


d 


2d J/V; 
: M+ VV, 
(3515 


q 2Exin d V2 —Vi_ 
=v Bad 
0 1+ 247 q (V2 1 kin 


The particle trajectory at the location of the second electrode is r(d) = rp = ry + 


een and its derivative r; = r,J/Vi//V2 from which we can deduce the 


transformation matrix 


Ma={ Vetv% |, (2.22) 


We may now collect all parts and get the transformation matrix for the iris doublet 


1 2d 
s(R+1) & 

Mar = MroMaM, = ( isis lem (2.23) 
8dR2 2R2 


where R = ./V2/ ./V;. Unfortunately, this doublet is still not very convenient since 
it still changes the energy of the particle as indicated by the fact that the determinant 
det(Mapn) = 1/R. As indicated earlier this focusing device is also an accelerating 
structure. Any two adjacent irises along a high voltage accelerating structure act like 
a focusing device while accelerating particles. 


2.2.4 KEinzellens 


To obtain a focusing device that does not change the particle energy, we combine 
two doublets to form a symmetric triplet as shown in Fig. 2.7. The transformation 


Fig. 2.7 Structure of an Einzellens 
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matrix for such an Einzellens is then the product of two symmetric doublets 


mi, m 
Met = Moa (V2, Vi) Mia (Vi. V2) = ( . a (2.24) 
m2, M22 
where 
_ 3R 3 __ 2d 3R—1 
PAY = a OR? M12 = "R TER? (2.25) 
__ 3(R?=1)(1—R)3—R) _ 3 3R . 
m2 Sak ; ag = 4 5 


and R = me = 4/ se vy = 4/l+ £. The Einzellens displays some 


peculiar focusing properties depending on the potentials involved compared with 
the particle’s kinetic energy. The focal length of the Einzellens is 


1 3 > 
7 = Bak (1—R*) (R-1)G—R). (2.26) 
Varying the potential V, we obtain varying focusing conditions as summarized in 
the following table and plotted as a function of R in Fig. 2.8. 

The results of focusing properties in an Einzellens are compiled in the follow- 
ing table. Depending on the chosen voltage the Einzel-lens can be focusing or 
defocusing. 


VV<-Vy, -Vij<V<00<V<8V, V>8YV, 
R imaginary O0<R<1 1<R<3 R>3 
1/f no solution >0 >0 <0 

n/a focusing focusing de-focusing 


30 5 


Fig. 2.8 Focusing 8d/3f from (2.26) in an Einzellens as a function of R 
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The practical focusing regime is limited to 0 < R < 1. For 1 < R < 3 the 
focusing is very weak and for R > 3 the Einzellens is defocusing. 


2.3 Acceleration by rf Fields 


The most successful acceleration of particles is based on the use of rf fields 
for which by now powerful sources exist. Very high accelerating voltages can 
be achieved in resonant rf cavities far exceeding those obtainable in electrostatic 
accelerators of similar dimensions. Particle acceleration in linear accelerators as 
well as in circular accelerators are based on the use of rf fields and we will in 
the following sections and in the next chapter discuss the principles of the more 
important types of particle accelerators. 


2.3.1 Basic Principle of Microwave Linear Accelerators 


The principle of the linear accelerator based on microwave fields and drift tubes 
was proposed by Ising [6] and Wideroe. [14] The accelerator consists of a series 
of coaxial metallic tubes where the accelerating field is generated in gaps between 
adjacent tubes. In this method particles are accelerated by repeated application of rf 
fields. Wideroe constructed such an accelerator and was able to accelerate potassium 
ions up to 50 keV. 

While the principle is simple, the realization requires specific conditions to 
ensure that the particles are exposed to only accelerating rf fields. The particles 
travel through the metallic tubes while the field is not suitable for acceleration as 
shown in Fig. 2.9. The tubes shield the particles from external rf fields and the length 
of the tube segments are chosen such that the particles reach the gap between two 
successive tubes only when the rf field is accelerating. 


Synchronicity Condition 


For efficient acceleration the motion of the particles must be synchronized with the 
rf fields in the accelerating sections. The distance between the center of two adjacent 
gaps must be equal to the travel time of the particles from one gap to the next. The 
length of the drift tubes are chosen such that the particles travel for most of the rf 
period in the field free interior and emerge in a gap to the next drift tube at a moment 
the field is accelerating. The length of the shielding tubes is therefore almost as long 
as it take the particles to travel in a full rf period. In this case, we have synchronism 
between particle motion and rf field and the length of the ith drift tube/section is 


i a Tet 7 (2.27) 
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Fig. 2.9 Wideroe linac structure (schematic) 


where v; is the velocity of the particles in the ith section and T;, the rf period. 

Stimulated by the successful acceleration of potassium ions by Wideroe, a group 
led by Sloan and Lawrence at Berkeley were able to build a 50kW rf generator 
oscillating at 10 MHz and delivering a gap voltage of 42kV. Applying this to 30 
acceleration tubes they were able to accelerate mercury ions to a total kinetic energy 
of 1.26 MeV [11]. 

In the 1920s when this principle was developed it was difficult to build high 
frequency generators at significant power. In 1928 rf generators were available only 
up to about 7 MHz and numerical evaluation of (2.27) shows that this principle was 
useful only for rather slow particles like low energy protons and ions. The drift 
tubes can become very long for low rf frequencies and particles traveling with, for 
example, half the speed of light would require a drift tube length of 10.7 m at 7 MHz. 
Such long drift tubes add up quickly to a very long accelerators before the particles 
approach the speed of light. To reduce the length of the tubes, higher frequencies 
are required. 

Further progress in the development of rf linear accelerators therefore depended 
greatly on the development of rf equipment at high frequency which happened 
during World War IJ in connection with the development of radar systems. In 
1937, Hansen and the Varian brothers invented the klystron at Stanford. Soon the 
feasibility of high power klystrons had been established [2] which to this date is 
one of the most efficient rf amplifiers available. The first klystron was developed 
for 3,000 MHz which is still the preferred frequency for high energy electron linear 
accelerators. The klystron principle is economically feasible from about 100 MHz 
to more than 10 GHz. With such a wide range of high frequencies available, the 
principle of rf acceleration in linear accelerators has gained quick and continued 
prominence for the acceleration of protons as well as electrons. 

Going to higher frequencies, however, the capacitive nature of the Wideroe 
structure becomes very lossy due to electromagnetic radiation. To overcome this 
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Fig. 2.10 Alvarez linac structure (schematic) 


difficulty, Alvarez [1] proposed to enclose the gaps between the tubes by metallic 
cavities (Fig. 2.10). The acceleration section would now be composed of a series of 
tubes forming, together with the outer enclosure, a resonant cavity. 

This Alvarez structure is still the preferred preaccelerator to accelerate protons 
and ions from a few hundred keV out of a Cockroft-Walton electrostatic generator 
to a few hundred MeV for injection into a booster synchrotron. Because of the lower 
velocity of protons and ions at up to a few hundred MeV the operating frequency 
for proton linacs is generally around 200 MHz. 

Radio frequencies of 3,000 MHz and higher are desired for electron acceleration. 
In Chap. 18.4 we will discuss in more detail the basic features and scaling of high 
frequency accelerating structures to give the interested reader the tools to understand 
the scaling and limitations of basic linear accelerator physics. For more detailed 
discussion of rf aspects in linear electron accelerators, the reader is referred to the 
literature [4, 7, 9]. 


Problems 


2.1 (S). Derive the geometry of electrodes for a horizontally deflecting electric 
dipole with an aperture radius of 2cm which is able to deflect an electron beam 
with a kinetic energy of 10 MeV by 10 mrad. The dipole be 0.1 m long. What is the 
electric field required between the electrodes? 


2.2 (S). Calculate the minimum power rating for the motor driving the charging 
belt of a Van de Graaff accelerator while producing a charge current of 100 mA at 
5 MV. 


2.3 (S). Calculate the length for the first four drift tubes of a Wideroe linac for 
the following parameters: starting kinetic energy is 100 keV, the energy gain per 
gap is 1 MeV, and the microwave frequency 7 MHz. Assume the gaps to be of 
zero length for simplicity. Perform the calculations for both electrons and singly 
charged potassium ions (Ax = 39.0983 amu ~ 39; 1 amu ~ 1 GeV) and compare 
the results. 
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Chapter 3 
Circular Accelerators 


Parallel with the development of electrostatic and linear rf accelerators the potential 
of circular accelerators was recognized and a number of ideas for such accelerators 
have been developed over the years. Technical limitations for linear accelerators 
encountered in the early 1920s to produce high-power rf waves stimulated the 
search for alternative accelerating methods or ideas for accelerators that would use 
whatever little rf fields could be produced as efficiently as possible. 

Interest in circular accelerators quickly moved up to the forefront of accelerator 
design and during the 1930s made it possible to accelerate charged particles to many 
million electron volts. Only the invention of the rf klystron by the Varian brothers 
at Stanford in 1937 gave the development of linear accelerators the necessary boost 
to reach par with circular accelerators again. Since then both types of accelerators 
have been developed further and neither type has yet outperformed the other. In 
fact, both types have very specific advantages and disadvantages and it is mainly the 
application that dictates the use of one or the other. 

Circular accelerators are based on the use of magnetic fields to guide the charged 
particles along a closed orbit. The acceleration in all circular accelerators but the 
betatron is effected in one or few accelerating cavities which are traversed by the 
particle beam many times during their orbiting motion. This greatly simplifies the 
rf system compared to the large number of energy sources and accelerating sections 
required in a linear accelerator. While this approach seemed at first like the perfect 
solution to produce high energy particle beams, its progress soon became limited 
for the acceleration of electrons by copious production of synchrotron radiation. 

The simplicity of circular accelerators and the absence of significant synchrotron 
radiation for protons and heavier particles like ions has made circular accelerators 
the most successful and affordable principle to reach the highest possible proton 
energies for fundamental research in high energy physics. Protons are being 
accelerated into the TeV range in the Large Hadron Collider (LHC) at CERN in 
Geneva, Switzerland [1]. 
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For electrons the principle of circular accelerators has reached a technical and 
economic limit at about 28 GeV [2] due to synchrotron radiation losses, which 
make it increasingly harder to accelerate electrons to higher energies [3]. Further 
progress in the attempt to reach higher electron energies is being pursued through 
the principle of linear colliders [4, 5], where synchrotron radiation is avoided. 

Many applications for accelerated particle beams, however, exist at significantly 
lower energies and a multitude of well developed principles of particle acceleration 
are available to satisfy those needs. We will discuss only the basic principles behind 
most of these low- and medium-energy accelerators in this text and concentrate in 
more detail on the beam physics in synchrotrons and storage rings. Well documented 
literature exists for smaller accelerators and the interested reader is referred to the 
bibliography at the end of this text. 


3.1 Betatron 


The first “circular electron accelerator” has been invented and developed a hundred 
years ago in the form of an electrical current transformer. Here we find the electrons 
in the wire of a secondary coil accelerated by an electro motive force generated by 
a time varying magnetic flux through the area enclosed by the secondary coil. This 
idea was picked up independently by several researchers [6, 7]. Wideroe finally 
recognized the importance of a fixed orbit radius and formulated the Wideroe 5- 
condition, which is a necessary although not sufficient condition for the successful 
operation of a beam transformer or betatron as it was later called, because it 
functions optimally only for the acceleration of beta rays or electrons [8]. 

The betatron makes use of the transformer principle, where the secondary coil 
is replaced by an electron beam circulating in a closed doughnut shaped vacuum 
chamber. A time-varying magnetic field is enclosed by the electron orbit and the 
electrons gain an energy in each turn which is equal to the electro-motive force 
generated by the varying magnetic field. The principle arrangement of the basic 
components of a betatron are shown in Fig. 3.1. 


return yoke excitation coil 


vacuum chamber 


Fig. 3.1 The principle of acceleration in a betatron (schematic) 
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The accelerating field is determined by integrating Maxwell’s equation 
VxE : B (3.1) 
x E=-— ; 
ot 


and utilizing Stokes’s theorem, we obtain the energy gain per turn 


oD 
kas aa, (3.2) 
or 


where @ is the magnetic flux enclosed by the integration path, which is identical to 
the design orbit of the beam. The particles follow a circular path under the influence 
of the Lorentz force in a uniform magnetic field. We use a cylindrical coordinate 
system (r,@,y), where the particles move with the coordinate y clockwise along 
the orbit. From (1.74) we get for the particle momentum 


cp = ycemv = ecrB,. (3.3) 


The accelerating force is equal to the rate of change of the particle momentum and 
can be obtained from the time derivative of (3.3). This force must be proportional to 
the azimuthal electric field component Ey on the orbit 


d d dB 
es —e (Ga + Ze) = cEy. (3.4) 


Following Wideroe’s requirement for a constant orbit dr/dt = 0 allows the contain- 
ment of the particle beam in a doughnut shaped vacuum chamber surrounding the 
magnetic field. The induced electric field has only an angular component Ey since 
we have assumed that the magnetic field enclosed by the circular beam is uniform 
or at least rotationally symmetric. While noting that for a positive rate of change for 
the magnetic field the induced azimuthal electric field is negative, the left hand side 
of (3.2) then becomes simply 


f kas =— / E,Rdg = —27RE,. (3.5) 


On the other hand, we have from (3.4) 


cE, = —eR (3.6) 


and using (3.5), (3.6) in (3.2) we get 


—__. = Ink. (3.7) 


d®@ dB, (R) 
dt dt 
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Noting that the complete magnetic flux enclosed by the particle orbit can also be 
expressed by an average field enclosed by the particle orbit, we have ® = 1 R*B(R), 
where B(R) is the average magnetic induction within the orbit of radius R. The rate 
of change of the magnetic flux becomes 


do 5 dB, (R) 


and comparing this with (3.7) we obtain the Wideroe 5-condition 


B1(R) = 3B (R), (3.9) 


which requires for orbit stability that the field at the orbit be half the average flux 
density through the orbit. This condition must be met in order to obtain orbital-beam 
stability in a betatron accelerator. By adjusting the total magnetic flux through the 
particle orbit such that the average magnetic field within the orbit circle is twice the 
field strength at the orbit, we are in a position to accelerate particles on a circle with 
a constant radius R within a doughnut shaped vacuum chamber. 

The basic components of a betatron, shown in Fig. 3.1, have rotational symmetry. 
In the center of the magnet, we recognize two magnetic gaps of different aperture. 
One gap at R provides the bending field for the particles along the orbit. The other 
gap in the midplane of the central return yoke is adjustable and is being used to 
tune the magnet such as to meet the Wideroe +-condition. The magnetic field is 
generally excited by a resonance circuit cycling at the ac frequency of the main 
electricity supply. In this configuration the magnet coils serve as the inductance and 
are connected in parallel with a capacitor bank tuned to the ac frequency of 50 or 
60 Hz. 

The rate of momentum gain is derived by integration of (3.4) with respect to 
time and we find that the change in momentum is proportional to the change in the 
magnetic field 


dB 
A p=R f THar= erABy (3.10) 


The particle momentum depends only on the momentary magnetic field and not on 
the rate of change of the field. For slowly varying magnetic fields the electric field is 
smaller but the particles will make up the reduced acceleration by travelling around 
the orbit more often. While the magnet cycling rate does not affect the particle 
energy it certainly determines the available flux of accelerated particles per unit 
time. The maximum particle momentum is determined only by the orbit radius and 
the maximum magnetic field at the orbit during the acceleration cycle 


CPmax = ecRBmax(R). (3.11) 
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The betatron principle works for any charged particle and for all energies since 
the stability condition (3.9) does not depend on particle parameters. In praxis, 
however, we find that the betatron principle is unsuitable to the acceleration of heavy 
particles like protons. The magnetic fields in a betatron as well as the size of the 
betatron magnet set practical limits to the maximum momentum achievable. Donald 
Kerst built the largest betatron ever constructed with an orbit radius of R = 1.23 m, 
a maximum magnetic field at the orbit of 8.1 kG and a total magnet weight of 350 
tons reaching the maximum expected particle momentum of 300 MeV/c at 60 Hz. 

For experimental applications we are interested in the kinetic energy of the 
accelerated particles. In case of electrons the rest mass is small compared to the 
maximum momentum of cp = 300 MeV and therefore the kinetic electron energy 
from this betatron is 


Exin © cp = 300 MeV. (3.12) 


In contrast to this result, we find the achievable kinetic energy for a proton to be 
much smaller 


| 2 
Bose (cp) 


57 = 48Mev, (3.13) 
Mpc 


because of the large mass of protons. 

The betatron produces a pulse of accelerated particles once per ac cycle. To gain 
the maximum energy, the ac field is biased by a dc current and acceleration occurs 
from the minimum ac field to the maximum ac field. At the maximum field the beam 
can be ejected for applications. 

Different, more efficient accelerating methods have been developed for protons, 
and betatrons are therefore used exclusively for the acceleration of electrons as 
indicated by it’s name. Most betatrons are designed for modest energies of up 
to 45 MeV and are used to produce electron and hard x-ray beams for medical 
applications or in technical applications to, for example, examine the integrity of 
full penetration welding seams in heavy steel containers. 


3.2 Weak Focusing 


The Wideroe $-condition is a necessary condition to obtain a stable particle orbit 
at a fixed radius R. This stability condition, however, is not sufficient for particles 
to survive the accelerating process. Any particle starting out with, for example, a 
slight vertical slope would, during the acceleration process, follow a continuously 
spiraling path until it hits the top or bottom wall of the vacuum chamber and gets 
lost. Constructing and testing the first, although unsuccessful, beam transformer, 
Wideroe recognized [8] the need for beam focusing, a need which has become a 


fundamental part of all future particle accelerator designs. First theories on beam 
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stability and focusing have been pursued by Walton [9] and later by Steenbeck, who 
formulated a stability condition for weak focusing and applied it to the design of 
the first successful construction and operation of a betatron in 1935 at the Siemens- 
Schuckert Company in Berlin reaching an energy of 1.9 MeV [10] although at a 
very low intensity measurable only with a Geiger counter. The focusing problems in 
a betatron were finally solved in a detailed orbit analysis by Kerst and Serber [11]. 

To derive the beam stability condition we note that (1.74) is true only at the ideal 
orbit r = R. For any other orbit radius r the restoring force is 


2 
F, = —— —evB,. (3.14) 
Here we use a cartesian coordinate system which moves with the particle along the 
orbit with x pointing in the radial and y in the axial direction. 

In a uniform magnetic field the restoring force would be zero for any orbit. To 
include focusing we assume that the magnetic field at the orbit includes a gradient 
such that for a small deviation x from the ideal orbit, r = R + x = R(1 + x/R), the 
magnetic guide field becomes 


Cot Sabi (3.15) 
= ‘ — x = . oe 2 . 
2 » ax % Boy 0x R 
After insertion of (3.15) into (3.14) the restoring force is 
pee (1 =) B (1 =) (3.16) 
Pa — —)-—evBo, (1-—n—), : 
R R ” R 
where we assumed x < R and defined the field index 
R OB, 
= —-— — 3ST 
. Boy ox ( ) 
With (1.74) we get for the horizontal restoring force 
F.= 2 oll =i), (3.18) 


The equation of motion under the influence of the restoring force in the deflecting 
or horizontal plane is with F, = ymx 


¥+a2x=0, (3.19) 


which has the exact form of a harmonic oscillator with the frequency 


oO, = . l-—-n= avl—n, (3.20) 


3.2 Weak Focusing 65 


where wo is the orbital revolution frequency. The particle performs oscillations about 
the ideal or reference orbit with the amplitude x(z) and the frequency w,. Because 
this focusing feature was discovered in connection with the development of the 
betatron we refer to this particle motion as betatron oscillations with the betatron 
frequency w,. From (3.20) we note a stability criterion, which requires that the 
field index not exceed unity to prevent the betatron oscillation amplitude to grow 
exponentially, 


wel: (3.21) 


The particle beam stability discussion is complete only if we also can show that 
there is stability in the vertical plane. A vertical restoring force requires a finite 
horizontal field component B, and the equation of motion becomes 


ymy = evB,. (3.22) 


a8; _ 9%) _ 0 can be integrated and the horizontal field 


Maxwell’s curl equation = 7 
component is with (3.15), (3.17) 


OB, Boy Boy 
B, = *dy=-— *dy = —n—. 3.23 
; a pe ly armas (3.23) 


Insertion of (3.23) and (1.74) into (3.22) results in the equation of motion for the 
vertical plane in the form of 


y+ory=0, (3.24) 
where the vertical betatron oscillation frequency is 
Wy = woJ/n. (3.25) 


Particles perform stable betatron oscillations about the horizontal mid plane with 
the vertical betatron frequency wy as long as the field index is positive 


n>0. (3.26) 
In summary, we have found that a field gradient in the magnetic guide field can 
provide beam stability in both the horizontal and vertical plane provided that the 
field index meets the criterion 


O<n<l, (3.27) 


which has been first formulated and applied by Steenbeck [10] and is therefore also 
called Steenbeck’s stability criterion. 
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A closer look at (3.20) shows that the field index actually provides defocusing 
in the horizontal plane. The reason why we get focusing in both planes is that there 
is a strong natural focusing from the sector type magnet which is larger than the 
defocusing from the field gradient. This focusing is of geometric nature and relates 
to the length of the orbit. A particle travelling, for example, parallel to and outside 
the ideal orbit is deflected more because it follows a longer path in the uniform 
magnetic field than a particle following the ideal orbit leading to effective focusing 
toward the ideal orbit. Conversely, a particle traveling parallel to and inside the ideal 
orbit is deflected less and therefore again is deflected toward the ideal orbit. 

The stability condition (3.21) actually stipulates that the defocusing from the 
field index in the horizontal plane be less than the focusing of the sector magnet 
allowing to choose the sign of the field index such that it provides focusing in the 
vertical plane. Basically the field gradient provides a means to distribute the strong 
sector magnet focusing. This method of beam focusing is known as weak focusing 
in contrast to the principle of strong focusing, which will be discussed extensively 
in the remainder of this text. 


3.3 Adiabatic Damping 


During the discussion of transverse focusing we have neglected the effect of 
acceleration. To include the effect of acceleration into our discussion on beam 
dynamics, we use as an example the Lorentz force equation for the vertical motion. 
The equation of motion is 


d . 
qe) = ev-By, (3.28) 


where we used the fields B = (B,, By, 0) in a cartesian coordinate system (x, y, z). 
Evaluating the differentiation, we get the equation of motion at the equilibrium orbit 


ymy + ymy = ew@oRB,. (3.29) 


Inserting (3.23) into (3.29) results in the equation of motion in the vertical plane 
under the influence of accelerating electrical and focusing magnetic fields 


ag Be ot 

J+ Ey + nawpy = 0, (3.30) 
where E is the energy gain per unit time. This is the differential equation of a damped 
harmonic oscillator with the solution 


atyt 


y =yoe “ cos ayt, (3.31) 
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where wy  w./n and the damping decrement 


Le (3.32) 
y= —=. : 
a 3 
For technically feasible acceleration E the damping time t,) = a, ' is very long 


compared to the oscillation period and we therefore may consider for the moment 
the damping as a constant. The envelope ymax = yoe ~” of the oscillation (3.31) 
decays like 


d 1? dt (3.33) 
Ymax = 2 Eom , : 


which after integration becomes 


a Vz (3.34) 
YO,max E 

The betatron oscillation amplitude is reduced as the particle energy increases. This 
type of damping is called adiabatic damping. Similarly, the slope y’ as well as the 
horizontal oscillation parameters experience the same effect of adiabatic damping 
during acceleration. This damping is due to the fact that the longitudinal particle 
momentum is increasing during acceleration while the transverse momentum is not. 
For a particle beam we define a beam emittance in both planes by the product €,, = 
Umax Ul ax? where u stands for x or y. Due to adiabatic damping this beam emittance 
is reduced inversely proportional to the energy like 


eva. (3.35) 


No specific use has been made of the principle of betatron acceleration to derive 
the effect of adiabatic damping. We therefore expect this effect to be generally valid 
for any kind of particle acceleration. 

The development of the betatron was important for accelerator physics for several 
reasons. It demonstrated the need for particle focusing, the phenomenon of adiabatic 
damping and stimulated Schwinger to formulate the theory of synchrotron radiation 
[3]. He realized that the maximum achievable electron energy in a betatron must be 
limited by the energy loss due to synchrotron radiation. Postponing a more detailed 
discussion of synchrotron radiation to Chap. 24, we note that the instantaneous 
synchrotron radiation power is given by Py « E?F 4 , where F', is the transverse 
force on the particle and the energy loss per turn to synchrotron radiation in a 
circular electron accelerator is given by (24.41). The energy loss increases rapidly 
with the fourth power of energy and can lead quickly to an energy limitation of the 
accelerator when the energy loss per turn becomes equal to the energy gain. 
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Most types of circular particle accelerators utilize compact accelerating cavities, 
which are excited by rf sources. Particles traverse this cavity periodically and 
gain energy from the electromagnetic fields in each passage. The bending magnet 
field serves only as a beam guidance system to allow the repeated passage of the 
particle beam through the cavity. Technically, this type of accelerator seem to be 
very different from the principle of the betatron. Fundamentally, however, there is 
no difference. We still rely on the transformer principle, which in the case of the 
betatron looks very much like the familiar transformers at low frequencies, while 
accelerating cavities are transformer realizations for very high frequencies. Electric 
fields are generated in both cases by time varying magnetic fields. 

Since the cavity fields are oscillating, acceleration is not possible at all times and 
for multiple accelerations we must meet specific synchronicity condition between 
the motion of particles and the field oscillation. The time it takes the particles to 
travel along a full orbit must be an integer multiple of the oscillation period for the 
radio frequency field. This synchronization depends on the particle velocity, path 
length, magnetic fields employed, and on the rf frequency. Specific control of one 
or more of these parameters defines the different types of particle accelerators to be 
discussed in the following subsections. 


3.4.1 Microtron 


The schematic configuration of a microtron [12] is shown in Fig.3.2. Particles 
emerging from a source pass through the accelerating cavity and follow then a 
circular orbit in a uniform magnetic field B, leading back to the accelerating cavity. 
After each acceleration the particles follow a circle with a bigger radius till they 
reach the boundary of the magnet. 

The bending radius of the orbit can be derived from the Lorentz force equation 
(1.74) 


1 ecB ecBy 


= = ; 3.36 
pcp -yBmc? — 
and the revolution time for a particle traveling with velocity v is 
20 20 
=P ame (3.37) 
v e By 


The revolution time is therefore proportional to the particle energy and inversely 
proportional to the magnetic field. It is interesting to note that for subrelativistic 
particles, where y ~ 1, the revolution time is constant even though the particle 
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magnetic shield for beam,extraction 


electron 
source 


Fig. 3.2 The principle of a microtron accelerator (schematic) 


momentum increases. The longer path length for the higher particle momentum 
is compensated by the higher velocity. As particles reach relativistic energies, 
however, this synchronism starts to fail. To still achieve continued acceleration, 
specific conditions must be met. 

A particle having completed the nth turn passes through the cavity and it’s energy 
is increased because of acceleration. The change in the revolution time during the 
(n+1)st turn compared to the nth turn is proportional to the energy increase Ay. The 
increase in the revolution time must be an integer multiple of the radio frequency 
period. Assuming that the revolution time along the first innermost circle, when the 
particle energy is still y ~ 1, is equal to one rf period we conclude that synchronism 
is preserved for all turns if 


Ay=1 (3.38) 


or integer multiples. In order to make a microtron functional the energy gain from 
the accelerating cavity in each passage must be 


AE.= 511 keV for electrons and 
AE,= 938 MeV for protons. (3.39) 


While it is possible to meet the condition for electrons it is technically impossible at 
this time to achieve accelerating voltages of almost | GV in an accelerating cavity 


70 3 Circular Accelerators 


extraction magnet 


accelerated beam 


, —_—! = 

‘ : beam from 
P P inflection 

accelerating section magnet source 


Fig. 3.3. Race track microtron [13] (schematic) 


of reasonable length. The principle of microtrons is therefore specifically suited for 
the acceleration of electrons. 

The size of the magnet imposes a practical limit to the maximum particle energy. 
A single magnet scales like the third power of the bending radius and therefore 
the weight of the magnet also scales like the third power of the maximum particle 
energy. Single magnet microtrons are generally used only to accelerate electrons to 
energies up to about 25-30 MeV. 

To alleviate the technical and economic limitations as well as to improve control 
of the synchronicity condition, the concept of a race track microtron has been 
developed [13, 14]. In this type of microtron the magnet is split in the middle and 
normal to the orbit plane and pulled apart as shown in Fig. 3.3. The space opened 
up provides space for a short linear accelerator which allows the acceleration of 
electrons by several units in y thus reducing the number of orbits necessary to reach 
the desired energy. The magnets are flat and scale primarily only like the square of 
the bending radius. 


3.4.2 Cyclotron 


The synchronicity condition of a microtron proved to be too severe for the successful 
acceleration of heavier particles like protons. In drawing this conclusion, however, 
we have ignored the trivial synchronicity condition Ay = 0. This condition 
demands that the particle energy be nonrelativistic which limits the maximum 
achievable energy to values much less than the rest energy. This limitation is of 
no interest for electrons since electro-static accelerators would provide much higher 
energies than that. For protons, however, energies much less than the rest energy 
of 938 MeV are of great interest. This was recognized in 1930 by Lawrence and 
Edlefsen [15] in the process of inventing the principle of the cyclotron and the first 
such device was built by Lawrence and Livingston in 1932 [16]. 
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Fig. 3.4 Principle of a 
cyclotron [16] (schematic). 
Vertical (top) and horizontal 
(bottom) mid plane cross 
section 
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The cyclotron principle employs a uniform magnetic field and an rf cavity that 
extends over the whole aperture of the magnet as shown in Fig. 3.4. The accelerating 
cavity has basically the form of a pill box cut in two halves, where the accelerating 
fields are generated between those halves and are placed between the poles of the 
magnet. Because of the form of the half pill boxes, these cavities are often called 
the D’s of a cyclotron. The particle orbits occur mostly in the field free interior of 
the D’s and traverse the accelerating gaps between the two D’s twice per revolution. 
Due to the increasing energy, the particle trajectories spiral to larger and larger radii. 
The travel time within the D’s is adjusted by the choice of the magnetic field such 
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that it is equal to half the radio frequency period. The principle of the cyclotron is 
basically the application of the Wideroe linac in a coiled up version to save space 
and rf equipment. Fundamentally, however, the use of field free tubes or D’s with 
increasing path length between accelerating gaps is the same.The revolution time in 
a cyclotron is given by (3.37) where now y = | and the acceleration of ions with 
a charge multiplicity Z is allowed. Keeping the magnetic field constant, we have a 
constant revolution frequency and may therefore apply a constant radio frequency 


ZeB, 


~ 2mmcy 


St h = const. (3.40) 


The principle of the cyclotron is limited to nonrelativistic particles. Protons 
are sufficiently nonrelativistic up to kinetic energies of about 20-25MeV or 
about 2.5 % of the rest energy. As the particles become relativistic the revolution 
frequency becomes smaller and the particles get out of synchronism with the radio 
frequency fi. 

The radio frequency depends on the charge multiplicity Z of the particles to 
be accelerated and on the magnetic field B,. Evaluating (3.40), the following 
frequencies are required for different types of particles 


See [MHz]= 1.53 - By [kG] for protons, 
= 0.76 - By [kG] for deuterons, (3.41) 
= 0.76 -By[kG] for Het*. 
A closer inspection of the synchronicity condition, however, reveals that these 
are only the lowest permissible rf frequencies. Any odd integer multiple of the 
frequencies (3.41) would be acceptable too. 
As long as particles do not reach relativistic energies, the maximum achievable 


kinetic energy E,in depends on the type of the particle, the magnetic field B, and the 
maximum orbit radius R possible in the cyclotron and is given by 


— (cp) 7 Leak 


Ekin = 1 3.42 
" ame 2 mc? 2 mc? oa) 
Examples of numerical relations are 
Exin[MeV]= 0.48B, [kG?]R? [m7] for protons, 
= 0.24B;[kG*]R*[m*] _ for deuterons, (3.43) 


= 0.48B[kG7]R*[m*]__ for He*¥. 
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The particle flux reflects the time structure of the radio frequency field. For a 
continuous radio frequency field the particle flux is also “continuous” with micro 
bunches at distances equal to the oscillation period of the accelerating field. For a 
pulsed rf system obviously the particle flux reflects this macropulse structure on top 
of the micropulses. 


3.4.3 Synchro-Cyclotron 


The limitation to nonrelativistic energies of the cyclotron principle is due only to 
the assumption that the radio frequency be constant. This mode of operation for rf 
systems is desirable and most efficient but is not a fundamental limitation. Technical 
means are available to vary the radio frequency in an accelerating cavity. 

As the technology for acceleration to higher and higher energies advances, the 
need for particle beam focusing becomes increasingly important. In the transverse 
plane this is achieved by the weak focusing discussed earlier. In the longitudinal 
phase space stability criteria have not been discussed yet. Veksler [17] and McMillan 
[18] discovered and formulated independently the principle of phase focusing, 
which is a fundamental focusing property for high energy particle accelerators based 
on accelerating microwave frequency fields and was successfully tested only one 
year later [19]. We will discuss this principle of phase focusing in great detail in 
Chap. 9. 

Both the capability of varying the rf frequency and the principle of phase 
focusing is employed in the synchro-cyclotron. In this version of the cyclotron, the 
microwave frequency varies as the relativistic factor y deviates from unity. Instead 
of (3.40) we have for the revolution frequency or microwave frequency 


ZeB, 
fe —h, (3.44) 
2m ymec 
where / is an integer called the harmonic number. Since B = const the radio 
frequency must be adjusted like 
fi ~ 1/y (3.45) 


to keep synchronism. The momentary particle energy y(t) can be derived from the 
equation of motion 1/p = ZeB/(cp) which we solve for the kinetic energy 


V Exin(Exin + 2mc?) = eZByp. (3.46) 


The largest accelerator ever built based on this principle is the 184 inch synchro 
cyclotron at the Lawrence Berkley Laboratory (LBL) in 1946 [20]. The magnet 
weight was 4,300 tons, produces a maximum magnetic field of 15kG and has a 
maximum orbit radius of 2.337m. From (3.46) we conclude that the maximum 
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kinetic proton energy should be Exinmax = 471 MeV while 350 MeV have been 
achieved. The discrepancy is mostly due to the fact that the maximum field of 
15 kG does not extend out to the maximum orbit due to focusing requirements. The 
principle of the synchro cyclotron allows the acceleration of particles to rather large 
energies during many turns within the cyclotron magnet. This long path requires 
the addition of weak focusing as discussed in connection with the betatron principle 
to obtain a significant particle flux at the end of the acceleration period. From the 
discussion in Sect. 3.2 we know that efficient focusing in both planes requires the 
vertical magnetic field component to drop off with increasing radius. For equal 
focusing in both planes the field index should be n = - and the magnetic field 
scales therefore like 


1 
By(p) ~ —. 7 
y(p) Vp (3.47) 
The magnetic field is significantly lower at large radii compared to the center of the 
magnet. 

This magnetic field dependence on the radial position leads also to a modification 
of the frequency tracking condition (3.45). Since both the magnetic field and 
the particle energy change, synchronism is preserved only if the rf frequency is 
modulated like 


_, Bylp()] 
v0) 
Because of the need for frequency modulation, the particle flux has a pulsed macro 


structure equal to the cycling time of the rf modulation. A detailed analysis of the 
accelerator physics issues of a synchro-cyclotron can be found in [21]. 


Set 


(3.48) 


3.4.4 Isochron Cyclotron 


The frequency modulation in a synchro cyclotron is technically complicated and 
must be different for different particle species. A significant breakthrough occurred 
in this respect when Thomas [22] realized that the radial dependence of the magnetic 
field could be modified in such a way as to match the particle energy y. The 
condition (3.48) becomes in this case 


_ Ble] 
yo 


To reconcile (3.49) with the focusing requirement, strong azimuthal variations of 
the magnetic fields are introduced 


OBy(p, ~) 
dp 


Set 


= const. (3.49) 


£0. (3.50) 


3.4 Acceleration by rf Fields 75 


In essence, the principle of weak focusing is replaced by strong focusing, to be 
discussed later, with focusing forces established along the particle trajectory while 
meeting the synchronicity condition only on average in each turn such that 


1 
a f Blo gldy ~ y(t). (3.51) 
ua 


Isochron cyclotrons produce a continuous beam of micro bunches at the rf frequency 
and are used frequently for acceleration of protons and ions for cancer therapy. 

The development of circular accelerators has finally made a full circle. Starting 
from the use of a constant radio frequency field to accelerate particles we found the 
need for frequency modulation to meet the synchronicity condition for particles 
through the relativistic transition regime. Application of sophisticated magnetic 
focusing schemes, which are now known as strong focusing, finally allowed to 
revert back to the most efficient way of particle acceleration with constant fixed 
radio frequency fields. 


3.4.5 Synchrotron 


The maximum particle energy is limited to a few hundred MeV as long as one stays 
with the basic cyclotron principle because the volume and therefore cost for the 
magnet becomes prohibitively large. Higher energies can be achieved and afforded 
if the orbit radius R is kept constant. In this case the center of the magnet is not 
needed anymore and much smaller magnets can be employed along the constant 
particle orbit. Equation (3.36) is still applicable but we keep now the orbit radius 
constant and have the design condition 


1 ecB,, 

— = — =const. (3.52) 

R cp 

This condition can be met for all particle energies by ramping the magnet fields 

proportional to the particle momentum. Particles are injected at low momentum 
and are then accelerated while the bending magnet fields are increased to keep the 
particles on a constant radius while they gain energy. The particle beam from such a 
synchrotron is pulsed with a repetition rate determined by the magnetic field cycling. 
The synchronicity condition 


ZecBy 
fit = ; 


On ymc* 


h, (3.53) 


is still valid, but because the magnetic field is varied proportional to the particle 
momentum we expect the frequency to require adjustment as long as there is 
sufficient difference between particle energy and momentum. 
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Fig. 3.5 Magnet arrangement in a synchrotron [23] 


For highly relativistic particles a solution for particle acceleration has been 
found which does not require a prohibitively large magnet and where the radio 
frequency fields can be of constant frequency for optimal efficiency. This is the 
case for electron synchrotrons with an initial energy of at least a few tens of MeV. 
For this reason electrons are generally injected into a synchrotron at energies of 
more than about 10-20 MeV from a linear accelerator or a microtron. Figure 3.5 
shows an example of a synchrotron used for the injection of electrons into a storage 
ring [23]. 

For heavier particles, however, we are back to the need for some modest 
frequency modulation during the early phases of acceleration. From (3.53) we 
expect the revolution frequency to vary like 


Z 
2 


fev (t) = By) BD) (3.54) 
tcp 


To preserve the synchronicity condition, the radio frequency must be an integer 
multiple of the revolution frequency and must be modulated in proportion of the 
varying revolution frequency. The ratio of the rf frequency to revolution frequency 
is called the harmonic numberdefined by 


Sit = h frev- (3.55) 


The maximum energy in a synchrotron is determined by the ring radius R, and 
the maximum magnetic field B,, and is 


CPmax (GEV) = VExin (Exin + 2mc?) = C,By [T]R [m], (3.56) 
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where 


GeV 
C, = 0.2997926———. (3.57) 
Tm 


Early synchrotrons have been constructed with weak focusing bending magnets 
which in addition to the dipole field component also included a field gradient 
consistent with a field index meeting the focusing condition (3.27). More detailed 
information about early weak focusing synchrotrons can be obtained from [21, 24]. 

With the discovery of the principle of strong focusing by Christofilos [25] and 
independently by Courant et. al. [26] in 1952 much more efficient synchrotrons 
could be designed. The apertures in the magnets could be reduced by up to an order 
of magnitude thus allowing the design of high-field magnets at greatly reduced 
overall magnet size and cost. The physics of strong focusing will be discussed in 
great detail in subsequent chapters. 

Synchrotrons are the workhorse in particle acceleration and are applied for 
electron acceleration as well as proton and ion acceleration to the highest energies. 
In more modern proton accelerators superconducting magnets are employed to reach 
energies in excess of | TeV. 


3.4.6 Storage Ring 


Although not an accelerator in the conventional sense, a particle storage ring can be 
considered as a synchrotron frozen in time. While the basic functioning of a storage 
ring is that of a synchrotron, particle beams are generally not accelerated but only 
stored to orbit for long times of several hours. Bruno Touschek and R. Wideroe 
invented this principle in 1941 while working (not completely by their own free 
will) at the Hamburg University for application in high energy physics to bring 
two counter rotating beams of particles and antiparticles into collision and study 
high energy elementary particle processes. A newer and more copious application 
of the storage ring principle arose from the dedicated production of synchrotron 
radiation for basic and applied research and technology. The principles, details and 
functioning of synchrotrons and storage rings will occupy most of our discussions 
in this text. 


3.4.7 Summary of Characteristic Parameters 


It is interesting to summarize the basic principles for the different particle acceler- 
ators discussed. All are based on two relations, the equation of motion (3.14) and 
the synchronicity condition (3.37). Depending on which parameter in these two 
relations we want to keep constant or let vary, different acceleration principles apply 
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Table 3.1 Parameter properties for different acceleration ee 


Principle wey [Ow 
Sit 
Microtron const. const. const.* 


Cyclotron ee const. jeonst. ‘const. _| const.# 


Synchro-cyclotron pulsed 


Isochron cyclotron const.* 


Proton synchrotron pulsed 


Sismonijachvioat eae oe 1 lesa 


* continuous beam, although rf-modulated 


with varying advantages and disadvantages. In Table 3.1 these parameters and their 
disposition are compiled for the acceleration principles discussed above. 

There are two particle parameters and three technical device parameters which 
define the mode of accelerator operation. Interestingly enough, most of the dis- 
cussed acceleration methods have their proper application and are used either as 
stand alone accelerators for research and technology or are part of an acceleration 
chain for high energy particle accelerators. 

For example, it makes no sense to construct a proton synchrotron, where the 
protons must be injected directly from the source at very low energies. The proper 
way is to first accelerate the protons with electro-static fields, for example in a 
Cockcroft-Walton accelerator followed by a medium energy linear accelerator (e.g. 
an Alvarez structure) to reach a high enough energy of a few hundred MeV for 
efficient injection into a synchrotron. Similarly, we accelerate an electron beam first 
in a linac or microtron before injection into a synchrotron or storage ring. 


Problems 


3.1 (S). Consider the Kerst betatron cycling at 60Hz. Electrons are injected at 
50 keV kinetic energy into this betatron. Calculate the magnetic field on the orbit 
at injection and the energy gain per turn for the first turn and at a time when the 
electron has gained 20 MeV. Discuss the reason for the difference in the energy gain 
per turn (use linear dependence of field, B + Bowt). 


3.2 (S). What is the total excitation current in each of the two coils for a betatron 
with an orbit radius of R = 0.4m, a maximum electron momentum of cp = 42 MeV 
and a gap of g = 10cm between the poles (hint: apply Ampére’s law). 


3.3 (S). Calculate the frequency variation required to accelerate protons or 
deuterons in a synchro cyclotron from a kinetic energy of Exing = 100keV to 
an end energy of Exin = 600MeV. Keep the magnetic field constant and ignore 
weak focussing. Derive formula for the rf frequency as a function of the kinetic 
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energy and generate a graph of fir/fizo VS. Exin from just a few points. How big 
would the frequency swing be for electrons ? 


3.4 (S). Calculate the electron beam current in the Kerst betatron that would 
produce a total synchrotron radiation power of 1 Watt at cp = 300 MeV. 


3.5 (S). Try to “design” a microtron for a maximum electron energy of E = 
25 MeV at a magnetic field of B = 2140G. (a) What is the diameter of the 
last circular trajectory at 25 MeV ? (b) Sketch a cross section of the magnet with 
excitation coils. Magnet poles must extend radially at least by 1.5 gap heights 
beyond the maximum orbit to obtain good field quality. Use a total coil cross section 
of 5cm*. Choose your own gap height. (c) What is the electrical power required to 
operate each coil assuming copper and a copper fill factor of 75 %. That means 
75 % of the coil cross section is copper and the rest is for insulation and cooling. Do 
you think your coil needs water cooling? (for simplicity assume the coil length to 
be equal to the length of the last trajectory plus 10 %.) (d) How does the electrical 
power requirement change if you change the number of turns in the coil thereby 
changing the electrical current. Keep in either case 75 % fill factor. 


3.6 (S). Consider the Fermilab 400 GeV/c synchrotron which has a circumference 
of 6,000 m. Protons are injected at 10 GeV/c momentum. Calculate the frequency 
swing necessary for synchronicity during the acceleration cycle. 
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Tools We Need 


Chapter 4 
Elements of Classical Mechanics* 


Based on d’Alembert’s principle, we formulate Hamilton’s integral principle by 
defining a function L = L(qj, q;, t) such that for any mechanical system the variation 
of the integral ce Ldt vanishes along any real path (Fig. 4.1) so that 


5 f Loa = 0. (4.1) 
to 


Here, the variables (q;, g;, t) are the coordinates and velocities, respectively, and 
t is the independent variable time. We may expand this function and get 


a OL aL 
Ldt = —6qid —6q,dt. 4.2 
af J Sagdnare fo goa a 


The second term can be modified using the assumption of the variational theorem 
which requires that 6g; = 0 at the beginning and end of the path. The second term 
can be integrated by parts and is 


OL oL d Ld 
—6q;dt = dqidt = es 


/ / : a ;dt (4.3) 
04: dq; dt Ogi at oo 


to dt Ogi 


=0 


Both terms can now be combined for 


1 OL sd OL 
sf Ldt = (p> (5 = am) dqidt = 0. (4.4) 
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Fig. 4.1 Variational principle L(t) 


L(tg) 


This integral is zero for any arbitrary path if and only if the integrand vanishes 
for each component ; independently. The resulting equations are called the Euler- 
Lagrange equations 


oe - ig =0. (4.5) 
dtdgi qi 
Bypassing a more accurate discussion [1], we guess at the nature of the Euler- 
Lagrange equations by considering a falling mass m. The kinetic energy is T = 
Smv> and the potential energy V = gx, where g is the gravitational force. If we 
stL=T-V= Smv> — gx and apply (4.5), we get mv = g which is the well 
known classical equation of motion for a falling mass in a gravitational field. The 
time independent Lagrangian can be defined by 


L=T-V (4.6) 


and the Lagrange function therefore has the dimension of an energy. Furthermore, 
in analogy with basic mechanics like a falling mass, we can define the momenta of 
a system by 


OL 
| eee (4.7) 
0gi 
and call them the generalized canonical momenta. We use a capital P for the 
canonical momentum to distinguish it from the ordinary momentum p. Both are 
different only when electromagnetic fields are involved. 
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4.1 How to Formulate a Lagrangian? 


To formulate an expression for the Lagrangian is a creative process of physics. 
Whatever expression one might propose, it should be independent of a particular 
reference system and therefore Lorentz invariant. Earlier, we have learned that the 
product of two 4-vectors is Lorentz invariant and the product of two, not necessarily 
different, 4-vectors is therefore a good choice to form a Lagrangian. We investi- 
gate, for example, the product of the momentum-energy (cp<,cp*,cp*,iE*) = 
(0, 0,0, imc) and the differential space-time 4-vectors (dx*,dy*,dz*,icdr) in the 
particle rest frame and get 


1 
— (dx*, dy*, dz*, icdr) (cp*, cp, cp® iE") = —mc*dt = —mc? 1 — Bde. 
é 
(4.8) 
This expression has the dimension of an energy and is Lorentz invariant. We 


consider therefore this as the Lagrangian for a particle at rest being observed from 
a relatively moving laboratory system 


L= —me?/1 — Bp. (4.9) 


The conjugate momentum is from (4.7) for the x-component 


=v. 


P, = —m——— = ymv,; (4.10) 
J1— fp? 
and the equation of motion oe a uh becomes 
oa (4.11) 
dt 


indicating that the particle is in uniform motion with velocity f. 

The Lagrangian (4.9) is consistent with classical experience if we set B < 1 
and L = —mc?/1— p? ~ —mc? + }mv?. Since we use only derivatives of the 
Lagrangian, we may ignore the constant —mc? and end up with the kinetic energy 
of the free particle. 


4.1.1 The Lagrangian for a Charged Particle in an EM-Field 


The interaction between charged particle and electromagnetic field depends only on 
the particle charge and velocity and on the field. We try therefore the product of field 
and velocity 4-vector. Formulating this product in the laboratory system, where the 
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fields have been generated, we get. 
e (Ax, Ay, Az, if) y (Ux, Uy, Uz,1) = ey (Av —@). (4.12) 


Noting that ydt =d?, the extension to the Lagrange function in the presence of 
electromagnetic fields is 


L=-—mc? /1— p? + eAv —e@. (4.13) 
The canonical momentum is from (4.7) 
mv 
P= —+ecA=ymv+eA=p+eA, (4.14) 


JF 


where p is the ordinary momentum. Equation (4.13) is consistent with L = T — V, 
where the potential V = ef — eAv. 


4.2 Lorentz Force 


The conjugate momenta in Cartesian coordinates r = (x, y, z) can be derived from 
(4.5) with (4.13) 


P= a = eV (Av) —eVd = e(vV)A +e [v x (Vx A)] —eV¢, (4.15) 


where we used the algebraic relation (A.18). Insertion into 


doL dP 4d 
ae a q 2 + eA) =e(vV)A+el[v x (V x A)| —eV¢d 
results with r= v and a = A + (vV)A in an expression for the ordinary 
momentum p 
d 0A 
ae = neg telex (V x A)] —e Vo. (4.16) 


Converting potentials to fields, we recover the Lorentz force FL, = oP or 


F, = cE +e(vxB). (4.17) 
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4.3 Frenet-Serret Coordinates 


A particle trajectory follows a path described by 
r(z) = ro(z) + dr(z). (4.18) 


Here ro(z) is the ideal path for beam dynamics and an orthogonal coordinate system 
moves along this path with its origin at r9(z) as shown in Fig. 4.2. For this Frenet- 
Serret coordinate system we define three vectors 


u,(Z) unit vector L to trajectory 
dro (z 
u.(z) = “° 


uy(z) = u,(z) X u,(z) unit binormal vector 


unit vector || to trajectory (4.19) 


to form an orthogonal coordinate system moving along the trajectory with a 
reference particle at ro(z). In beam dynamics, we identify the plane defined by 
vectors uw, and u,(z) as the horizontal plane and the plane orthogonal to it as the 
vertical plane, parallel to uy. Change in vectors are determined by curvatures. 


du,(z du, 
HO guy, and WO < wu.(o, (4.20) 
iz z 


where (ie Ky) are the curvatures in the horizontal and vertical plane respectively. 
The particle trajectory can now be described by 


r(x, y, Z) = ro(z) + x(z)ux(z) + y(z)uy(z), (4.21) 


individual particle trajectory 


ideal beam path 


Fig. 4.2 Frenet-Serret coordinate system 
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where ro(z) is the location of the coordinate system’s origin (reference particle) 
and (x, y) are the deviations of a particular particle from ro(z). The derivative with 
respect to zis then 


du. oe 7 of 


d 
qi: Z= we x(z) —— LQ + x'(z)ux(z) + y'(z)uy(z) (4.22) 


or with (4.19) and (4.20) 
dr = u,dx + uydy + u-hdz, (4.23) 


where 
h=1+ Koxx + Koyy. (4.24) 


Using these Frenet-Serret coordinates, we are able to describe particle trajectories 
much more efficient than we could do in Cartesian coordinates. Essentially, we 
have transformed away the ideal path or the geometry of the design beam transport 
line which is already well known to us from the placement of beam guidance 
elements. The new coordinates measure directly the deviation of any particles from 
the reference particle. 

We may use these relations to introduce a transformation, from the Cartesian 
coordinate system to curvilinear Frenet-Serret coordinates, in the Lagrangian 


L = -mce?,/1— fp? + erA — ed. In the new coordinates, /1—f2 = 
rT 1- 5 (4? + ¥? + h?27), TA = XA, + Ay + AZA, and the Lagrangian becomes in 
curvilinear coordinates of beam dynamics 


ne —me? 1-5 (P2432 + 2) + € (tA + 5Ay + AEA) —€f. (4.25) 


4.4 Hamiltonian Formulation 


Like any other mechanical system, particle beam dynamics in the presence of 
external electromagnetic fields can be described and studied very generally through 
the Hamiltonian formalism. The motion of particles in beam transport systems, 
expressed in normalized coordinates, is that of a harmonic oscillator and deviations 
caused by nonlinear restoring forces appear as perturbations of the harmonic 
oscillation. Such systems have been studied extensively in the past and powerful 
mathematical tools have been developed to describe the dynamics of harmonic 
oscillators under the influence of perturbations. Of special importance is the 
Hamiltonian formalism which we will apply to the dynamics of charged particles. 
Although this theory is well documented in many text books, for example in [1, 2], 
we will shortly recall the Hamiltonian theory with special attention to the application 
in charged particle dynamics. 
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The canonical variables in the Hamiltonian theory are the coordinates and 
momenta rather than coordinates and velocities used in the Lagrangian. We use 
a coordinate transformation (qj, gi,t) ==> (qi, P:,t) through the definition of the 
momenta P; = 0L/0q; and define the Hamiltonian function by 


HGP) = > Pi - LG, 4). (4.26) 


In analogy to the Lagrangian, we find that g;P; = 2T and the Hamiltonian which 
does not depend on the time explicitly is therefore the sum of kinetic and potential 
energy 


H=T+V. (4.27) 


This will become useful later since we often know forces acting on particles 
which can be derived from a potential. Similar to the Euler-Lagrange equations, we 
define Hamiltonian equations by 


oP ¢ = +4 (4.28) 
7 = fj, an = = ie i 
Ogi OP; 4 
With L = —mc*,/1— B? + eAv — e¢ and replacing velocities with momenta the 
Hamiltonian becomes 
H(qi,P;) = )_ GP: + mc? V1 — B? — Aq + ed, (4.29) 


where gq = (41,92,--,9i,-.) and A = (Aj,A2,..,Aj,..) , etc. and the canonical 
momentum is defined in (4.14). The canonical momentum P is from (4.14) the 
combination of the ordinary particle momentum p = ymg and field momentum 
eA. Insertion into the Hamiltonian and reordering gives (H — ep) = mc + 
C7 (P- eA) , or 


? (P— eA)’ — (H— eg) = —m’c4, (4.30) 
The Hamiltonian (4.30) is equal to the square of the length of the energy 


momentum 4-vector [cP,iE] , where E = H — ed, and is therefore Lorentz invariant. 
A more familiar form is 


H=eo+ ye (P — eA)? + m2ct. (4.31) 


In nonrelativistic mechanics, the Hamiltonian becomes with 6 < 1 and ignoring 
the constant mc? 


Aetass © 5m? + eg, (4.32) 


which is the sum of kinetic and potential energy. 
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4.4.1 Cyclic Variables 


The solution of the equations of motion become greatly simplified in cases, where 
the Hamiltonian does not depend on one or more of the coordinates or momenta. 
In this case one or more of the Hamiltonian equations (4.28) are zero and the 
corresponding conjugate variables are constants of motion. Of particular interest 
for particle dynamics or harmonic oscillators are the cases where the Hamiltonian 
does not depend on say the coordinate g; but only on the momenta P;. In this case 
we have 


H= A(q, «++ Gi-1;,qi41 «-- ,P,Po sietegeh ened) (4.33) 
and the first Hamiltonian equation becomes 


oH 


a —P; —0 or P; = const. (4.34) 
Ogi 


Coordinates q; which do not appear in the Hamiltonian are called cyclic coordinates 
and their conjugate momenta are constants of motion. From the second Hamiltonian 
equation we get with P; = const. 


oH 


— = qj = a4; = const, 
Opi 


which can be integrated immediately for 
qi(t) =ajt+ ci, (4.35) 


where c; is the integration constant. It is obvious that the complexity of a mechanical 
system can be greatly reduced if by a proper choice of canonical variables some or 
all dependence of the Hamiltonian on space coordinates can be eliminated. We will 
derive the formalism that allows the transformation of canonical coordinates into 
new ones, where some of them might be cyclic. 

Example: Assume that the Hamiltonian does not depend explicitly on the time, 


then 24 = 0 and the momentum conjugate to the time is a constant of motion. 


ot 
From the second Hamilton equation, we have ju = ty = | and the momentum 
conjugate to the time is therefore the total energy p; = H =const. The total energy 
of a system with a time independent Hamiltonian is constant and equal to the value 


of the Hamiltonian. 


4.4.2, Canonical Transformations 


For mechanical systems which allow in principle a formulation in terms of cyclic 
variables, we need to derive rules to transform one set of variables to another 
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set, while preserving their property of being conjugate variables appropriate to 
formulate the Hamiltonian for the system. In other words, the coordinate transfor- 
mation must preserve the variational principle (4.1). Such transformations are called 
canonical transformations q,; = f,(qi, Pi, t) and Py = gx(qi, Pi, t), where (qj, Pi, t) 
are the old and (, P;, t) the new coordinates. The variational principle reads now 


5 f (x un-1) dt=0 and 5 f (= iA) dt=0. (4.36) 
k k 


The new Hamiltonian H need not be the same as the old Hamiltonian H nor need 
both integrands be the same. Both integrands can differ, however, only by a total 
time derivative of an otherwise arbitrary function G 


3 — dG 
.P, —H = q.P, —-H+ —. 4.37 
2 k Xi k + 7 ( ) 


After integration aC dt becomes a constant and the variation of the integral 
obviously vanishes under the variational principle (Fig. 4.1). The arbitrary function 
G is called the generating function and may depend on some or all of the old and 
new variables 


G = G(dks Ge, Pe. Pe t) with O<k<N. (4.38) 


The generating functions are functions of only 2N variables, coordinates and 
momenta. Of the 4N variables only 2N are independent because of another 2N 
transformation equations (4.36). We may now choose any two of four variables to be 
independent keeping only in mind that one must be an old and one a new variable. 
Depending on our choice for the independent variables, the generating function may 
have one of four forms 


Gi= Gi(q, 4.0), G3= G3(P, 4.0), 


= S 4.39 
Go= Gr(q,P,1), Ga= GilP,P, 1), oe 


where we have set gq = (q1,q2,...qn) etc. We take, for example, the generating 
function Gj, insert the total time derivative 


0G, Ok 0G, OP, 0G, 
ss 4.4 
= jay a ar i et oy 


in (4.37) and get after some sorting 


Dan (m-Fe)- ma (e+ oe) G -H+ S) <0. (4.41) 
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Both, old and new variables are independent and the expressions in the brackets 
must therefore vanish separately leading to the defining equations 


dG, 7 dG; — 0G; 


P, = —., 
Odk gk 


Variables for which (4.42) hold are called canonical variables and the transforma- 
tions (4.36) are called canonical. 

Generating functions for other pairings of new and old canonical variables can 
be obtained from G; by Legendre transformations of the form 


Go(q, P,t) = Gi(q.g,t) + gP. (4.43) 


Equations (4.42) can be expressed in a general form for all four different 
types of generating functions. We write the general generating equation as G = 
G(xx, Xx, t), where the variables x, and x, can be either coordinates or momenta. 
Furthermore, x, and x, are the old and new coordinates or momenta respectively 
and the (yx,¥,) are the conjugate coordinates or momenta to (x, X,). Then 


0 2 

Ye = E—G(Xe. Xx, 1), 
OX, 

_ 0 _ 

y= F— G(x, Xx, t), (4.44) 
OX, 

H=H rer Xx, t) 

= at Xk» Xk, 0). 


The upper signs are to be used if the derivatives are taken with respect to coordinates 
and the lower signs if the derivatives are taken with respect to momenta. It is not 
obvious which type of generating function should be used for a particular problem. 
However, the objective of canonical transformations is to express the problem at 
hand in as many cyclic variables as possible. Any form of generating function 
that achieves this goal is therefore appropriate. To illustrate the use of generating 
functions for canonical transformations, we will discuss a few very examples. For 
an identity transformation we use a generating function of the form 


and get with (4.44) andi = 1,2,...N the identities 


P; — P,, and di => + = = di- (4.46a) 
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A transformation from rectangular (x,y,z) to cylindrical (r,g, z) coordinates is 
defined by the generating function 


G(P,q) = —P,rcosy — Pyrsing — P,z (4.47) 


and the transformation relations are 


= je = rcosg, P, =—% = + P,cosy + Pysing, 

ne : a ; 
y= ~iy =n, Py = “ie = —P, sing + P,cosg, (4.48) 
2 = ap, T= % fp =a, HTe 


Similarly, relations for the transformation from rectangular to polar coordinates 
can be derived from the generating function 


G = —P,rcosg sin} — Pyrsing sin’ — P-rcos 0. (4.49) 


It is not always obvious if a coordinate transformation is canonical. To identify a 
canonical transformation, we use Poisson brackets [1] defined by 


(7 gx Of “et ) 


4. 
Ogi OP; OP; Ogi ( 30) 


[fi(qi. Pj), &k(Qi: P;)| = = 


i 


It can be shown [1] that the new variables ¢;, P, or (4.36) are canonical if and only 
if the Poisson brackets 


[P.P1=0 [9.q])=9  [41, Pj] = Ab; (4.51) 


where 6; is the Kronecker symbol and the factor A is a scale factor for the 
transformation. To preserve the scale in phase space, the scale factor must be 
equal to unity, A = 1. While the formalism for canonical transformation is 
straight-forward, we do not get a hint as to the optimum set of variables for a 
particular mechanical system. In the next sections we will see, however, that specific 
transformations have been identified and developed which prove especially useful 
for a whole class of mechanical systems. 


4.4.3 Curvilinear Coordinates 


The choice of a particular coordinate system, of course, must not alter the physical 
result and from this point of view any coordinate system could be used. However, it 
soon becomes clear that the pursuit of physics solutions can be mathematically much 
easier in one coordinate system that in another. For systems which are symmetric 
about a point we would use polar coordinates, for systems which are symmetric 
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about a straight line we use cylindrical coordinates. In beam dynamics there is no 
such symmetry, but we have a series of magnets and other components aligned along 
some, not necessarily straight, line. The collection of these elements is what we call 
a beam line. The particular arrangement of elements is in most cases not determined 
by physics but other more practical considerations. The matter of fact is that we 
know about the “ideal” path and that all particle should travel along a path being 
defined by the physical centers of the beam line elements. In a Cartesian coordinate 
system fixed to the stars the result of “ideal” beam dynamics would be a complicated 
mathematical expression trying to describe the “ideal” path in which we have no 
interest, since we already know where it is. What we are interested in is the deviation 
a particular particle might have from the ideal path. The most appropriate coordinate 
system would therefore be one which moves along the ideal path. In Sect. 4.3 we 
have introduced such a curvilinear reference system also known as the Frenet-Serret 
reference system. The transformation from Cartesian to Frenet-Serret coordinates 
can be derived from the generating function formed from the old momenta and the 
new coordinates 


G(z, x,y, Pox, Pox, Poy) = — (cP, — ecAc) [ro(z) + xXU,(z) + yuy(z) | : (4.52) 
The momenta and fields in the old Cartesian coordinate system are designated with 


the index , and the new canonical momenta P in the Frenet-Serret system are then 
in both systems while noting that the transverse momenta are the same 


(cP, — ecA,h) = -< = (cP, — ecA,), h, 
Zz 
dG 
(cP, — ecAy) = = = (cP, — ecAx),, (4.53) 
x 
dG 
(cP, —ecAy) = a = (cP, = ety) 4 
y 


with h as defined in (4.24). The Hamiltonian H, = ef + cy mc? + (P — eA)? 
in Cartesian coordinates transforms to the one in curvilinear coordinates of beam 
dynamics 


P, — eA,h)? 


H=ep+ of m2c2 + ( 71 + (Py — eA,” + (Py — eAy) . (4.54) 


For a particle travelling through a uniform field B,, we have A = (0,0,A,) = 
(0, 0, —Byx) , Pry = pxy, and the Hamiltonian is with A, = Ach 


1 
Ay = ed + cy) meet ee Per Rp (P, + eByhx) (4.55) 
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The distinction, we make here on fields in curvilinear and Cartesian coordinates 
stems from the practice to build magnets in a certain way. Dipole magnets are 
designed carefully to have a uniform field in the beam area along the curved path, 
which is not consistent with the transformation of a uniform dipole field in Cartesian 
coordinates. 


4.4.4 Extended Hamiltonian 


The Hamiltonian as derived so far depends on the canonical variables (qj, P;) 
and the independent variable f or z defined for individual particles. This separate 
treatment of the independent variable can be eliminated by formulating an extended 
Hamiltonian in which all coordinates are treated the same. 

Starting with H(q),q2...q¢,P1, P2,P3...P.,t), we introduce the independent 
variables (go, Po) by setting 


qo=t and Poy = —H (4.56) 
and obtain a new Hamiltonian 
H(qo, 91.92--- 9%, Po, P1,P2,P3...P1) =H + Po =0 (4.57) 


and Hamilton’s equations are then 


dg; _ 9H. 
a. eee Tor PS, 1,200, (4.58) 
do gi 


In particular for i = 0 the equations are 


dqo 
Pat qo=tt+c (4.59) 


and 


= = Pyo=- 4. 
7 a0 7  e H+ C2 (4.60) 


The momentum conjugate to the time is equal to the Hamiltonian and since H 4 
H (t) for static fields, it follows that 


dPo 
ar =0 = H-=const. (4.61) 
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Now, the independent variable is no more distinguishable from all other coordi- 
nates, the Hamiltonian is expressed as a function of coordinates and momenta only. 


4.4.5 Change of Independent Variable 


Since no particular coordinate is designated as the independent variable, we may use 
any of the coordinates as that. For example, we prefer often to use the longitudinal 
coordinate z as the independent variable rather than the time ¢. More generally, 
consider to change the independent variable from q; to q; . Defining, for example, q3 
as the new independent variable, we solve H for P3 


P3 = —K(qo, 91,92; 9 o+ + Of, Po, P1, Po, P4, i . Pr) (4.62) 


and define a new extended Hamiltonian 


K=P3+K=0. (4.63) 
Then the equations 
OK dq3 
— ==], 4.64 
OP3 dq3 ( -) 
ok dP OK 
62 ee (4.64b) 
0q3 dq3 0q3 
0K dqi OK 
a (4.64c) 
OP i#3 dq3 OP ix3 
0K dP; OK 
” = 3 —_ (4.644) 
Ogi¢3 dP3 Ogi¢3 
with the Hamiltonian 
K=-p3. (4.65) 


As an example, to use the longitudinal coordinate z rather than the time ¢ as the 
independent variable, we start with (4.54) 


1 
H (x, y,z,t) = e¢ + _ (cP, — ecA-h) + py + mct, (4.66) 
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where p?, = p; + p;. The longitudinal momentum is 


cP, = ceA;h+ hla — ep) — (cp1)* — m2ct = ceA,h + hy/ 2p? — ee cae 


(4.67) 


where E? = (H — ep)” = (cp) + (me?)” has been used. We further normalize to 
the momentum p and use trajectory slopes, x’ = dx/dz = p,/p,; etc. rather than 


momenta. With this, the new Hamiltonian is K (x,x’, y, y’,z) = —P-/p or using 
P./p = eA-/p + hy/1 =p} /p? and p/p? = x? + y? 
eA-h 


— hf — x? — y?, (4.68) 


K(x,x',y, yz) =- 


In beam dynamics, we restrict ourselves to paraxial beams, where x’ < | and 
y < 1, and the momentum p ~ p-. Note, p may not be the canonical momentum 
if there is an electromagnetic field present, but P = p+ eA is canonical. In this last 
step, we seem to have lost terms involving transverse vector potential components. 
This meets with the requirements of almost all beam transport lines, where we 
use predominantly transverse fields which can be derived from the A,-component 
only. This is not true when we consider, for example, solenoid fields which occur 
rather seldom and will be treated separately as perturbations. Finally, we separate 
the ideal particle momentum po from the momentum deviation 6 = Ap/po and 
while ignoring higher order terms in 6 replace 1/p = 1/ [po (1 + 6)] © a (1 —6) 
in the Hamiltonian for 


Azh 
K(x.x.9.9.2) © ea eats ]— x? —y?, (4.69) 
0 


As discussed before, magnetic fields for particle beam dynamics can be derived 
from a single component A, of the vector potential and the task to determine 
equations of motion is now reduced to that of determining the vector potential for 
the magnets in use. The equations of motion are from (4.69) 


dK _ yw _ __ ec dAzh _ = =? — +2 
oh x am it (1-8) —kaV1—x? -y?, 
dK _— _.w _ _ ec dAzh _ _ ee AD oD, 
2 y ae (1 — 45) — koy f1 — x? — y?. 


With hB, = ae and hB, = _ the equations of motion become finally in 


paraxial approximation 


(4.70) 


x" + £<Byh (1 — 8) — kor =0, 


y" — £Byh (1-8) — key = 0. 


(4.71) 
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These are the equations of motion in curvilinear coordinates under the influence of 
the magnetic field (By, By). 


Problems 


4.1 (S). Show that the Hamiltonian transforms like H, = Ht, if the independent 
variable is changed from f to @. 


4.2 (S). Derive from the Lagrangian (4.25) the equation of motion. 


4.3. Show that the transformations [a.), c.) for upper signs, d.) for € = 0] are 
canonical and [b.), c.) for lower signs, d.) for ¢ 4 0 J are not: 


“ w= A P= b.) g=rcosy, p=rsiny 
q2 = X2 Pp2 = X2 


25 a =x, pi = x £ x2, 


: d.)gq= qe, p= poet 
q2 = X1 £ X2, pr = X2 


Show the formalism you use. 
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Chapter 5 
Particle Dynamics in Electro-Magnetic Fields 


The most obvious components of particle accelerators and beam transport systems 
are those that provide the beam guidance and focusing system. Whatever the 
application may be, a beam of charged particles is expected by design to follow 
closely a prescribed path along a desired beam transport line or along a closed orbit 
in case of circular accelerators. The forces required to bend and direct the charged 
particle beam or provide focusing to hold particles close to the ideal path are known 
as the Lorentz forces and are derived from electric and magnetic fields through the 
Lorentz equation. 


5.1 The Lorentz Force 


For a particle carrying a single basic unit of electrical charge the Lorentz force is 
F=eE+e[vxB], (5.1) 


where e is the basic unit of electrical charge [1]. 

The vectors EF and B are the electrical and magnetic field vectors, respectively, 
and v is the velocity vector of the particle. The evolution of particle trajectories 
under the influence of Lorentz forces is called beam dynamics or beam optics. 
The basic formulation of beam dynamics relies only on linear fields which are 
independent of or only linearly dependent on the distance of a particular particle 
from the ideal trajectory. The mathematical description of particle trajectories in the 
presence of only such linear fields is called linear beam dynamics. 

The Lorentz force has two components originating from either an electrical field 
FE or a magnetic field B. For relativistic particles (v + c) we find that the force from 
a magnetic field of 1 T, for example, is equivalent to that for an electrical field of 
300 MV/m. Since it is technically straight forward to generate magnetic fields of the 
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order of 1 T, but rather difficult to establish the equivalent electric fields of 3 MV/cm, 
it becomes apparent that most beam guidance and focusing elements for relativistic 
particle beams are based on magnetic fields. At low particle energies (v < c) this 
preference is less clear and justified since the effectiveness of magnetic fields to 
bend particles is reduced proportional to the particle velocity B = v/c. 


5.2 Fundamentals of Charged Particle Beam Optics 


Magnetic as well as electric fields can be produced in many ways and appear 
in general in arbitrary directions and varying strength at different locations. It 
is impossible to derive a general mathematical formula for the complete path of 
charged particles in an arbitrary field distribution. To design particle beam transport 
systems, we therefore adopt some organizing and simplifying requirements on the 
characteristics of electro-magnetic fields used. 

The general task in beam optics is to transport charged particles from point A to 
point B along a desired path. We call the collection of bending and focusing magnets 
installed along this ideal path the magnet lattice and the complete optical system 
including the bending and focusing parameters a beam transport system. Two 
general cases can be distinguished in beam transport systems. Systems that display 
neither symmetry nor periodicity and transport systems that include a symmetric or 
periodic array of magnets. Periodic or symmetric transport systems can be repeated 
an arbitrary number of times to produce longer transport lines. A specific periodic 
magnet lattice is obtained if the arrangement of bending magnets forms a closed 
loop. In our discussions of transverse beam dynamics, we will make no particular 
distinction between open beam transport lines and circular lattices except in such 
cases when we find the need to discuss special eigensolutions for closed periodic 
lattices. We will therefore use the terminology of beam transport systems when we 
discuss beam optics results applicable to both types of lattices and refer to circular 
accelerator lattices when we derive eigenfunctions characteristic only to periodic 
and closed magnet lattices. 


5.2.1 Particle Beam Guidance 


To guide a charged particle along a predefined path, magnetic fields are used which 
deflect particles as determined by the equilibrium of the centrifugal force and 
Lorentz force 


myv-k + e[v x B] = 0, (5.2) 


where & = (kx, Ky, 0) is the local curvature vector of the trajectory which is pointing 
in the direction of the centrifugalforce. 
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We assume in general that the magnetic field vector B is oriented normal to the 
velocity vector v. This means we restrict the treatment of linear beam dynamics 
to purely transverse fields. The restriction to purely transverse field components 
has no fundamental reason other than to simplify the formulation of particle beam 
dynamics. The dynamics of particle motion in longitudinal fields will be discussed 
in Chap. 9. As mentioned earlier, the transverse components of the particle velocities 
for relativistic beams are small compared to the particle velocity v,, (vy K vz, vy < 
Uz, Uz ®& vs). While we use a curvilinear coordinate system (x, y, z) following the 
ideal path, we sometimes need to follow a particular particle trajectory for which 
we use the coordinate s. With these assumptions, the bending radius for the particle 
trajectory in a magnetic field is from (5.2) with p = ymv 


ec 
Kyvy = Tap bs (5.3) 


and the angular frequency of revolution of a particle on a complete orbit normal to 
the field B is 


ec 
QO, => ea 


; (5.4) 


which is also called the cyclotron or Larmor frequency [2]. The sign in (5.3) has 
been chosen to meet the definition of curvature in analytical geometry where the 
curvature is negative if the tangent to the trajectory rotates counterclockwise. Often, 
the beam rigidity, defined as 


Bel = =, (5.5) 


is used to normalize the magnet strength. Using more practical units the expressions 
for the beam rigidity and bending radius become 


10 
and 
Lat) — ® — 9999 2D 
; ai) = Be 0.2998 FE GeV)’ (5.7) 


where we have dropped the sign for the bending radius. For relativistic particles this 
expression is further simplified since 8 ~ 1. The deflection angle in a magnetic 
field is 


ae (5.8) 
p 
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or for a uniform field like in a dipole magnet of arc length @,, the deflection angle is 
0 = Ly, /p. 

In this textbook, singly charged particles will be assumed unless otherwise noted. 
For multiply charged particles like ions, the electrical charge e in all equations must 
be replaced by e Z if, for example, ions of net charge Z are to be considered. Since it 
is also customary not to quote the total ion energy, but the energy per nucleon, (5.7) 
becomes for ions 


(m7!) = 0.29982 BO 


A BE (GeV/u) ’ ©.) 


> | 


where E is the total energy per nucleon and A the atomic mass. 

Beam guiding or bending magnets and focusing devices are the most obvious 
elements of a beam transport system and we will shortly discus such magnets in 
more detail. Later, in Chap. 6, we will introduce all multipole magnets in a more 
formal way. 


5.2.2 Particle Beam Focusing 


Similar to the properties of light rays, particle beams also have a tendency to spread 
out due to an inherent beam divergence. To keep the particle beam together and 
to generate specifically desired beam properties at selected points along the beam 
transport line, focusing devices are required. In photon optics that focusing is 
provided by glass lenses. The characteristic property of such focusing lenses is that 
a light ray is deflected by an angle proportional to the distance of the ray from the 
center of the lens (Fig. 5.1). With such a lens a beam of parallel rays can be focused 
to a point and the distance of this focal point from the lens is called the focal length. 


focal point 


! 


focusing lens focal length 


Fig. 5.1 Principle of focusing 
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Any magnetic field that deflects a particle by an angle proportional to its 
distance r from the axis of the focusing device will act in the same way as a glass 
lens does in the approximation of paraxial, geometric optics for visible light. If f is 
the focal length, the deflection angle a is defined from Fig. 5.1 by 


a=—-. (5.10) 


A similar focusing property can be provided for charged particle beams by the use 
of azimuthal magnetic fields B, with the property 


e c 
: ze ~gpPe! = Saat (5.11) 
where ¢ is the path length of the particle trajectory in the magnetic field By and g 
is the field gradient defined by B, = gr or by g = dB,/dr. Here we have assumed 
the length ¢ to be short compared to the focal length such that r does not change 
significantly within the magnetic field. If this is not allowable, the product B,¢ must 
be replaced by the integral { By do. 

To get the focusing property (5.10) we require a linear dependence on r of either 
the magnetic field B, or of the magnet length. We choose the magnetic field to 
increase linearly with the distance r from the axis of the focusing device while the 
magnet length remains constant. 

A magnetic field that provides the required focusing property of (5.11) can be 
found, for example, in a conductor carrying a uniform current density. Clearly, 
such a device does not seem very useful for particle beam focusing. To improve 
the “transparency” for particles, Panofsky and Baker [3] proposed to use a plasma 
lens “which contains a longitudinal arc of nearly uniform current density” and a 
similar device has been proposed in [4]. Still another variation of this concept is the 
idea to use an evenly distributed array of wires, called the wire lens [5], simulating 
a uniform longitudinal current distribution. The strength of such lenses, however, 
is not sufficient for focusing of high energy particles even if we ignore the obvious 
scattering problems. Both issues, however, become irrelevant, where focusing is 
required in combination with particle conversion targets. Here, for example, a 
Lithium cylinder, called a Lithium lens, carrying a large pulsed current can be used 
to focus positrons or antiprotons emerging from conversion targets [6, 7]. 

A different type of focusing device is the parabolic current sheet lens. In its 
simplest form, the current sheet lens is shown in Fig. 5.2. The rotational symmetric 
lens produces an azimuthal magnetic field which scales inversely proportional to r, 
By, ~ 1/r. Since the length of the lens scales like ¢ ~ r’, the deflection of a particle 
trajectory increases linear with r as desired for a focusing lens. 

The field strength depends on the particular parameter of the paraboloid used for 
the current sheet and the electrical current. The magnetic field is from Maxwell’s 
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Fig. 5.2 Parabolic current sheet lens (schematic) 


equation 


Mo I (A) 


By (T) = (5.12) 


2m r(m) 


and with € = ar’ the product of the field gradient g = 0B,/dr and the length ¢ is 
= Om) 
gh) > 58 (m~')7(A). (5.13) 
1 


The use of a parabolic shape for the current sheet is not fundamental. Any form 
with the property £ ~ r? will provide the desired focusing properties. A geometric 
variation of such a system is used in high energy physics to focus a high energy 
K-meson beam emerging from a target into the forward direction [8, 9]. Since the 
decaying kaon beam produces neutrinos among other particles this device is called 
a neutrino horn. On a much smaller scale compared to the neutrino horn a similar 
focusing devices can be used to focus positrons from a conversion target into the 
acceptance of a subsequent accelerator [10, 11]. 

This type of lens may be useful for specific applications but cannot be considered 
a general focusing device, where an aperture, free of absorbing material, is required 
to let particles pass without being scattered. The most suitable device that provides 
a material free aperture and the desired focusing field is called a quadrupole magnet. 
As will be discussed in Chap.6 the magnetic field can be derived in Cartesian 


coordinates from the scalar potential V = —gxy 
aV 
—=~—= B, = gy, (5.14) 
Ox 
oV 
——= By = gx. (5.15) 
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Fig. 5.3. Magnetic field | y 
pattern for a quadrupole 
magnet 


Such fields clearly deflect a particle trajectory proportional to its distance from 
the optical axis as we would expect for a focusing element. Magnetic equipotential 
surfaces with a hyperbolic profile will be suitable to create the desired fields. The 
field pattern of a quadrupole magnet is shown schematically in Fig. 5.3 

In beam dynamics, it is customary to define an energy independent focusing 
strength. Similar to the definition of the bending curvature in (5.3) we define a 
focusing strength k by 


€ 
p BE 


and the focal length of the magnetic device is from (5.11) 
f=, (5.17) 
In more practical units, the focusing strength is given in analogy to (5.7) by 


g (T/m) 


k (m7?) = 0.2998 AEG. 


(5.18) 


Multiplication with Z/A gives the focusing strength for ions of charge multiplicity 
Z and atomic weight A. Consistent with the sign convention of the Frenet-Serret 
coordinate system, the field directions are chosen such that a positively charged 
particle like a proton or positron moving at a distance x > 0 parallel to the z-axis is 
deflected toward the center (focusing), while the same particle with a vertical offset 
from the z-axis (y > 0) becomes deflected upward (defocusing). 

Quadrupole magnets are focusing only in one plane and defocusing in the other. 
This property is a result of Maxwell’s equations but does not diminish the usefulness 
of quadrupole magnets as focusing elements. A combination of quadrupoles can 
become a system that is focusing in both planes of a Cartesian coordinate system. 
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From paraxial light optics it is known that the total focal length of a combination of 
two lenses with focal lengths f; and /) and separated by a distance d is given by 


1 1 1 d 
=—+—- . 

f fi fh fih 
A specific solution is f, = —f2 and a quadrupole doublet with this property is 
focusing in both the horizontal and vertical plane with equal focal length 1/f = 
d/|fif2|. Equation (5.19) allows many other solutions different from the simple 
assumption made here. The fundamental observation is here that there exist indeed 


combinations of focusing and defocusing quadrupoles which can be made focusing 
in both planes and are therefore useful for charged particle beam focusing. 


(5.19) 


5.3. Equation of Motion 


We use magnetic fields to guide charged particles along a prescribed path or at least 
keep them close by. This path, or reference trajectory, is defined geometrically by 
straight sections and bending magnets only. In fact it is mostly other considerations, 
like the need to transport from an arbitrary point A to point B in the presence of 
building constraints, that determine a particular path geometry. We place dipole 
magnets wherever this path needs to be deflected and have straight sections in 
between. Quadrupole and higher order magnets do not influence this path but 
provide the focusing forces necessary to keep all particles close to the reference 
path. 

The most convenient coordinate system to describe particle motion is the Frenet- 
Serret system that follows with the particle along the reference path. In other words, 
we use a curvilinear coordinate system as defined mathematically by (4.19). The 
curvatures are functions of the coordinate z and are nonzero only where there are 
bending magnets. In deriving the equations of motion, we limit ourselves to the 
horizontal plane only. The generalization to both horizontal and vertical plane is 
straightforward. We calculate the deflection angle of an arbitrary trajectory for an 
infinitesimal segment of a bending magnet with respect to the ideal trajectory. Using 
the notation of Fig.5.4 the deflection angle of the ideal path is dgo = dz/po or 
utilizing the curvature to preserve the directionality of the deflection 


dg = +Kko dz, (5.20) 


where ko is the curvature of the ideal path. The deflection angle for an arbitrary 
trajectory is then given by 


dy = +k ds. (5.21) 
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Fig. 5.4 Particle trajectories 
in deflecting systems. 
Reference path z and 
individual particle trajectory s 
have in general different 
bending radii 


individual 
particle trajectory 


reference 
path 


The ideal curvature kp is evaluated along the reference trajectory u = 0 for 
a particle with the ideal momentum. In linear approximation with respect to the 
coordinates the path length element for an arbitrary trajectory is 


ds = (1 + ko u) dz + O(2), (5.22) 


where u = x or y is the distance of the particle trajectory from the reference 
trajectory in the deflecting plane. 

The magnetic fields depend on z in such a way that the fields are zero in magnet 
free sections and assume a constant value within the magnets. This assumption 
results in a step function distribution of the magnetic fields and is referred to as the 
hard edge model, generally used in beam dynamics. The path is therefore composed 
of a series of segments with constant curvatures. To obtain the equations of motion 
with respect to the ideal path we subtract from the curvature « for an individual 
particle the curvature ko of the ideal path at the same location. 

Since u is the deviation of a particle from the ideal path, we get for the equation 
of motion in the deflecting plane with respect to the ideal path from Fig.5.4 and 
(5.20), (5.21) with u” = —(dg/dz— dgo/dz), 


u’ = —(1+ Kou)k + ko, (5.23) 


where the derivations are taken with respect to z. In particle beam dynamics, we 
generally assume paraxial beams, wv? < 1 since the divergence of the trajectories 
u’ is typically of the order of 10~? rad or less and terms in w? can therefore be 
neglected. Where this assumption leads to intolerable inaccuracies the equation of 
motion must be modified accordingly. 

The equation of motion for charged particles in electromagnetic fields can be 
derived from (5.23) and the Lorentz force. In case of horizontal deflection, the 
curvature is K = kK, and expressing the general field by its components, we have 
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from (5.3) 


ee koe + Lime 5.24) 
ke = Tyg (koe + x + 5mx +...), (5. 
where we expanded the field into components up to second order. Such magnetic 
field expansions will be discussed in much detail in Chap. 6. Here, we use just the 
three lowest order multipoles, a bending magnet, a quadrupole and a sextupole. 

A real particle beam is never monochromatic and therefore effects due to small 
momentum errors must be considered. This can be done by expanding the particle 
momentum in the vicinity of the ideal momentum po 


1 1 1 
= w —(1=—5 +...). 5.25) 
Pp po(l+4) a ) ¢ 


We are now ready to apply (5.23) to the horizontal plane, set u = x and k = k, and 
get with (5.23), (5.24), while retaining only linear and quadratic terms in 6,x and 
y, the equation of motion 


x" + (k + KG,)x = Kox(8 — 8°) + (k + kG) x8 


— tmx’ — Kokx” + O(3). (5.26) 


Here, we have used energy independent field strength parameters as defined in (5.3) 
and (5.16). 

It is interesting to identify each term with well known observations and terminol- 
ogy from geometric light optics. The (k + «j,)x-term describes the focusing effects 
from quadrupoles and a pure geometrical focusing from bending in a sector magnet. 
Sector magnets are the natural bending magnets for a curvilinear coordinate system. 
However, in a uniform field sector magnet particles travel longer path for x > 0 
and a shorter path for x < 0 leading directly to a focusing effect in the deflecting 
plane. In the nondeflecting plane there is no focusing. A dispersive effect arises 
from ko,(5 — 6”) which reflects the varying deflection angle for particles which 
do not have the ideal design energy. Focusing is also energy dependent and the 
term (k + Koy) x5 gives rise to chromatic aberrations describing imaging errors due 
to energy deviation. The term —kko,x? has no optical equivalent (it would be a 
focusing prism) and must be included only if there is focusing and bending present 
in the same magnet like in a synchrotron magnet. The last term we care about 
here is the sextupole term —tmx? which introduces both chromatic and geometric 
aberration. The chromatic aberration from sextupoles can be used to cancel some 
of the chromatic aberration (chromaticity) from quadrupoles, but in doing so we 
introduce a quadratic effect which leads to geometric aberrations. This is similar to 
the chromatic correction in optical systems by using different kinds of glasses. We 
will discuss these perturbatory effects in much more detail later. 
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The equation of motion in the vertical plane can be derived in a similar way by 
setting u = y in (5.23) and « = xy. Consistent with the sign convention of the 
Frenet-Serret coordinate system (5.24) becomes for the vertical plane 


Ky = Koy + ky + mxy + ...O(3) (5.27) 
and the equation of motion in the vertical plane is 
y” —(k- Koy)Y = Koyd — (k- Koy)y8 + mxy + Koyky? + O(3). (5.28) 


Of course, in most cases Koy = 0. In particular, we find for cases, where the 
deflection occurs only in one plane say the horizontal plane, that the equation of 
motion in the vertical plane becomes simply 


y" — ky = —kyé + mxy + O(3), (5.29) 


which to the order of approximation considered is independent of the strength of the 
horizontal bending field. 

The magnet parameters ko,k, and m are functions of the independent coor- 
dinate z. In real beam transport lines, these magnet strength parameters assume 
constant, non zero values within individual magnets and become zero in drift spaces 
between the magnets. The task of beam dynamics is to distribute magnets along the 
beam transport line in such a way that the solutions to the equations of motion result 
in the desired beam characteristics. 


5.4 Equations of Motion from the Lagrangian 
and Hamiltonian* 


In this section, we will formulate the Lagrangian and Hamiltonian suitable for 
the study of particle beam dynamics. Specifically, we will work in the curvilinear 
coordinate system and use the longitudinal coordinate z as the independent variable 
rather than the time ¢. This is of particular importance because the time is measured 
along each particular trajectory and is therefore evolving differently for each particle 
in relation to the z-coordinate. The time is related to the particle position s = vt 
along its trajectory and through its velocity while the z-coordinate can function as a 
general reference for all particles. 

We will study both the Lagrangian and Hamiltonian formulation together to 
clearly define canonical momenta and facilitate the study of particle dynamics with 
the support of the full Hamiltonian theory. Depending on the problem at hand, it 
may be easier to start with one or the other formulation. 
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5.4.1 Equations of Motion from Lagrangian 


In Chap. 4 we have derived the Lagrangian (4.25) in the curvilinear coordinate 
system of beam dynamics 


L=—me yc? — 32 — 3? — 12 + e (4Ay + JAy + h2Acz) — eg, (5.30) 


which controls the movement of charged particles in an electromagnetic field. The 
magnetic fields can be derived from the potentials by 


B=VxA (5.31) 
Lf a(hacs) Ay] LP AAs AhAcs)] | [AAy _ AAs 
= _— — |X => — Z, 
h dy dz h| dz Ox “ Ox dy 


where h = | + k,x + kyy, while the electric fields are E = —VV. The equations of 
motion are the Lagrangian equations and are in component form 


d 
7 (ymx) = ymhk, 2 + e (vB, —hzBy) + eEy, (5.32a) 
d 2 2 a . 
FP (ymy) = ymhky Z +e (—xB, + hzB,) + eEy, (5.32b) 
d ee , 
tr (ymhz) = —-ym (kK x + Ky y) zte (xB, — y By) + eE., (5.32c) 


where f = 1/2 + 3? + h?z and y the relativistic factor. The first two equations 
describe the transverse particle motion which we will later call betatron motion or 
betatron oscillations. The third equation describes the longitudinal or synchrotron 
oscillation, where the main restoring force comes from the accelerating microwave 
field eE,. 

It is customary to replace the time variable by the position variable z along the 
ideal path. Each particle travels along its own path s at a velocity v =ds/dt and we 


change the independent variable with the substitution 


d d dad ud 


= =v = F 
dt ds ds dz s’ dz 


(5.33) 


where the quantity 
f= fx? +y?2 +P. (5.34) 
The primes are used to indicate a derivation with respect to z like s’ = ds/dz. The 


Lagrangian with z as the independent variable rather than ¢ can be derived from 
(5.30) with (5.33) to give with the momentum deviation 6 = (p — po) /po from the 
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ideal momentum po 


= e é 
La! y.ys2) = 8 + (= 8) Ar + Y'Ay + hes) _ / (5.35) 


Applying this to (5.32), the equations of motion are with p = myv 


I 
Ey 
vs ed =(jaii= 5) (nBy — y'B.) + 8? mit? (5.36a) 
0 
Ey 
x= y = kyh+ (1—68) on (nB, — x'B.) +s” ae (5.36b) 
0 
is” 1 
gos [Kix + Ky +2 (ax + Gy/)] (5.36c) 
eas E, 
= 22 (xB, — y'B,) _ ek, 
h po ; ymv 


So far, no approximations were made and the equations of motion are fully 
Hamiltonian or symplectic. Equations (5.36), however, are not suited for analytical 
treatment and we use therefore often the paraxial approximation also known from 
geometric light optics where particle trajectories are assumed to stay in the vicinity 
of the optical path keeping all slopes small (x’ < 1,y < 1,s’ = 1). Equation in 
(5.36c) describes again synchrotron motion and degenerates in the case where there 
are no electric fields to an equation that can be used to replace the factor s”/s’ in the 
betatron equations. Since s’* ~ 1 for paraxial beams and terms like («‘, xi) vanish 
in this approximation, we have s”/s’ ~ 0 and (5.36) becomes 


E, 
x = Kh— (1-8) 7 (hB, — y'B.) + ma? (5.37a) 
0 
F e ; eEy 
yl = kyh + (18) (By — xB.) + TS. (5.37b) 
0 


Of course, strictly speaking, these equations are not anymore symplectic, which 
is of no practical consequence as far as beam optics goes. Yet, in modern circular 
accelerators, particle beam stability can often be assured only by numerical tracking 
calculations. This process applies the equations of motion very often and even small 
approximations or deviations from symplecticity can introduce false dissipating 
forces leading to erroneous results. 
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5.4.2. Canonical Momenta 


The Lagrangian (5.35) defines the canonical momenta by derivation with respect to 
velocities 


aL os’ ep e 
P= = 1—- 1 —6) —A, ; 
“ Ox’ Ox! ( “ty a Po eee) 
x 
— (1- 2 )+ ee 
s! ymv2 Po 
aL os’ ed e 
Py = = 1- 1—6) —A, 38b 
yay ay ( = +(1=¢) a” (5.38b) 
p 
- *(1- id )+0-5 <a, 
s! ymv2 Po - 


Note, in this formulation, the canonical momenta are dimensionless because they 
are normalized to the total momentum p. 


5.4.3 Equation of Motion from Hamiltonian 


Knowledge of the Lagrangian and canonical momenta gives us the means to 
formulate the Hamiltonian of the system. In doing so, we use conjugate coordinates 


(q;, P;) only, ignore the electric field and get from (5.38) x = (2 _ £A,) s’, ete. 
and the Hamiltonian H = H(x, Px, y, Py, z) is by definition with (5.35) 


H=xXP,+y'Py—-L (ee x,y, y,z) (5.39) 


Any? Ag\? 
= —“Ah—s 1-(P,-5 ‘) - (r,- =) 
P P ‘2? 


2 


2 
From (5.34) and (5.38), we have s? = 9? ( — £Ax) ez (Py - £Ay) +h? or 


2 2 
(h/s’ iy =1- (Py _ £A,) = (P, - £A,) and introducing this in the Hamiltonian, 
we get finally 


e eA, : eAy 
H(x, Py, y, Py, z) = —-A;h— hy} 1— | Py —|{P,-—-—], (5.40) 
Pp Dp : Pp 


where for practical applications, we set e/p ~ (1 —46)e/po. We may restrict 


ourselves further to paraxial beams for which ( P,,) — pAxy < 1 allowing to 
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expand the square root and the Hamiltonian is in lowest order 
e 
Hw —(1—6)—A,h—-h 
Po 


é. 


Ax]? eAy |? 
+4[P-0-8 ‘| +45[7,-c-3 . (5.41) 
Po . Po 


Replacing in (5.40) the normalized canonical momenta (Po, Pr.) by normalized 
ordinary momenta (Px; Py) and setting py = x and py = y’, the Hamiltonian 
assumes a more familiar form 


K(x, x.y.y.2) & — Ach (1-6) -hV1—x?2-y?, (5.42) 


where the momenta p,.y or (x’, y’) in the presence of fields are not canonical anymore 
and where second order terms in 6 are dropped. As we will see, however, beam 
dynamics is based predominantly on fields which can be derived from a potential 
of the form A(0,0,A,) and consequently, the ordinary momenta are indeed also 
canonical. We seem to have made a total circle coming from velocities (x,y) to 
slopes (x’, y’) in the Lagrangian to normalized canonical momenta (Px. Py) back to 
slopes (x’, y.) which we know now to be canonical momenta for most of the fields 
used in beam dynamics. 

The equations of motion can now be derived from the Hamiltonian (5.42) in 
curvilinear coordinates. 


ey ee (5.43) 


where P, = x’ — pat and P’. = x”. The magnetic field hB, = (% _ as) does not 
depend on z, e.g. dA,/dz = 0. While ignoring any coupling into the vertical plane 
(y = 0), the equation of motion (5.43) is, 


aa = ~£ 0-9 9% «fF ¥, (5.44) 
Po x 


or with k. # 0, Ky, = 0,4 = 1 4+ k,x and expanding only to second order in 
x,xX,y,y,5 


x” — =1(1=6)hk 1—x?—y? (5.45) 


2 


= (1—8)h+ K,(1— 4x? — ty”) 


2 


—5 + 46- (1-8) Sex te +0 (3). 
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The general curvature « can be expanded into, for example, a dipole «,, a quadrupole 
kx and a sextupole field Sx for ; = Kytkx+ 5mx?+ O(3) resulting in the equation 
of motion 

x’ = —K, —kx- Sx + kyS + kd — K2x + «2x8 — kkyx? + ky + O (3), or 


x" + (k+ 42) x = «5 + (K+ 2) x8 — bmx? — kx? + O (3) (5.46) 
in agreement with (5.26). Similarly, we may derive the equation of motion for the 
vertical plane and get with = — os = —Ky + ky + mxy + O(3) 


y" — (k—) y = 8 — (k— 5) yb + mxy + kigy’ + O (3) (5.47) 


in agreement with (5.28). 


5.4.4 Harmonic Oscillator 


Particle dynamics will be based greatly on the understanding of harmonic oscillators 
under the influence of perturbations. We therefore discuss here the Hamiltonian for 
a harmonic oscillator. To do that, we start from (5.42), eliminate the magnetic field 
A, = 0, ignore the curvature (1 = 1) and remember that we have to reintroduce the 
potential by a function V. Furthermore, we use the time t = z/c as the independent 
variable again. With this, we derive from (5.42) the Hamiltonian 


K(x, ¥,2 % —V-vV1-2x2 = —V— (1 - 5x7) : (5.48) 


The potential for a harmonic oscillator derives from a restoring force —Dx and is 
—35Dx°. A new Hamiltonian is then 


K = tx? + SD (5.49) 
and the equations of motion are 


aK 


De = —x" = Dx, (5.50) 
1X 
OK 


The Hamiltonian could have been formulated directly considering that it is equal 
to the sum of kinetic T and potential V energy K = T + V. 
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5.4.5 Action-Angle Variables 


Particularly important for particle beam dynamics is the canonical transformation 
from Cartesian coordinates (w, w, g) to action-angle variables (J, yy, y). This class 
of transformations is best suited for harmonic oscillators like charged particles 
under the influence of focusing restoring forces. We assume the equations of 
motion to be expressed in normalized coordinates of particle beam dynamics 
with the independent variable g instead of the time. As we will discuss later, 
it is necessary in beam dynamics to transform ordinary Cartesian coordinates 
(x,x’,z) into normalized coordinates (w,w,g). The generating function for the 
transformation to action-angle variables (J, y, g) is of the form G, in (4.39) which 
can be written with some convenient constant factors as 


G = —}vw’ tan(y — 9), (5.52) 


where ? is an arbitrary phase. Applying (4.44) to the generating function (5.52) we 
get with w =dw/dg 


dG 


— =w=-vwtan(y — 8), (5.53a) 
ow 
dG 1 vw 


Solving for w and w the equations take the form 


w= [cosy —?), (5.54a) 


w= —VJ2vJ sin(y — 8). (5.54b) 


To determine whether the transformation to action-angle variables has led us 
to cyclic variables we will use the unperturbed Hamiltonian, while ignoring per- 
turbations, and substitute the old variables by new ones through the transformations 
(5.54). The generating function (5.52) does not explicitly depend on the independent 
variable g and the new Hamiltonian is therefore given by 


H=vJ. (5.55) 


The independent variable y is obviously cyclic and from 0H/dv = 0 = J we 
find the first invariant or constant of motion 


J = const. (5.56) 
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The second Hamiltonian equation 


oH : (5.57) 
—=.-|s =v fs 
oJ 
defines the frequency of the oscillator which is a constant of motion since the action 
J is invariant. The betatron frequency or tune 


v = vo = const, (5.58) 


and the angle variable y is the betatron phase. Eliminating the betatron phase w 
from (5.54), we obtain an expression of the action in normalized coordinates 


P= logy os (5.59) 


Both terms on the r.h.s. can be associated with the potential and kinetic energy 
of the oscillator, respectively, and the constancy of the action J is synonymous with 
the constancy of the total energy of the oscillator. 


5.5 Solutions of the Linear Equations of Motion 


Equations (5.26), (5.28) are the equations of motion for strong focusing beam 
transport systems [12, 13], where the magnitude of the focusing strength is a free 
parameter. No general analytical solutions are available for arbitrary distributions 
of magnets. We will, however, develop mathematical tools which make use of 
partial solutions to the differential equations, of perturbation methods and of 
particular design concepts for magnets to arrive at an accurate prediction of particle 
trajectories. One of the most important “tools” in the mathematical formulation of a 
solution to the equations of motion is the ability of magnet builders and alignment 
specialists to build magnets with almost ideal field properties and to place them 
precisely along a predefined ideal path. In addition, the capability to produce almost 
monochromatic particle beams is of great importance for the determination of the 
properties of particle beams. As a consequence, all terms on the right-hand side 
of (5.26), (5.28) can and will be treated as small perturbations and mathematical 
perturbation methods can be employed to describe the effects of these perturbations 
on particle motion. 

We further notice that the left-hand side of the equations of motion resembles 
that of a harmonic oscillator although with a time dependent frequency. By a proper 
transformation of the variables we can, however, express (5.26), (5.28) exactly in 
the form of the equation for a harmonic oscillator with constant frequency. This 
transformation is very important because it allows us to describe the particle motion 
mostly as that of a harmonic oscillator under the influence of weak perturbation 


5.5 Solutions of the Linear Equations of Motion 117 


terms on the right-hand side. A large number of mathematical tools developed 
to describe the dynamics of harmonic oscillators become therefore available for 
charged particle beam dynamics. 


5.5.1 Linear Unperturbed Equation of Motion 


In our attempt to solve the equations of motion (5.26), (5.28), we first try to solve 
the homogeneous differential equation 


u'+Ku=0, (5.60) 


where u stands for x or y and where, for the moment, we assume K to be constant 
with K =k+x«2 or K = —(k—- k2), respectively. The principal solutions of this 
differential equation are for K > 0 


C(z) = cos (Vz) and S(z) = = sin (Vz) ; (5.61) 


and for K < 0 
1 
VIK| 


These linearly independent solutions satisfy the following initial conditions 


C() =cosh(iKiz) and S() = sinh ( VIK{z) (5.62) 


c(0) = 1, C’(0) = dC/dz = 0, (5.63) 

S(0) =0, S’(0) = dS/dz = 1. , 
Any arbitrary solution u(z) can be expressed as a linear combination of these two 
principal solutions 


u(z) = C(z)uo + S(z)uy, (5.64) 
ul (z) = C'(z)uo + S'(z)up, 


where uo, uy are arbitrary initial parameters of the particle trajectory and derivatives 
are taken with respect to the independent variable z. 

In a general beam transport system, however, we cannot assume that the magnet 
strength parameter K remains constant and alternative methods of finding a solution 
for the particle trajectories must be developed. Nonetheless it has become customary 
to formulate the general solutions for K = K(z) similar to the principal solutions 
found for a harmonic oscillator with a constant restoring force. Specifically, 
solutions can be found for any arbitrary beam transport line which satisfy the 
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initial conditions (5.63). These principal solutions are the so-called sine like and 
cosine like solutions and we will derive the conditions for such solutions. For the 
differential equation 


u’ + K(z)u=0 (5.65) 


with a time dependent restoring force K(z), we make an ansatz for the general 
solutions in the form (5.64). Introducing the ansatz (5.64) into (5.65) we get after 
some sorting 


[S”(z) + K(z)S(z)]uo + [C’(z) + K(z)C(z)]up = 0. 


This equation must be true for any pair of initial conditions (uo, uj) and therefore 
the coefficients must vanish separately 


et) + K(z)C(z) = 0, (5.66) 

S"(z) + K(z)S(z) = 0. 
The general solution of the equation of motion (5.65) can be expressed by a linear 
combination of a pair of solutions satisfying the differential equations (5.66) and the 
boundary conditions (5.63). 

It is impossible to solve (5.66) analytically in a general way that would be correct 
for arbitrary distributions of quadrupoles K(z). Purely numerical methods to solve 
the differential equations (5.66) maybe practical but are conceptually unsatisfactory 
since this method reveals little about characteristic properties of beam transport 
systems. It is therefore not surprising that other more revealing and practical 
methods have been developed to solve the beam dynamics of charged particle beam 
transport systems. 


5.5.2. Matrix Formulation 


The solution (5.64) of the equation of motion (5.65) may be expressed in matrix 


formulation 
u(z) | _ | C(z) S(z) uo 
ba 7 Ee zo | om 


If we calculate the principal solutions of (5.65) for individual magnets only, we 
obtain such a transformation matrix for each individual element of the beam 
transport system. Noting that within each of the beam line elements, whether it 
be a drift space or a magnet, the restoring forces are indeed constant, we may 
use within each single beam line element the simple solutions (5.61) or (5.62) for 
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the equation of motion (5.65). With these solutions, we are immediately ready to 
form transformation matrices for each beam line element. In matrix formalism, we 
are able to follow a particle trajectory along a complicated beam line by repeated 
matrix multiplications from element to element. This procedure is widely used 
in accelerator physics and lends itself particularly effective for applications in 
computer programs. With this method we have completely eliminated the need to 
solve the differential equation (5.65), which we could not have succeeded in doing 
anyway without applying numerical methods. The simple solutions (5.61), (5.62) 
will suffice to treat most every beam transport problem. 


5.5.3 Wronskian 


The transformation matrix just derived has special properties well-known from the 
theory of linear homogeneous differential equation of second order [14]. Only a few 
properties relevant to beam dynamics shall be repeated here. We consider the linear 
homogeneous differential equation of second order 


ul" + v(z)u’ + w(z)u = 0. (5.68) 


For such an equation, the theory of linear differential equations provides us with a 
set of theorems describing the properties of the solutions 


e there is only one solution that meets the initial conditions u(z) = uo and 
u' (Zo) = Ug atz = 20, 

¢ because of the linearity of the differential equation, c u(z) is also a solution if 
both u(z) is a solution and if c = const. , 

¢ if u,(z) and u2(z) are two solutions, any linear combination thereof is also a 
solution. 


The two linearly independent solutions u;(z) and u2(z) can be used to form the 


Wronskian determinant or short the Wronskian 


u1(Z) u2(z) 
ui (Z) u),(z) 


= Uj Ul _ U2). (5.69) 


This Wronskian has remarkable properties which become of great fundamental 
importance in beam dynamics. Both uw; and up are solutions of (5.68). Multiplying 
and combining both equations like 


uf + v(z)u, + w(zjuy = 0 |-—u2 
uy + v(z)u, + w(z)u2 = 0 | om 
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gives 

(ujus — uguy) + v(z)(uyu5 — zu) = 0, 
which will allow us to derive a single differential equation for the Wronskian. 


Making use of (5.69) and forming the derivative dW/dz = ujuy — u2u', we obtain 
the differential equation 


sill + v(z)W(z) = 0, (5.70) 
dz 


which can be integrated immediately to give 
W(2) = Woe 0°, (5.71) 


In the case of linear beam dynamics, we have v(z) = 0 as long as we do 
not include dissipating forces like acceleration or energy losses into synchrotron 
radiation and therefore W(z) = Wo = const. We use the sine and cosine like 
solutions as the two independent solutions and get from (5.69) with (5.63) 


Wo = CoS — CoSo = 1. (5.72) 


For the transformation matrix of an arbitrary beam transport line with negligible 
dissipating forces, we finally get the general result 


C(z) S(z) 


cose ™ (5.73) 


W(z) = | 


This result will be used repeatedly to prove useful general characteristics of 
particle beam optics, in particular, this is another formulation of Liouville’s theorem 
stating that the phase space density under these conditions is preserved. From the 
generality of the derivation, we conclude that the Wronskian is equal to unity, or 
phase space preserving, for any arbitrary beam line that is described by (5.68) if 
v(z) = O and w(z) = K(z). 


5.5.4 Perturbation Terms 


The principal solutions of the homogeneous differential equation give us the basic 
solutions in beam dynamics. We will, however, repeatedly have the need to evaluate 
the impact of perturbations on basic particle motion. These perturbations are 
effected by any number of terms on the r.h.s. of the equations of motion (5.26), 
(5.28). The principal solutions of the homogeneous equation of motion can be used 
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to find particular solutions P(z) for inhomogeneous differential equations including 
perturbations of the form 


P"(z) + K(z)P(z) = pz), (5.74) 


where p(z) stands for any one or more perturbation terms in (5.26), (5.28). For 
simplicity, only the z-dependence is indicated in the perturbation term although in 
general they also depend on the transverse particle coordinates. A solution P(z) of 
this equation can be found from 


P() = / “POC Dae (5.75) 


where G(z,Z) is a Green’s function which can be constructed from the principal 
solutions of the homogeneous equation 


G(z, Z) = S(z)C(Z) — C(z)S(Z). (5.76) 


After insertion into (5.75) a particular solution for the perturbation can be found 
from 


P(z) = S(z) [ racwee — C(z) [ rasan (5.77) 


The general solution of the equations of motion (5.26), (5.28) then is given by the 
combination of the two principal solutions of the homogenous part of the differential 
equation and a particular solution for the inhomogeneous differential equation 


u(z) = aC,(z) + bS,(z) + 6P. (2), (5.78) 


where the coefficients a and b are arbitrary constants to be determined by the initial 
parameters of the trajectory. We have also used the index , to indicate that these 
functions must be defined separately for u = x and y. 

Because of the linearity of the differential equation we find a simple superposi- 
tion of the general solutions of the homogeneous equation and a particular solution 
for the inhomogeneous equations for any number of small perturbations. This is an 
important feature of particle beam dynamics since it allows us to solve the equation 
of motion up to the precision required by a particular application. While the basic 
solutions are very simple, corrections can be calculated for each perturbation term 
separately and applied as necessary. However, these statements, true in general, 
must be used carefully. In special circumstances even small perturbations may have 
a great effect on the particle trajectory if there is a resonance or if a particular 
instability occurs. With these caveats in mind one can assume that in a well defined 
particle beam line with reasonable beam sizes and well designed and constructed 
magnets the perturbations are generally small and that mathematical perturbations 
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methods are applicable. Specifically, we will in most cases assume that the (x, y) 
amplitudes appearing in some of the perturbation terms can be replaced by the 
principal solutions of the homogeneous differential equations. 


Dispersion Function 


One of the most important perturbations derives from the fact that the particle beams 
are not quite monochromatic but have a finite spread of energies about the nominal 
energy cpo. The deflection of a particle with the wrong energy in any magnetic 
or electric field will deviate from that for an ideal particle. The variation in the 
deflection caused by such a chromatic error Ap in bending magnets is the lowest 
order of perturbation given by the term 6/9, where 6 = Ap/po « 1. We will ignore 
for now all terms quadratic or of higher order in 6 and use the Green’s function 
method to solve the perturbed equation 


ul" + K(z)u = Kou(z)6. (5.79) 


In (5.78) we have derived a general solution for the equation of motion for any 
perturbation and applying this to (5.79), we get 


u(z) = aC, (z) + bS,(z) + 6D,(2), 


u'(2) = aC) (2) + dS. (2) + 6D'.(2), (5.80) 
where we have set P,,(z) = 5D, (z) and used (5.77) to obtain 
D,(Z) = is Kou(Z) [Su(z)Cu(Z) io Ci(z)Su(Z)] dz. (5.81) 
0 


We have made use of the fact that like the perturbation the particular solution must 
be proportional to 6 as well. The function D,,(z) is called the dispersion function and 
the physical interpretation is simply that the function 6 D,,(z) determines the offset 
of the reference trajectory from the ideal path for particles with a relative energy 
deviation 6 from the ideal momentum cpp. 

This result shows that the dispersion function generated in a particular bending 
magnet does not depend on the dispersion at the entrance to the bending magnet 
which may have been generated by upstream bending magnets. The dispersion 
generated by a particular bending magnet reaches the value D,(L,,) at the exit of 
the bending magnet of length L, and propagates from there on through the rest of 
the beam line just like any other particle trajectory. This can be seen from (5.81), 
where we have for z > Lm 


Tin Im 
D,(2) = S.A2) i ey(@)Cu@)dz — Cy(2) 7 Ku @SuDee, (5.82) 
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which has exactly the form of (5.64) describing the trajectory of a particle starting 
with initial parameters at the end of the bending magnet given by the integrals. With 
the solution (5.80) we can expand the 2 x 2-matrix in (5.67) into a 3 x 3-matrix, 
which includes the first order chromatic correction 


u(z) C2) Suz) Du(2) u(Zo) 
W(i2p=HlCOIS@ DE u' (Zo) (5.83) 
6 0 0 1 6 


Here we have assumed that the particle energy and energy deviation remains 
constant along the beam line. This representation of the first order chromatic 
aberration will be used extensively in particle beam optics. 


Problems 


5.1 (S). Derive (5.32a) and (5.32.c) from the Lagrange equations. Show all steps. 


5.2 (S). Derive the Lagrangian (5.35) from (5.30) (Hint: Its the variational principle 
6 y Ldt = 0 that needs to be transformed). 


5.3 (S). Verify the numerical validity of (5.7). 
5.4 (S). Show that (5.77) is indeed a solution of (5.74). 


5.5 (S). Transform the Hamiltonian (5.49) of a harmonic oscillator into action- 
angle variables and show that the frequency is v = sD. Derive the equation of 
motion. 


5.6. Show the validity of the transformation equations (5.54a) and (5.54b). Interpret 
the physical meaning of (5.56) and (5.57). 
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Chapter 6 
Electromagnetic Fields 


Beam dynamics is effected by electromagnetic fields. Generally, magnetic fields 
are used for relativistic particle guidance and focusing while electric fields are 
mostly used in the form of electro-static fields or microwaves for acceleration of 
the particles. In this chapter, we will discuss in more detail the magnetic fields and 
their generation as they are used in beam dynamics. From (1.52), (1.51) we know 
how to derive static electric and magnetic fields from a vector or scalar potential by 
solving their Laplace equations. 


6.1 Pure Multipole Field Expansion 


Special desired effects on particle trajectories require specific magnetic fields. 
Dipole fields are used to bend particle beams and quadrupole magnets serve, 
for example, as beam focusing devices. To obtain an explicit formulation of the 
equations of motion of charged particles in an arbitrary magnetic field, we derive 
the general magnetic fields consistent with Maxwells equations. 

Although we have identified a curvilinear coordinate system moving together 
with particles to best fit the needs of beam dynamics, we use in this section first, 
for simplicity, a fixed, right-handed Cartesian coordinate system (x, y, z). By doing 
so, we assume straight magnets and neglect the effects of curvature. Later in this 
chapter, we will derive both the electromagnetic fields and equations of motion in 
full rigor. 
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6.1.1 Electromagnetic Potentials and Fields 
for Beam Dynamics 


Earlier we have derived the potentials from the wave equation in a charge and 
current free static environment. This is the beam environment and we want to 
formulate fields for beam dynamics there. In the same environment Maxwell’s 
equations reduce to VB = 0 and V x B = 0 and can be used directly. Based 
on these equations, the magnetic fields can be derived from potentials by (1.51) as 
previously defined. Electrostatic fields are derived from a scalar potential alone 
according to (1.52). 

In beam dynamics we use mostly purely transverse magnetic fields and from 
the definition of the magnetic field by the vector potential we find that only the 
component A, # 0 collapsing practically to a scalar. To simplify math, we try to 
formulate a complex potential for transverse only fields and set 


P(z) =A, (z) + iV (z), (6.1) 


where z = x+iy. We define also a complex field which we hope to derive from 
the complex potential. The usual derivation of fields from potentials with B = 
By,+iBy = — a however, does not work as can be shown by back-substitution. 
On the other hand, the conjugate complex form 


se : _oP 
BY = B, —1By =i— (6.2) 
: az 


is a valid, Maxwell compliant formulation. This is true because only the second 
formulation is an analytical function f (z) = u (x, y) +iv (x, y) meeting the Cauchy- 
Riemann conditions 


= and =— (6.3) 


and are solutions of the Laplace equation. Evaluating (6.2) we get while dropping 
the index . in the non-zero component of the vector potential (A; = A) 


Op GA +i0V CO +i =A 
BY = B, — iB, =i— =i SY foe ie 


= = 6.4 
ae axtidy 1412 dx Ox ore 


because xLy. Similarly, 


0A -0V 
oP 0A + i0V a +15, OA aV 
Pepe ag = eee Oe a (6.5) 
; dz dx + idy a +i dy dy 
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Equating real and imaginary terms on both sides we may now express the field 
components like 


aV OV 

B, =-—— and By, =—-—, or (6.6a) 
Ox ; dy 
A A 

B, = Pics and B, = = (6.6b) 
dy Ox 


which are just the Cauchy-Riemann conditions for the complex magnetic field B*. 
Both field definitions are valid definitions. 

The potential of real magnets can be expanded into a power series defining all 
multipoles. Any function of an analytical function is also an analytical function. 
Especially, the power series 


P@®=)LGet+ by" =>) az =) are (6.7) 


n=0 n>=0 n=0 


is an analytical function and therefore all components P,, are complex solutions of 
the Laplace equation with complex amplitudes 


=i, + ip, (6.8) 


The coefficients 4, are for upright multipoles while the jz, are those of skew 
multipoles. Upright multipoles are characterized by midplane symmetry which 
requires that for y = 0 the horizontal fields vanish B,(y = 0) = O and only 
vertical field components exist B,(y = 0) #4 0. In beam dynamics we almost 
exclusively use upright magnets. This ansatz is not the most general solution of the 
Laplace equation, but includes all main multipole fields used in beam dynamics. 
Later, we will derive a solution that includes all terms allowed by the Laplace 
equation in a curvilinear coordinate system. Both, the real and imaginary part, are 
two independent solutions of the same Laplace equation. All coefficients 2,,, (4, are 
still functions of z although we do not indicate this explicitly. 

We distinguish between the electrical potential V. and the magnetic potential Vin. 
Since the Laplace equation is valid for both the electric as well as the magnetic field 
in a material free region, no real distinction between both fields had to be made. 
In reality, we rarely design devices which include more than one term of the field 
expansion. It is therefore appropriate to decompose the general field potential in 
(6.7) into its independent multipole terms. To keep the discussion simple, we ignore 
here electric fields. 
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6.1.2 Fields, Gradients and Multipole Strength Parameter 


In (6.7) we used general coefficients which must be related to fields and field 
gradients. Furthermore, we are looking for energy independent magnet strength 
parameters which are almost exclusively used in beam dynamics. The particular 
field patterns for multipole magnets can be derived from the complex potential by 
differentiation to get the fields (6.6a). Although fields can be derived from both the 
vector and scalar potential, we will use only the latter to define the fields for beam 
dynamics. 

The first term in (6.7) Co is a constant and will not contribute to transverse fields. 
However Cp (z) and will therefore show up for longitudinal fields which we will 
discuss in Sect. 6.6. In Table 6.1 the scalar potentials are listed for the first five 
multipoles. In this list we have already introduced more practical quantities to be 
further defined. The coefficients (A,,, (4,) have been replaced by field gradients 


1 
An = ——Spy and (6.9) 
n! 
1 
Mn = ~~ Sn» 
n! 


which are defined for upright and skew magnets of order n by 


nail on Rk. 
Sn (T/m’ ) =-+ - : n= 1,2.3.25 and (6.10a) 
Ox"—-1 |x=0 
y=0 
n—-1\ _ on By 
5, (T/m""') = — ai =0" (6.10b) 
y= 


respectively. Following common practice we use special letters for fields and 
gradients in low order multipoles (see Table 6.2, left column). In anticipation of 
formulating equations of motion we further introduce energy independent field 
gradients. Fields and gradients are not convenient for beam dynamics where 
we design energy independent beam transport systems. This we can do by a 
normalization that includes a general energy factor called the beam rigidity or just 


Table 6.1 Magnetic multipole potentials 


Dipole —V, = —B,x — Byy 

Quadrupole —V) = —t2 (7? —y’) + xy, 

Sextupole B= £83 (e = 307} 4 £53 (3x*y—y’) , 

Octupole -V, = ris (x* — 6x7y? + y*) + 5q54 (x3y — xy) , 

Decapole Vs = —z5p9s (x° — 10x3y? + Sxy*) + ip85 (Sxty — 10x?y3 + y’) 
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Table 6.2 Field gradient Dipole B £Bo= 
y 


nomenclature for low order 

: oBy __ 2 OBy _ 
multipoles Quadrupole ax 8 oo ax k 

By e OBy 
Sextupole w= Ss aa Mm 

aBy e OBy 
Octupole zo = 5% ae 

a By e OBy __ 
Decapole za = $5 aoe Ss 


the “Brho’’from its mathematical form as 


sg co PE 
ee Ge 09979 


BE(GV). (6.11) 


This normalization factor is different for electrical and magnetic fields 


E(GV 
bin = BEIGY) for magnetic fields, and (6.12) 
: 0.29979 
2E (GV 
Re= PEN) for electric fields. (6.13) 
, 0.29979 


This difference will obviously vanish for highly relativistic particles (B ~ 1). In 
beam dynamics we use for relativistic beams mostly magnets and therefore we will 
use in this book the beam rigidity for magnetic fields Ry.m unless otherwise noted. 
For low order magnet strength parameters we use x,,k,m for bending magnets, 
quadrupoles and sextupoles, respectively as shown in the right column of Table 6.2. 
In Chap. 4 the particle path in a uniform field B has been derived as an arc with 
radius p 


1 
—_ = —B,. (6.14) 


This equation illustrates directly the normalization with a factor equal to the product 
of Bp. Applied to a bending magnet, for example, we find that the curvature k, = 
1/p is the normalized quantity for the uniform bending field B,. Since we rarely 
deal with vertical bending magnets we drop the index y in B, and the index x in k,. 

In (6.14) the curvature or the field can be treated very generally not just as the 
properties of a bending magnet. Equation (6.19) can be used as the general field and 
we obtain by multiplication with the beam rigidity 


1 1 = 
= — +t de? 44 + = Ya, (6.15) 
p Po : ¢ dX 
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Table 6.3 Upright multipole fields 


Dipole = 5, By 

Quadrupole = kx 

Sextupole = tm (x? = y’) 

Octupole = bq (3 = 3xy’) 
Decapole =4 HSs (x4 — 6x?y? + y4) 


Dipole (90°) 
Quadrupole (45°) 
Sextupole (30°) 
Octupole (22.5°) 
Decapole (18°) 


The angles indicate the orientation of the central pole with respect to the y-axis (the y-axis is at 


90°) 


where 7 is the pure dipole field and the multipole magnet strengths 


C= (6.16) 
n= BE" : 
or in more practical units 
Sy (m™”) = 0.29979 + sy, (T/m"~') . (6.17) 


This gives us immediately the normalization for quadrupoles, sextupoles and 
higher order multipoles. These parameters are used in beam dynamics as the energy 
independent magnet strengths while field gradients would scale with beam energy. 
From Table 6.1 we get by differentiation for upright multipoles the fields for low 
order upright multipole magnets which are compiled in Table 6.3. 

The other class of magnets does not have mid-plane symmetry but the magnets 
have the same field patterns as the corresponding upright magnets, yet are rotated 
about the z-axis by an angle ¢, = 2/(2n), where n is the order of the multipole. 
These magnets are rarely used in beam dynamics and if so mostly as corrections to 
field errors. For example, misaligned quadrupoles can create a skew field causing 
undesired coupling of particle motion between horizontal and vertical plane. Such 
coupling can be compensated by installing skew quadrupoles. From the expressions 
for the multipole potentials in Table 6.1 we obtain again the multipole field 
components which are compiled up to decapoles in Table 6.4. 

The characteristic difference between the two sets of field solutions is that the 
fields of upright linear magnets in Table 6.3 do not cause coupling for particles 
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traveling in the horizontal or vertical midplane, in contrast to the rotated magnet 
fields of Table 6.4 which would deflect particles out of the horizontal midplane. In 
linear beam dynamics, where we use only dipole and upright quadrupole magnets, 
the particle motion in the horizontal and vertical plane are completely independent. 
This is a highly desirable “convenience” without which particle beam dynamics 
would be much more complicated and less predictable. Since there is no particular 
fundamental reason for a specific orientation of magnets in a beam transport 
systems, we may as well use that orientation that leads to the simplest and most 
predictable results. We will therefore use exclusively upright magnet orientation for 
the main magnets and treat the occasional need for rotated magnets as a perturbation. 
In summary, the general magnetic field equation including only the most commonly 
used upright multipole elements are given by 


Cp 1 2 3 
Pcie +ky + mxy + ar (3x y-y)+... (6.18a) 
0 
1 
“B, = a t+ ke+ sm (a? —y’) + er Ge = 3xy’) aagd (6.18b) 
Po — 0 


Sometimes it is interesting to investigate the particle motion only in the horizon- 
tal midplane, where y = 0. In this case we expect the horizontal field components 
B, of all multipoles to vanish and any deflection or coupling is thereby eliminated. 
In such circumstances, the particle motion is completely contained in the horizontal 
plane and the general fields to be used are given by 


~ B= 0 (6.19a) 
Po 

aay — = +kx+ 5mx- + tx? tot Fg yet (6.19b) 
Po Po (n—1)! 


6.1.3 Main Magnets for Beam Dynamics 


The feasibility of any accelerator or beam transport line design depends fundamen- 
tally on the parameters and diligent fabrication of technical components composing 
the system. Not only need the magnets be designed such as to minimize undesirable 
higher order multipole fields but they also must be designed such that the desired 
parameters are within technical limits. Most magnets constructed for beam transport 
lines are electromagnets rather than permanent magnets. The magnets are excited 
by electrical current carrying coils wound around magnet poles or in the case of 
superconducting magnets by specially shaped and positioned current carrying coils. 
In this section, we will discuss briefly some fundamental design concepts and limits 
for most commonly used iron dominated bending and quadrupole magnets as a 
guide for the accelerator designer towards a realistic design. For more detailed 
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discussions on technical magnet designs we refer to related references, for example 
[1, 2]. 

Iron dominated magnets are the most commonly used magnets for particle beam 
transport systems. Only where very high particle energies and magnetic fields are 
required, superconducting magnets are used with maximum magnetic fields of 6— 
10T compared to the maximum field in an iron magnet of about 2T. Although 
saturation of ferromagnetic material imposes a definite limit on the strength of iron 
dominated magnets, most accelerator design needs can be accommodated within 
this limit. 

We are now in a position to determine the fields for any multipole. This will be 
done in this section for magnetic fields most commonly used in particle transport 
systems, the bending field and the focusing quadrupole field. Only for very special 
applications are two or more multipole field components desired in the same magnet 
like in a gradient bending magnet or synchrotron magnet. 


Deflecting Magnets 
For the bending field n = | and we get from (6.7) the magnetic potential 
Pix, y) =A, +iV) = Cy w+ iy) = Aix py) +iGuiy + Mx). (6.20) 


in case of bending magnets, the skew type is a vertical bending magnet which is 
used in beam dynamics very rarely. The equipotential lines in the transverse (x, y)- 
plane along which the scalar potential is constant are determined for the first order 
potential by 


V, = Avy + 1x = const (6.21) 


and the corresponding electromagnetic field is given in component formulation by 
the vector 


B= (—p1,—A1,0). (6.22) 


Equation (6.22) defines the lowest order transverse field in beam guidance or beam 
transport systems, is uniform in space and is called a dipole field. To simplify the 
design of beam transport systems it is customary to use dipole fields that are aligned 
with the coordinate system such as to exert a force on the particles only in the 
horizontal x- or only in the vertical y-direction. With these definitions, we have for 
a horizontally deflecting magnet (A; 4 0, 41 = 0) and for a vertically deflecting 
magnet (A; = 0, 41 4 0).To design a pure dipole magnet, we would place iron 
surfaces at equipotential lines. Specifically, for a horizontally deflecting magnet the 
equipotential lines are at 


y=+6 (6.23) 
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to define a uniform vertical field within a vertical magnet aperture of 2G. Infinitely 
long magnets are assumed and the equipotential surface is defined by the same line 
anywhere along z. 

As mentioned above, vertical bending magnets are rarely used in accelerator 
physics. Yet, there are special instances, especially in beam transport lines where 
vertical bending magnets are required. In those cases we would just introduce a 
vertical curvature Ky in (6.18a) or (6.19a) cover the vertical dispersion function. 
Outside the bending magnet the dispersion behaves just like a particle trajectory 
and therefore the quadrupoles do not have to be rotated or modified. 


Focusing Device 


The most suitable device that provides a material free aperture and a desired 
focusing field is a quadrupole magnet. The magnetic field can be derived from the 
term n = 2 of the scalar potential (6.7) 


P(x, y) = Co (x t iy)? = Cy ( —y* + idxy). (6.24) 


Similar to the dipole case, both the real and imaginary parts are two independent 
solutions of the same Laplace equation and therefore the potential for both 
components can be written in the form 


Po(x,y) = Ap +iV2 = Ag(x? —y*) — 2poxy + i[2Aoxy + po (x? —-y’)]. (6.25) 


Both the real and imaginary solutions are independent solutions with independent 
coefficients. Coefficient —2A. = g is equal to the field gradient for an upright 
quadrupole and —2j12 =g, which is the field gradient of a skew quadrupole. 
Separating both solutions, equipotential lines in the transverse (x, y)-plane for both 
second order potentials can be defined by 


x° —y? =const, for the skew quadrupole and (6.26a) 


xy = const. for the upright quadrupole. (6.26b) 


Magnetic equipotential surfaces with a profile following the desired scalar 
potential (6.1.3) will be suitable to create the desired fields. The field pattern of an 
upright quadrupole magnet (6.26b) is shown schematically in Fig. 6.1 (left) together 
with the pole configuration for a rotated quadrupole Fig. 6.1 (right). 


Synchrotron Magnet 


Sometimes a combination of both, the dipole field of a bending magnet and the 
focusing field of a quadrupole, is desired for compact beam transport lines to form 
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xy = 1/2R? x2. y2= 1/2R2 


Fig. 6.1 Pole shape of an upright quadrupole (/eft) and of a rotated quadrupole magnet (right) 


pole profile 


Fig. 6.2 Pole profile for a synchrotron magnet (schematic) 


what is called a synchrotron magnet. The name comes from the use of such magnets 
for early synchrotron accelerators. The fields can be derived just like the dipole and 
quadrupole fields from the two-term potential (6.7) with n = 1 andn = 2. 

Such a magnet actually is nothing but a transversely displaced quadrupole. The 
field in a quadrupole displaced by xo from the beam axis is By = g(x—Xx0) = gx—gxo 
and a particle traversing this quadrupole at x = 0 will be deflected by the field B, = 
gxo. At the same time, we still observe focusing corresponding to the quadrupole 
field gradient g. The pole cross section of such a magnet is shown in Fig. 6.2. 
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The deviation from parallelism of the magnet poles at the reference trajectory is 
often quantified by the characteristic length defined by 


By 1 
(n= 


Geometrically this characteristic length is equal to the distance from the reference 
trajectory to that point at which the tangents from the two magnet poles at the 
vertical reference plane would touch (Fig. 6.2). 


Higher Order Multipole Magnets 


In a general beam transport line we use bending and quadrupole magnets to guide 
and focus a particle beam. For more sophisticated systems, however, we experience 
chromatic aberrations as is known from light optics. Particles with slightly different 
energies are focused differently and the image becomes blurred. In light optics such 
aberrations are partially corrected by the use of glasses with different refractive 
indices. In particle optics we use sextupoles. As the name indicates this magnet is 
composed of six poles. The complex potential is 


P3 (z) = A3 +iV3 = C3 (x + iy)’ = As (2° — 329”) — bs (3x°y -y’) 
+ i[A3 (3x°y— yy’) + w3 (x? —32y°)]. (6.28) 


Only upright sextupoles are used in beam dynamics for which —6A3 = 53 the ideal 
fields are 


1 
+B =—mxy and - Rt, =-=m (x? — y’) : (6.29) 
Po Po ~ 2 


The pole profile is given by the scalar potential V3 
V3 = 3x’y — y? = const (6.30) 


which describes the center poles along the vertical axis. To get the other poles one 
must rotate the center pole by 60°. The aperture radius R must be chosen like in the 
case of the quadrupole from other consideration related to the application and beam 
requirement. The actual sextupole profile (6.30) is then given for the center pole by 


3x7y —y? = —R?. (6.31) 
The magnet pole shapes for sextupole octupole or higher order magnets are 


shown in Fig.6.3. Odd order multipoles like dipoles, sextupoles, decapoles etc. 
are characterized by central poles along the vertical axis (Fig. 6.3 left). Even order 
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Fig. 6.3 Pole profile for an upright sextupole (/eft) and octupole (right) magnet 


multipoles have no poles along the horizontal or vertical axis (Fig. 6.3 right). The 
profile can be derived directly from the respective potential (6.7). Only the profile of 
one pole must be determined since the other poles are generated by simple rotation 
of the first pole by multiples of the angle 90° /n, where 7 is the order of the multipole. 
Multipoles of higher order than sextupoles are rarely used in accelerator physics but 
can be derived from the appropriate multipole potentials. 

For an arbitrary single higher order multipole the field components can be derived 
from its potential (6.7) 


Pr(x,y) = An +iVn = Cy (x + iy)". (6.32) 


From this equation it is straight forward to extract an expression for the potential 
of any multipole field satisfying the Laplace equation. Since both electrical and 
magnetic fields may be derived from the Laplace equation, we need not make any 
distinction here and may use (6.32) as an expression for the electrical as well as the 
magnetic potential. 

As mentioned before, it is useful to keep both sets of solutions (An,ln) separate 
because they describe two distinct orientations of multipole fields. For a particular 
multipole both orientations can be realized by a mere rotation of the element about 
its axis. Only the solution A, has what is called midplane symmetry with the 
property that B,y(x,y) = Bny(x, —y). In this symmetry, there are no horizontal field 
components in the midplane, B,,,.(x, 0) = 0, and a particle travelling in the horizontal 
mid plane will remain in this plane. We call all magnets in this class upright magnets. 

The magnets defined by 4, #~ O we call rotated or skew magnets since they 
differ from the upright magnets only by a rotation about the magnet axis. In real 
beam transport systems, we use almost exclusively magnetic fields with midplane 
symmetry. 
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Fig. 6.4 Simulation of the dipole field through a vacuum chamber which is magnetic (uw = 1.8) 
(left). The same situation is shown on the right side after annealing of the vacuum chamber 


Vacuum Chamber Material 


We have made great efforts to optimize the multipole field quality, but much of 
this can be destroyed again with the installation of a vacuum chamber. The vacuum 
chamber must be made of material which is non-magnetic. This is no problem with 
Aluminum or Copper but great care must be exercised with steel chambers. Non- 
magnetic material with a permeability of some ~ = 1.01 or = 1.02 should be 
used. If the permeability is greater, the vacuum chamber walls concentrate magnetic 
flux which distorts the desired field. A field simulation with vacuum chamber is 
shown in Fig. 6.4! where we note the field concentration in two parts of the vacuum 
chamber (left) which has a permeability of 44 = 1.8. The simulation is for the 
NSRRC booster where beam could not be stored at injection energy of 150 MeV 
because of the magnetic properties of the vacuum chamber. After annealing to about 
1,050°C the permeability was reduced to 4 ~ 1.01 — 1.02 and the effect of the 
vacuum chamber has been clearly eliminated (right). Similar effects on the ideal 
magnetic field can occur in any other multipole. While the perturbation seems small 
and barely noticeable it is big enough to prevent storage of a beam in a circular 
accelerator. 


6.1.4 Multipole Misalignment and “‘Spill-down” 


In beam dynamics it is very important to align magnets very precise. However, there 
are limits and we need to know what happens if we misalign magnets. We consider 
first only a rotational misalignment by the angle 6. The scalar potential is then 


PAE Oar, (6.33) 


'The author thanks Jyh-Chyuan Jan, Cheng-Ying Kuo and Ping J. Chou from NSRRC, Taiwan for 
the pictures showing the effect of a magnetized vacuum chamber based on simulations. 
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Expanding this, we get for small rotations such that nd < 1 
P, (7,9) = Cree = C,r"ei”? (1 — ind). (6.34) 


The rotational error 6 has not altered the original magnetic field, but has added 
a small skew component of the same magnet. Much more dramatic are lateral 
misalignments. Here, we start from (6.32) and misplace the magnet by the amount 
bz = bx+idy. 


Py, (x, y) = Cn (2 + 62)”. (6.35) 
This can be expanded for 


P,, (x,y) = Cy (¢ + 8z)" (6.36) 
m Cz" + Cy [(i)2"'8z + 2" 7827 +... + ("52"). 


The original field is still preserved, but now many lower order terms appear. 
Actually, for a lateral misalignment all lower order magnetic field components 
appear, a phenomenon that is called “spill-down”. These lower order fields cause 
orbit distortions, focusing errors and errors in the chromaticity, which all have to be 
compensated. 


6.2 Main Magnet Design Criteria 


In this section we will shortly discuss the design criteria for the main beam dynamics 
magnets like bending magnets and quadrupoles. For more detailed studies on 
magnets the reader is referred to relevant texts like [2]. 


6.2.1 Design Characteristics of Dipole Magnets 


The expressions for the magnetic potentials give us a guide to design devices that 
generate the desired fields. Multipole fields are generated mostly in one of two ways: 
as iron dominated magnets, or by proper placement of electrical current carrying 
conductors. The latter way is mostly used in high field superconducting magnets, 
where fields beyond the general saturation level of about 2 T for iron are desired. 
In iron dominated magnets, fields are determined by the shape of the iron 
surfaces. Just like metallic surfaces are equipotential surfaces for electrical fields, 
so are surfaces of ferromagnetic material, like iron in the limit of infinite magnetic 
permeability, equipotential surfaces for magnetic fields. Actually, for practical 
applications the permeability only has to be large just like the conductivity must be 
large to make a metallic surface an equipotential surface. This approximate property 
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of iron surfaces can be exploited for the design of unsaturated or only weakly satu- 
rated magnets. For preliminary design calculations, we assume infinite permeability. 
Where effects of finite permeability or magnetic saturation become important, the 
fields are determined numerically by mathematical relaxation methods. In this text, 
we will not be able to discuss the details of magnet design and construction but 
will concentrate only on the main magnet features from a beam dynamics point of 
view. A wealth of practical experience in the design of iron dominated accelerator 
magnets, including an extensive list of references, is compiled in a review article by 
Fischer [1] and a monograph by Tanabe [2]. 


Excitation Current and Saturation in a Bending Magnet 


A dipole field can be generated, for example, in an electromagnet as shown in 
Fig. 6.5 where the beam would travel normal to the cross section into the center 
of the magnet. 

The magnetic field B is generated by an electrical current J in current carrying 
coils surrounding magnet poles. A ferromagnetic return yoke surrounds the excita- 
tion coils providing an efficient return path for the magnetic flux. The magnetic field 
is determined by Ampere’s law 


B : 
Vx —=bLo/, (6.37) 


T 


where ju; is the relative permeability of the ferromagnetic material and j is the 
current density in the coils. Integrating (6.37) along a closed path like the one shown 
in Fig. 6.5 and using Stokes’ theorem gives 


B 
SGBy +: / do = pole, (6.38) 
[lr 


iron 


excitation 
coil 


pole gap 


return 
choke 


magnet pole integration path 


Fig. 6.5 Cross section of a dipole magnet (schematic) 
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where Bo is the magnetic field in the center of the magnet aperture between and 
normal to the parallel magnet poles with a gap distance of 2G. The integral term 
in (6.38) is zero or negligibly small in most cases assuming infinite or a very large 
permeability within the magnetic iron. Jio¢ = 2J¢oi1 is the total current flowing in the 
complete cross section of both coils. Solving (6.38) for the total current in each coil 
we get in more practical units 


1 
Teoi(A) = —Bo (T) G(m), (6.39) 
Lo 


which is proportional to the magnetic field and the aperture between the magnet 
poles. 

As a practical example, we consider a magnetic field of 1 T in a dipole magnet 
with an aperture of 2G =10cm. From (6.39), a total electrical excitation current 
of about 40,000 A is required in each of two excitation coils to generate this field. 
Since the coil in general is composed of many turns, the actual electrical current is 
much smaller by a factor equal to the number of turns and the total coil current [coi 
is therefore often measured in units of Ampere-turns. For example, a coil composed 
of 40 windings with sufficient cross section to carry an electrical current of 1,000 A 
would provide the total required current of 40,000 A-turns. 

As arule of thumb to get a good field quality within an aperture width equal to the 
full gap height the pole width should be at least 3-times the full gap height. Narrower 
pole profiles require shimming of the pole profile. There are elaborate way to shape 
the pole profile for a bending magnet [2] but there are also more simple ways. The 
drop-off of the field towards the side of the poles can be to some extend extended 
further out by adding to the pole profile a straight line shim to slightly reduce the 
pole gap around the edges of the poles. This shim need not be more elaborate than 
a line segment to reduce the gap followed by a horizontal section to the edge of the 
pole. Such shims may start around half a full gap size from the center with a gentle 
slope and rarely a thickness of more than 0.5—1 mm. We will discuss such shims in 
more detail in connection with quadrupole design. 

Saturation effects are similar to those in a quadrupole magnet which will be 
discussed in the next section. Like in any magnet the first sign of saturation show up 
most likely at the pole root where the poles join the return yoke. That is so because 
much magnetic flux comes into the pole from the sides along the length of the pole 
thus increasing the magnetic flux density. One way out is to shape the pole pieces 
like wedges with increasing cross section towards the return yoke. Any saturation 
in the return yoke is easily avoided by increasing the thick ness of the iron in the 
return yoke. 


6.2.2 Quadrupole Design Concepts 


Quadrupoles together with bending magnets are the basic building blocks for 
charged particle beam transport systems and serve as focusing devices to keep 
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the particle beam close to the desired beam path. The magnet pole profile for a 
quadrupole can be derived the same way as that for a dipole magnet. Placing an iron 
boundary in the shape of a hyperbola generates the equipotential surface required 
for an upright quadrupole, or mathematically 


xy = const. (6.40) 


The inscribed radius of the iron free region is R and the constant in (6.40) is 


therefore (R/ wel = $R? as shown in Fig.6.1. The pole shape or pole profile 
for a quadrupole with bore radius R is then defined by the equation 


xy = £3R’. (6.41) 
Similarly, the pole profile of a rotated quadrupole is given by 
ra a = +R’. (6.42) 


This is the same hyperbola as (6.41) but rotated by 45°. Both (6.41) and (6.42) 
describe four symmetrically aligned hyperbolas which become the surfaces of 
the ferromagnetic poles producing an ideal quadrupole field. Magnetization at 
alternating polarity of each pole generates a sequence of equally strong north and 
south poles. 

In a real quadrupole, we cannot use infinitely wide hyperbolas but must cut- 
off the poles at some width. In Fig.6.6 some fundamental design features and 
parameters for a real quadrupole are shown and we note specifically the finite pole 
width to make space for the excitation coils. Since only infinitely wide hyperbolic 
poles create a pure quadrupole field, we expect the appearance of higher multipole 
field errors characteristic for a finite pole width. 


Pole Profile Shimming 


While in an ideal quadrupole the field gradient along, say, the x-axis would be 
constant, we find for a finite pole width a drop off of the field and gradient 
approaching the corners of poles. Different magnet designer apply a variety of pole 
shimming methods. In this text we use tangent shimming as described below. The 
field drop off at the pole edge can be reduced to some extend if the hyperbolic pole 
profile continues into its tangent close to the pole corner as indicated in Fig. 6.6. 
This adds some iron to increase the field where the field would otherwise fall 
below the desired value. The starting point of the tangent determines greatly the 
final gradient homogeneity in the quadrupole aperture. In Fig. 6.7 the gradient along 
the x-axis is shown for different starting points of the tangent. There is obviously an 
optimum point for the tangent to minimize the gradient error over a wide aperture. 
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Fig. 6.7 Empirical field gradient and pole profile shimming for a particular quadrupole as 
determined by numerical simulations with the program MAGNET [3] 


Application of tangent shimmingmust be considered as a fine adjustment of the 
field quality rather than a means to obtain a large good field aperture as becomes 
apparent from Fig. 6.7. The good field aperture is basically determined by the width 
of the pole. While optimizing the tangent point, we find an empirical correlation 
between gradient tolerance (Fig. 6.8) within an aperture region x < X,, and the pole 
width expressed by the minimum pole distance A. The good field region increases 
as the pole gets wider. For initial design purposes, we may use Fig. 6.8 to determine 
the pole width from A based on the desired good field region X,, and gradient field 
quality. 

The final design of a magnet pole profile is made with the help of computer codes 
which allow the calculation of magnet fields from a given pole profile with satura- 
tion characteristics determined from a magnetization curve. Widely used computer 
codes for magnet design are, for example, MAGNET [3] and POISSON [4]. 
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Fig. 6.8 Field gradient tolerances as a function of pole profile parameters calculated with 
MAGNET 


Field errors in iron dominated magnets have two distinct sources, the finite pole 
width and mechanical manufacturing and assembly tolerances. From symmetry 
arguments, we can deduce that field errors due to the finite pole width produce only 
select multipole components. In a quadrupole, for example, only (2n + 1) - 4-pole 
fields like 12-pole or 20-pole fields are generated. Similarly in a dipole of finite pole 
width only (2n + 1) - 2-pole fields exist. We call these multipole field components 
often the allowed multipole errors. Manufacturing and assembly tolerances on the 
other hand do not exhibit any symmetry and can cause the appearance of any 
multipole field error. 

The particular choice of some geometric design parameters must be checked 
against technical limitations during the design of a beam transport line. One basic 
design parameter for a quadrupole is the bore radius R which depends on the 
aperture requirements of the beam. Addition of some allowance for the vacuum 
chamber and mechanical tolerance between chamber and magnet finally determines 
the quadrupole bore radius. 


Excitation Current and Saturation 


The field gradient is determined by the electrical excitation current in the quadrupole 
coils. Similar to the derivation for a bending magnet, we may derive a relation 
between field gradient and excitation current from Maxwell’s curl equation. To 
minimize unnecessary mathematical complexity, we choose an integration path as 
indicated in Fig. 6.9 which contributes to the integral ¢ B,ds only in the aperture of 
the quadrupole. 

Starting from the quadrupole axis along a path at 45° with respect to the 
horizontal or vertical plane toward the pole tip, we have 


1 R 
= PBas= [Br = Hobo. (6.43) 
Lr 0 
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Fig. 6.9 Determination of 
the field gradient from the 
excitation current 


iron yoke 


integration 
path 


Since B, = gy and By = gx, the radial field component is B, = ,/By + BY = gr 
and the excitation current from (6.43) is given by 


1 T 
Tro (A X turns) = a’ (=) R’(m). (6.44) 
0 


The space available for the excitation coils or coil slot in a real quadrupole 
design determines the maximum current carrying capability of the coil. Common 
materials for magnet coils are copper or aluminum. The electrical heating of the 
coils depends on the current density and a technically feasible balance between 
heating and cooling capability must be found. As a practical rule the current density 
in regular beam transport magnets should not exceed about 6-8 A/mm/?. This is 
more an economical than a technical limit and up to about a factor of two higher 
current densities could be used for special applications. The total required coil 
cross section, however, including an allowance for insulation material between coil 
windings and about 15—20 % for water cooling holes in the conductor depends on the 
electrical losses in the coil. The aperture of the water cooling holes is chosen such 
that sufficient water cooling can be provided with an allowable water temperature 
increase which should be kept below 40°C to avoid boiling of the cooling water at 
the surface and loss of cooling power. A low temperature rise is achieved if the water 
is rushed through the coil at high pressure in which case undesirable vibrations of 
the magnets may occur. The water cooling hole in the conductor must therefore be 
chosen with all these considerations in mind. Generally the current density averaged 
over the whole coil cross section is about 60—70 % of that in the conductor. 

In practical applications, we find the required coil cross section to be significant 
compared to the magnet aperture leading to a long pole length and potential 
saturation. To obtain high field limits due to magnetic saturation, iron with a 
low carbon content is used for most magnet applications in particle beam lines. 
Specifically, we require the carbon content for most high quality magnets to be no 
more than about | %. In Fig. 6.10 the magnetization curve and the permeability as 
a function of the excitation are shown for iron with 0.5 % carbon content. We note 
a steep drop in the permeability above 1.6 T reaching full saturation at about 2 T. 
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Fig. 6.10 Magnetization and 10000 O 
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A magnet has an acceptable saturation level if the magnetic permeability anywhere 
over the cross section of the magnet remains large compared to unity, ju; > 1. 

Severe saturation effects at the corners of the magnet pole profile can be avoided 
if the maximum field gradient, as a rule of thumb, is chosen such that the pole tip 
field does not exceed a value of B, = 0.8 — 1 T. This limits the maximum field 
gradient to gmax = B,/R and the quadrupole length must therefore be long enough 
to reach the focal length desired in the design of the beam transport line. Saturation 
of the pole corners introduces higher-order multipoles and must therefore be kept to 
a minimum. 

Other saturation effects may occur at the pole root where all magnetic flux from 
a pole including fringe fields are concentrated. If the pole root is too narrow, the 
flux density is too high and saturation occurs. This does not immediately affect 
the field quality in the central aperture, but requires higher excitation currents. A 
similar effect may occur in the return yokes if the field density is too high because 
of too small an iron cross section. In Fig.6.11 a permeability plot is shown for a 
magnet driven into severe saturation. Low values of the permeability indicate high 
saturation, which is evident in the pole root. 

By increasing the width of the pole root the saturation is greatly reduced as shown 
in Fig. 6.12. To minimize pole root saturation the pole length should be as short as 
possible because less flux is drawn through the side of the pole. Unfortunately, this 
also reduces the space available for the excitation coils leading to excessively large 
current densities. To reduce this conflict, the pole width is usually increased at the 
pole root rather than shortening the pole length. 

In addition to pole root saturation, we may also experience return yoke saturation, 
which is easily avoided by increasing its thickness. 


6.3 Magnetic Field Measurement 


The quality of the magnetic fields translates immediately into the quality and 
stability of the particle beam. The precision of the magnetic fields determines 
the predictability of the beam dynamics designs. While we make every effort to 


146 6 Electromagnetic Fields 


3069 2955 2870 2697 2385 1675 869 407 
2892 2785 2735 2575 2254 1384 750 
2715 2603 2611 2468 2106 1169 652 
2524 2378 2499 2378 1953 1002 553 
2323 2129 2401 2307 1832 942 480 
2073 1613 2262 1751 899 422 


t) 0 374 0 i) 0 t) 0 i} t) t) 0 1) 0 t) 
i) oO 2000 -\_0 oO 0 i} oO oO 0 i) oO oO 0 t} 
i) oO 2000 20 Oo 0 i} oO oO 0 i} oO oO 0 i‘) 
i) oO 2097 2168 ~s oO 0 i) oO oO 0 i) oO oO 0 t) 
0 oO 2223 2289 2468 0 i} oO oO 0 i) oO oO 0 0 
t} oO 2378 2446 2619 \. 0 t) oO oO 0 i) oO oO 0 0 
tt} oO 2566 2651 2819 3139 i) oO oO 0 i) oO oO 0 0 
i} oO 2807 2910 3076 3160. 0 oO oO 0 0 oO oO 0 0 
0 Oo 3082 3159 3160 3160 316Q oO oO 0 0 oO oO 0 0 
i) oO 3160 3160 3160 3160 3160. 0 oO 0 0 oO oO 0 0 
i) 3160 3160 3160 3160 3160 3122 oO 0 i) oO oO 0 0 
0 3160 3160 3160 3160 3160 2885~. 0 0 0 oO oO 0 0 
3160 3160 3160 3160 3086 2667 22d i) 0 C1) 

3160 3160 3160 3160 2870 2464 1793 \ 0 C1) 0 

3160 3160 3160 2994 2691 2269 1378 739 i) pole root 0 

3160 3135 3017 2832 2531 1990 999 613\_ 0 0 

0 

0 

i) 

t) 

0 

0 

i) 

i) 

Q 


MetoSessveesescoseseesoesessoes 
soo Ses sen eescecsssoasaesoese 


58 BTN 
73 133 38g 

59 91 257 552\_ 
172 412 745 175 
585 981 25 
699 1357 


JNFDCOCSSCSSCFDC ODOC CDC COC OD ODOC OC OOOO 


ecocoeoceC OC COC oOOd 
ecco eoec COC COC oOOd 


Fig. 6.11 Permeability values are plotted in a grid over the iron cross section of a highly excited 
quadrupole. We note the significantly reduced permeability (4. < 100) in the narrow pole root 
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Fig. 6.12 Permeability values are plotted in a grid over the iron cross section of a highly excited 
quadrupole. We note the significantly reduced permeability (4. < 100) in the narrow pole root 
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construct magnets as precise as possible, we cannot avoid the appearance of higher 
multipole fields due to finite pole widths or machining and assembly tolerances. 
Therefore, precise magnetic field measurements are required. While detailed dis- 
cussions of magnetic field measurement technology exceeds the goals of this book, 
the issue is too important to ignore completely and we will discuss this topic in an 
introductory way. For more detailed information, please consult texts like [2]. 


6.3.1 Hall Probe 


The Hall probe is the most commonly used device to measure the magnetic field. 
Its principle is based on the Lorentz force on moving charges. Use a small piece 
of metallic foil, say 1 x 1 mm, send an electrical current in one direction through 
the foil and place the foil into a magnetic field such that the field penetrates the 
plane of the foil. The moving electrons feel the Lorentz force due to the presence 
of the magnetic field and are pulled off a straight path, thus accumulating charge 
on one side of the foil. That charge accumulation causes with the other side of 
the foil a potential difference, the Hall voltage, which can be measured and which 
is proportional to the magnetic field component passing orthogonally through the 
foil. The material of commercial Hall probes is not a metallic foil but some material 
which contains many electrons with great mobility to maximize the sensitivity of the 
probe. The size of the probe is made very small for maximum resolution because 
the probe measures the average field across the area of the foil. Typical areas of a 
Hall probe may be in the jum range which provides a high resolution as desired in 
magnetic field measurements for beam dynamics. Figure 6.13 shows the principle 
functioning of a Hall Probe. 

By computer controlled precise movement of the Hall probe from point to point 
within the magnet aperture, the magnetic field can be mapped to high precision. The 
measurements can then be analysed as to field errors, multipole content and fringe 
field effects. 


Fig. 6.13 Hall probe 
(schematic) (/ activation 
current, B magnetic field, ve 
velocity of electrons, FL 
Lorentz force, + V signal 
voltage) 


148 6 Electromagnetic Fields 


Fig. 6.14 Rotating coil ina * 


magnet to determine higher 7 if id if é 4 
order multipole components 


7 (Bs) 
B / sfi| \ 
{ | > 
\ / 
\ 4 
N 7 
_- —_ 


6.3.2 Rotating Coil 


In practice, however, the particles in a beam integrate through a whole magnet and 
we are therefore bound to do the same with Hall probe measurements. A faster 
method, and actually more precise method for higher order multipole fields, is a 
rotating coil as shown in Fig. 6.14. Here, a coil wound of very thin electrical wire 
is installed coaxial within the magnet aperture. Rotating the coil produces a time 
dependent voltage which includes all fields within the cross section of the coil and 
integrated along the length of the coil. The length of most coils extends well beyond 
the ends of the magnet while very short coils may be used to specifically probe local 
fields like fringe fields in the ends of magnets. As the coil rotates the induced voltage 
is recorded measuring the integrated field along the length of the coil. The induced 
voltage is V = -2 and the magnetic flux 


P= Lar | BO) ds, (6.45) 


where Leg is the effective length of the magnet. The integration is taken from the 


axis to the radial extent of the coil. With _ = a? as = LerB (s) 7 the induced 


voltage is V = —LerB (s) 7 and the integrated voltage is 
/ Vdt = -Les | B(s) ds. (6.46) 
0 
With B, = 7 and By, = -". we get B(s) = —B, sin @ + By cos@ = 4 and 
/ Vdt = —Legr A (A) = > [Pn COs (nO a Vn) + gn sin (nO + Wn) ’ (6.47) 


where we have also introduced the Fourier transform of the signal. The vector 
potential is used to determine the fields because of simplicity of math. The Fourier 
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transform will help us to determine the multipole strength and orientation. For the 


n-multipole 
/ Vdt 


SS effAn (6.48) 


n 


II 


—Lesr |Cn| rj (cos n@ cos W, — sinné sin W,) 


= Pn COs (nO + Wn) So dn sin (n + Wn) ? 


where ro is the radius of the coil and A, = ReP, = |C,|1rj cos (nO + w,). To 
maximize the signal, the coil radius r9 should be about 80 % of the aperture radius. 
Larger coils would not fit the magnet aperture. The phase 7, defines the orientation 
of the n-multipole. From (6.48) the multipole strength is 


Pat 
Lett |Cy| = ere (6.49) 
0 
and the orientation 
Wy = — arctan Ls (6.50) 
Pn 


From the Fourier coefficients (p,, gn) of the measured signal { Vdt and knowledge 
of the coil size ro we can determine the strength C, and orientation y, of all 
multipole limited only by the sensitivity of the experimental setup. The magnetic 
fields are given by 


Bux — iBry = iP! = inC, |z|"* &-D? (6.51) 
= n|C,| 7! {—sin [(n— 1) 6 + Wy] + icos[(n — 1) 6 + Y]} 


or 
Bee = nr! [—Un cos(n—1)0+A,sin(n—1) 6], and 
Bry = —nr,—' [+An cos (n— 1) @ — fy sin (n — 1) 4]. (6.52) 
with 
Brain (To) 


IC | sin Wn = —Un = An nr! 
Brain (10) 


n—1 
nro 


IC, | COS Wn = +An = —Dn (6.53) 
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Fig. 6.15 Twin coil to determine higher order multipole components 


The field components at an arbitrary radius r are finally 


Bryx es . 
== (=) [b, sin(n — 1) 9 + a,cos(n—1)6], and 


Brain To 
B r n-l 

= (<) [b, cos (n — 1) 0 + an sin (n— 1) 6], (6.54) 
Brain TO 


where Brain is the main magnet field at 7).The signal obtained from a rotating coil 
can be used to determine the strength and orientation of higher multipoles. 


Practical Considerations 


The signals from higher multipoles are measured in the presence of the strong main 
field. The dynamic range of the equipment and integrator may not be wide enough 
to yield precise multipole information. It would be a great advantage if the signal 
from the main field could be compensated or at least be reduced to the level of the 
multipole signal. This is possible with multiple coils as shown in Fig. 6.15. 

Here the signals from two coils are processed such that the main field is “bucked” 
out. There is an outer coil at r;, 73 and an inner coil 72, 74. The signal from the outer 


coil is 
, Vdt 


and from the inner coil 


| Vdt 


= Le mo Y~ [Cal (71 — 74) cos (08 + Wn) (6.55) 


outer coil 


= Lem Y > |Cy| (75 — 1) cos (nO + Yn) (6.56) 


inner coil 
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forming a combined signal 
/ Vdt 


Here, m, and mj are the turns in the outer and inner coil, respectively. Defining 


Bi = |2|,B1 = 


signal from the uncompensated outer coil 


/ Vdt 


and for the compensated coil signal 


/ Vdt 


The signal sensitivity for the uncompensated coil is 


= Lett) |Cul [mo (Hf — 74) + mi (75 — 1) ] cos (nd + Wn). 


compensated n>0 


(6.57) 


r3 
Tr! 


r4 
r2 


p= = and = - the combined signal (6.57) is for the 


= Let Y | |Cul r1Sw cos (nO + Yn) (6.58) 


uncompensated n>0 


= Let mo Y > [Cul 7} 5p cos (08 + Wn). (6.59) 


compensated n>0 


Sy = 1—(—B1)* (6.60) 


where N represents the order of the main magnet field and the compensated coil has 
the sensitivity s, for the nth-order multipole 


Sn = 1— (—B1)" — wp” [1 — —B2)"1, (6.61) 


where 7 represents the mth order multipole. By choosing parameters such that s, 
becomes zero for the desired values of n, we may eliminated electronically the large 
signal from the main magnet field. For example, in case of a quadrupole, we would 
like to compensate the quadrupole field and the dipole field which may appear as a 
“spill-down” from a misaligned quadrupole. In this case, we would want to set 5; = 
1+ 6: —up[1 + Bo] © Oand s2 = 1 — BY — wp[1 — B53] = 0 and build a specific 
measurement coil for quadrupoles. Selecting arbitrarily 4 = 2 and p = 0.625 the 
desired sensitivity will be zero with 6; = 0.5 and 62 = 0.2. All other sensitivities 
are at least 60 % and well known to be included in the analysis. It is not necessary 
that the main fields are bucked perfectly. It’s sufficient if their signal is reduced to 
the level of the higher order multipole signals. 

The whole magnetic measurement would record the signals from both coils 
separately and produce the strength and orientation of the main field forn = N 
according to (6.49) and (6.50) while the same multipole parameters are derived from 
the same equations based on the compensated signal and including the calculated 
sensitivities. 
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Magnetic field measurements have developed very far and have reached a level of 
accuracy and precision that fully meets the demands of beam dynamics. Especially, 
the determination of the multipole content is important to ensure the stability of a 
beam in, for example, a storage ring. While the effects of multipole fields cannot 
be analyzed analytically, we may track particles many times around the storage 
ring in the presence of these multipole fields and thus define beam stability and the 
dynamic aperture. 


6.4 General Transverse Magnetic-Field Expansion* 


In the previous section, we discussed solutions to the Laplace equation which 
included only pure transverse multipole components in a cartesian coordinate 
system thus neglecting all kinematic effects caused by the curvilinear coordinate 
system of beam dynamics. These approximations eliminate many higher-order 
terms which may become of significance in particular circumstances. In preparation 
for more sophisticated beam transport systems and accelerator designs aiming, for 
example, at ever smaller beam emittances it becomes imperative to consider higher- 
order perturbations to preserve desired beam characteristics. To obtain all field 
components allowed by the Laplace equation, a more general ansatz for the field 
expansion must be made. Here we restrict the discussion to scalar potentials only 
which are sufficient to determine all fields [5, 6]. 

Since we use a curvilinear coordinate system for beam dynamics, we use the 
same for the magnetic-field expansion and express the Laplace equation for the 
complex potential P in these curvilinear coordinates 


1f 0d (/av 0 (av d (lov 
av= cle (ee)te(sel te (; =) |=° — 


where h = 1+k,x+kyy and K,, Ky the ideal curvatures in the horizontal and vertical 
plane, respectively. We also assume that the particle beam may be bend horizontally 
as well as vertically. For the general solution of the Laplace equation (6.62) we use 
an ansatz in the form of a power expansion 


ag’ yg=->) > A a (6.63) 
p.qz0 


where we have added the beam rigidity to facilitate the quantities for application in 
beam dynamics and where the coefficients A,,(z) are functions of z. Terms with 
negative indices p and g are excluded to avoid nonphysical divergences of the 
potential at x = 0 or y = O. We insert this ansatz into (6.62), collect all terms 
of equal powers in x and y and get 


Pea oo SIGS aa =o (6.64) 


p=0 q>0 
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where {F,,} represents the collection of all coefficients for the term x’ y’. For (6.64) 
to be true for all values of the coordinates x and y, we require that every coefficient 
Fyq must vanish individually. Setting F,, = 0 leads to the recursion formula 
Apg+2 + Apt2g = —Kx(3p + DAptig — Ky3q + DApo+1 

—3kyGAp+42.q—-1 — 3KxPAp—1.9+2 

—2kKoyq(3p + IAp+ig—1 — 2Kxkyp (3g + IAp-1.941 

— 3K, q(q — VAp42.9—2 — 3K2P(P — WAp—2.942 

—K}p(p? — 3p + 2)Ap—3.g42 — Ky q(q° — 3g + 2)Ap+2.9-3 

=k, 9(q — 1 + 3pq— 3p)Ap+i.g-2 (6.65) 

—KiKyp(p — 1 + 3pq — 39g) Ap—2.9+1 

—Kyq(BKyp’ — Kip + Kg — 2K g + Ap g-1 

—KxpBKg - Kq + Kp” - 2«2p + K)A Hg 

—(3p — 1)pK2A,, — (3q— 1)qK Ang 


Aig = GPA, 44 a KyGA) g—1 = CPAs -_ Kg = 
which allows us to determine all coefficients A,,. We note that all terms on the right 
hand side are kinematic terms originating from the curvilinear coordinate system. 
The derivatives, indicated by a prime, are understood to be taken with respect to the 
independent variable z, like A’ = dA/dz, etc. Equation (6.65) is a recursion formula 
for the field coefficients A,, and we have to develop a procedure to obtain all terms 
consistent with this expression. 


6.4.1 Pure Multipole Magnets 


The Laplace equation is of second order and therefore we cannot derive coefficients 
of quadratic or lower order from the recursion formula. The lowest-order coefficient 
Aoo represents a constant potential independent of the transverse coordinates x and 
y and since this term does not contribute to a transverse field component, we will 
ignore it in this section. However, since this term depends on z we cannot neglect 
this term where longitudinal fields such as solenoid fields are important. Such fields 
will be discussed separately in Sect. 6.6 and therefore we set here for simplicity 


Ago = 0. (6.66) 
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The terms linear in x or y are the curvatures in the horizontal and vertical plane as 
defined previously 


Ajo = —Ky and = Agi = ky, (6.67) 
and 
=-x"= +42 i e -1 
ns <n 2a (6.68) 
k= -y"= — 5 Bx with . rl =a 


Finally, the quadratic terms proportional to x and y are identical to the quadrupole 
strength parameters 


An =—k, Ay =k, Ao =k. (6.69) 


With these definitions of the linear coefficients, we may start exploiting the 
recursion formula. All terms on the right-hand side of (6.65) are of lower order than 
the two terms on the left-hand side which are of order n = p + q + 2. The left-hand 
side is composed of two contributions, one resulting from pure multipole fields of 
order n and the other from higher-order field terms of lower-order multipoles. 

In (6.65) we identify and separate from all other terms the pure multipole terms 
of order n which do not depend on lower-order multipole terms like kinematic terms 
by setting 


Ap+2,q,n + Apg+2,n =0 forp+q+2<n (6.70) 


and adding the index v to indicate that these terms are the pure nth-order multipoles. 
Only the sum of two terms can be determined which means both terms have the 
same value but opposite signs. For n = 3 we have, for example, A379 = —Aj2 or 
Az, = —Ao3 and a comparison with the potentials of pure multipoles of Table 6.5 
shows that A39 = —m_and Az; = m. Similar correlations can be formulated for all 
higher order multipole.” Analogous to dipoles and quadrupole magnets, we may get 
potential expressions for all other multipole magnets. The results up to fifth order 
are compiled in Table 6.6. 

Each expression for the magnetic potential is composed of both the real and the 
imaginary contribution. Since both components differ only by a rotational angle, real 
magnets are generally aligned such that only one or the other component appears. 
Only due to alignment errors may the other component appear as a field error which 
can be treated as a perturbation. 


Consistent with the definitions of magnet strengths, the underlined quantities represent the magnet 
strengths of rotated multipole magnets. 
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Table 6.5 Correspondence between the potential coefficients and multipole strength parameters 


Aoo 
Ajo Aol 
A20 Au Ao2 
A30 Ad Aj2 Ao3 
Ago A31 Ay Aj3 Aoa 
Aso Agi A32 Ao3 Ais Aos 
t 
0 
—Ky Ky 
—k k k 
—m m m —m 
ih r r r = 6 
—d d d —d —d d 
Table 6.6 Magnetic multipole potentials 
Dipole _ oa Vi = kx + ky 
Quadrupole £VW, = $k (x? — y*) + ky, 
Sextupole = V3 = im (<3 — 3xy?) + im (3x°y — y9), 
Octupole V4 = ur (x* 6x7? + y’) + ur (Fy -_ xy) ; 
Decapole V5 = aod (x° 10x37 4 5xy*) t ad (Sx4y 10x? y° 4 y>) 


6.4.2 Kinematic Terms 


Having identified the pure multipole components, we concentrate now on using the 
recursion formula for other terms which so far have been neglected. First, we note 
that coefficients of the same order n on the left-hand side of (6.65) must be split 
into two parts to distinguish pure multipole components Aj, of order n from the 
nth-order terms Ai of lower-order multipoles which we label by an asterisk *. Since 
we have already derived the pure multipole terms, we explore (6.65) for the A* 
coefficients only 


AD oe + A5+24 = rh.s. of (6.65). (6.71) 
For the predetermined coefficients Ajo, Ao; and A; there are no corresponding 
terms A* since that would require indices p and q to be negative. For p = 0 and 


gq = 0 we have 


Ago + Ajo = —KorA10 — KoyAo1 = 0. (6.72) 
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This solution is equivalent to (6.70) and does not produce any new field terms. 
The next higher-order terms for p = 0 and gq = 1 or for p = 1 andqg = O are 
determined by the equations 


A*, +A*, = —k, @— Kyg—K! = C, 

03 + 21 oxS ~ Koy8 Ky ; (6.73) 

Aj) + Aggy = —Koy8 + Korg + ky = D, 

where we set in preparation for the following discussion the right-hand sides equal 

to the as yet undetermined quantities C and D. Since we have no lead how to separate 
the coefficients we set 


a a 29D = = (1 —)D. on 


where 0 < (f,g) < 1 andf = g. The indeterminate nature of this result is an 
indication that these terms may depend on the actual design of the magnets. 

Trying to interpret the physical meaning of these terms, we assume a magnet 
with a pure vertical dipole field in the center of the magnet, B,(0,0,0) ¢ 0, but no 
horizontal or finite longitudinal field components, B,(0, 0,0) = 0 and B,(0, 0,0) = 
0. Consistent with these assumptions the magnetic potential is 


ec * * 

BEY OD = Anny — 5AR ey — SADA (6.75) 
— gAjox’ — gAosy” + O(4). 

From (6.73) we get D = 0, C = —B’ and with (6.74) A}, = A}, = 0. The 

magnetic-field potential reduces therefore to 


ec 
BEY y.2) = kay + Hf? y + fly’ (6.76) 
and the magnetic-field components are 


peBx = —fKl'xy, 


77 
BeBy = +Kx— shal? — 51 —f)i’y?. one 


The physical origin of these terms becomes apparent if we investigate the 
two extreme cases for which f = 0 or f = I separately. For f = 0 


the magnetic fields in these cases are (68x = 0, aE By =k, — 341”) and 
ec 


(452: = —K" xy, BEBy = Kx — Kix?) for f = 1. Both cases differ only in the 
«terms describing the magnet fringe field. In the case of a straight bending 


magnet (B, 4 0) with infinitely wide poles in the x-direction, the horizontal field 
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side view frontal view 


Fig. 6.16 Dipole end field configuration for f = 0 


side view frontal view 


Fig. 6.17 Dipole end field configuration for 0 < f < | 


component B, must vanish consistent with f = 0. The field configuration in the 
fringe field region is of the form shown in Fig. 6.16 and independent of x. 

Conversely, the case 0 < f < | describes the field pattern in the fringe field 
of a bending magnet with poles of finite width in which case finite horizontal field 
components B, appear off the symmetry planes. The fringe fields not only bulge 
out of the magnet gap along z but also spread horizontally due to the finite pole 
width as shown in Fig. 6.17, thus creating a finite horizontal field component off the 
midplane. While it is possible to identify the origin of these field terms, we are not 
able to determine the exact value of the factor f in a general way but may apply 
three-dimensional magnet codes to determine the field configuration numerically. 
The factor f is different for each type of magnet depending on its actual mechanical 
dimensions. 

Following general practice in beam dynamics and magnet design, however, we 
ignore these effects of finite pole width, since they are specifically kept small by 
design, and we may set f = g = 0. In this approximation we get 


A3) = i =0 (6.78) 
and 


Aj3 = —K,K—-Kyk — KX, 
Snes ean a, 6.79 

Ady = —Kyk + kk + KN, we 
Similar effects of finite pole sizes appear for all multipole terms. As before, we 
set f = 0 for lack of accurate knowledge of the actual magnet design and assume 
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that these terms are very small by virtue of a careful magnet design within the 
good field region provided for the particle beam. For the fourth-order terms we have 


therefore with A>, = 0 and 


Ajp = Kxim — Kym — 4kykyk + 42k + KU! + 2k yk + 2K, (6.80) 
Aja = Kym — Kym — Ak,Kyk — 42k — Kl — yal! — 2K! !. 


In the case p = q, we expect Aj, = Aj; from symmetry and get 


2AT, = 2A3, = —Kym — Kym + 22k + 2k — Ke 
F 2k yk) — Qe — Kk + Kye. (6.81) 
With these terms we have finally determined all coefficients of the magnetic 
potential up to fourth order. Higher-order terms can be derived along similar 


arguments. Using these results, the general magnetic-field potential up to fourth 
order is from (6.63) 


é 


BEV 2) = +Ajox + Aory (6.82) 


+ 4A29x7 + SAooy? + Atixy 
+ ZA30x° + SAri?y + FApxy’ + ZAosy" 
+ tA3ox + A53)" (6.83) 
+ sqA4ox" + tAaxy + Anxy + ZAi3xy" 
+ Ht Aoay* + HAlox* + FAS ay? + y°) + HAQyy* + O(5). 
From the magnetic potential we obtain the magnetic field expansion by differen- 


tiation with respect to x or y for B, and B,, respectively. Up to third order we obtain 
the transverse field components in energy independent formulation 


ape = iy — ke + ky (6.84) 
- tmx —y’) + mxy + 5 (—Kyk + kk + ko) 
— gr(x° — 3xy’) — ér(y’ — 32’) 
— tlk + Kym + 27k + Wok + RY — KyKy 
+ ky + Ke? — Ky) (3x°y + y%) 
ar e (Kum — Kym — Ak gkyk + 442k 


+ RY + kyl + 2K) + O(4) 
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and 


ec 


BED) = beth + be (6.85) 


+ bm? =) + may — Heeb gk + ey 
+ ir x? — 3xy’) — ary” = 349) 

— p(k + kya + ZK + Wok + KY — Ky 
+ Kyky Hi? — K2)Q? + 3xy’) 

+ (ky — Kym — Ak ky k — 44erk 


" ry 3 
—K" — 2kyk, — 2K, )Y + O(4), 
where m = 583 andr = ps4 The third component of the gradient in a curvilinear 


coordinate system is B, = — 7 av and collecting all terms up to second order we get 


ape = +KY KKH (Kyk) = yk, + xy 


+ (kek — 5K)? = (ky, — SKY + OB). (6.86) 


Upon closer inspection of (6.84)—(6.86) it becomes apparent that most terms 
originate from a combination of different multipoles. These equations describe 
the general fields in any magnet, yet in practice, special care is taken to limit the 
number of fundamentally different field components present in any one magnet. In 
fact most magnet are designed as single multipoles like dipoles or quadrupoles or 
sextupoles etc. A beam transport system utilizing only such magnets is also called 
a separated-function lattice since bending and focusing is performed in different 
types of magnets. A combination of bending and focusing, however, is being used 
for some special applications and a transport system composed of such combined- 
field magnets is called a combined-function lattice. Sometimes even a sextupole 
term is incorporated in a magnet together with the dipole and quadrupole fields. 
Rotated magnets, like rotated sextupoles s3 and octupoles s4 are either not used or 
in the case of a rotated quadrupole the chosen strength is generally weak and its 
effect on the beam dynamics is treated by perturbation methods. 

No mention has been made about electric field patterns. However, since the 
Laplace equation for electrostatic fields in material free areas is the same as for 
magnetic fields we conclude that the electrical potentials are expressed by (6.82) 
as well and the electrical multipole field components are also given by (6.84)— 
(6.86) after replacing the magnetic field (B,, By, B,) by electric-field components 
(Ex, Ey, Ez). 
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Fig. 6.18 Frenet-Serret 
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6.5 Third-Order Differential Equation of Motion* 


Equations of motions have been derived in Chap. 5 for the transverse (x, z) and 
(y, z) planes up to second order which is adequate for most applications. Sometimes, 
however, it might be desirable to use equations of motion in higher order of 
precision or to investigate perturbations of higher order. A curvilinear Frenet-Serret 
coordinate system moving along the curved trajectory of the reference particle ro (z), 
was used and we generalize this system to include curvatures in both transverse 
planes as shown in Fig. 6.18. 

In this (x, y, z)-coordinate system, a particle at the location s and under the 
influence of a Lorentz force follows a path described by the vector r as shown in 
Fig. 6.18. The change in the momentum vector per unit time is due only to a change 
in the direction of the momentum while the magnitude of the momentum remains 
unchanged in a static magnetic fields. Therefore p = pdr/ds where p is the value 
of the particle momentum and dr/ds is the unit vector along the particle trajectory. 
With 2 = # Be, where tT = Be? the particle velocity v; = _ = ¢ Bc, and we 
obtain the differential equation describing the particle trajectory under the influence 


of a Lorentz force F,. From # = F, = e[v, x B] we get 


d’r ec [dr on (6.87) 
ds? BE | ds 


and to evaluate (6.87) further, we note that 


dr dr/dz_r (6.88) 
ds ds/dz ss’ 


and 


dr id/(/r 
= _— (5) . (6.89) 


ds?! dz 
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With this, the general equation of motion is from (6.87) 


ar 1 drds? ec _,[dr 
= = Ss xB 
dz? 2s? dz dz BE |dz 


In the remainder of this section, we will re-evaluate this equation in terms of more 
simplified parameters. From Fig. 6.18 or (4.21) we have r = rp + xu, + ylly , where 
the vectors u,, “, and u, are the unit vectors defining the curvilinear coordinate 
system. To completely evaluate (6.89), the second derivative d*r/dz? must be 
derived from (4.23) with du, = —K,u,dz — kyuydz and h = 1 + kx + kyy for 


ar 


= (x" — Kyh)uy + (" — Kyh)ily + (2k x! + ky’ + Kx t+ Ky)uz, (6.90) 


and (6.89) becomes with (4.23) and s/* = r’? 


x! d si? y’ d g2 
IN Wy 
(x —K,h— 52 rs ) uy + (> —kKyh— 552 a ) ly 


+ (210! + 2en! tet y- SE w = 3° |Z x8 (6.91) 
x y. x y: 2 dz Zz B E dz . : 

Here the quantities «, and x, are the curvatures defining the ideal particle 
trajectory or the curvilinear coordinate system. This is the general equation of 
motion for a charged particles in a magnetic field B. So far no approximations have 
been made. For practical use we may separate the individual components and get 
the differential equations for transverse motion 


1 x’ ds? e 


I) c 4 a 
» ie th 2 "dz = BE ly B, = hBy), (6.92a) 
1 y’ ds? ec 
"eh — —— — = —s'[hB, — x’B.]. 6.92b 
y Ky 2 sf dz BE | J x 2] ( ) 


Chromatic effectsoriginate from the momentum factor BE which is different for 


particles of different energies. We expand this factor into a power series in 6 


ec e 
= (1640 =0 4.4), (6.93) 
BE po 

where 6 = Ap/po and cpo = fEbo is the ideal particle momentum. A further 

approximation is made when we expand s’ to third order in x and y while restricting 

the discussion to paraxial rays with < landy <1 


SReht 5? +y?)\L— kx — Ky) +... (6.94) 
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Evaluating the derivative ds’*/dz? second-order derivatives x” and y’ appear 
which, neglecting fourth-order terms, can here be replaced by the unperturbed 
equations of motion x" +(k2+k)x = 0 and y”+(k?+k)y = 0. For the magnetic field 
components, we insert in (6.92a), (6.92b) expressions (6.84)-(6.86) while making 
use of (6.93) and (6.94). Keeping all terms up to third order in x, y,x’,y’ and 6, 
we finally obtain equations of motion for a particle with charge e in an arbitrary 
magnetic field derivable from a scalar potential. For the horizontal and vertical plane 
the general equations of motion in a curvilinear coordinate system and including 
chromatic terms up to third-order in (x, y, 5) are (6.95) and (6.96), respectively. 


x" + (2 + Wx = Kb — 8" + 8? — (K+ keeky)y (6.95) 
= (m+ kek + Qkkyk + 2«2ky)xy — Sx —y’) 
— («2 + 2K,k)x? — (Krk, — hk k + Zkyk— 5K) Y? 
+5K(x —y") + KL Qa! + yy) +h ('y — xy) + gy 
—trx( — 3y’) + Ery(" — 3x’) 

” 


+5 (Kym — Mm + 2 pk — 1O0KzK +k! — Kyke, 


+KxK! — ee Kn)? 
—(2kym + kym + Kk + 2k Kyk) xy 
+4(Skm — Tkym + 6K2k +k" — Kyky — 2K yk 
+5K Ky + Kl? - ae — KKyk) xy" 
+2 (10K ek yk + 8k Ky + Kym + 445k +k" + 2khK) + Sky) y° 
2 


—(2K? + 3k) xx? — (KiKy + KyK}) XX Y = Kk eX 


tk x?yl — Kyk xy" = Kekyx'y! — 3 (k + 3k ky) x7y 

+k’ xyy! — 5 (k + «Z)xy? — (2K) — Kx’yy! + 3kyy! — Skyy? 
+ (2K; + k)x8 + (2kxky + Kyd — Klyy’d + Kjxy’d 

thie x” + ys + (3kyk + Kxky = 5 Krk = 5K = sm)y 6 
+(4m + kek + K3)x78 + (m+ 2k2ky + Wyk + 2 k)xyvd 
—(k + 2K2)x8? — (k + 2kxky)yd? + O(4). 


y" + (ko — By = +5 — Ky? + 8? — (KF kky)x (6.96) 


+ (m — 2kyk + 2Kk — 2k xk, XY - 5 m(x* — y’) 
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- (k; — 2kyk)y* — (keky + Zkyk + kk — 5Ky ew 
— 5y(x? — y?) + Ky (xx + yy’) — KLO’y — xy’) + kyr’y’ 
— gry" — 327) — grxQ? — 3y’) 
— tlm — Vkym + 2«Zk — LOK K +R! = Ky 

+ ky FKP KP)? 
+ Qkym + Kym — ick + 2kKyk) xy’ 
— 4 (Sky — Tym + 65k +R + kk! — 22k 

—Sityky — Ke +? + kgkyk) xy 
+4 (-L0kkyk + 8k xk, —Kym+ ARK + kK + 2K, — 5kym) x° 
— (25 — 5k — Wyy? — (kek), + Ky ayy’ — kykhy?y! 

<A xy? = kek yl — Kxkyx’ yy! — £ (ke + 3k xy) xy? 

—k'xx'y + $(k — Ke )x?y — (2x2 + k)xxy +} $k'x i = Lexx? 
+(2K; — ky + (2k ky + k)x8b — Kix'5 + Kix yd 
thay (x? ee (Bick + Keky + dkyk — 3K, + }m)x°5 
+(- sm — 2kyk + K)y"6 —(m— 2k Ky + 2k,k — 2Kyk)xyd 
+(k — 2k) y8? — (k + 2kyky)x8? + O(4). 


In spite of our attempt to derive a general and accurate equation of motion, 
we note that some magnet boundaries are not correctly represented. The natural 
bending magnet is of the sector type and wedge or rectangular magnets require 
the introduction of additional corrections to the equations of motion which are not 
included here. This is also true for cases where a beam passes off center through a 
quadrupole, in which case theory assumes a combined function sector magnet and 
corrections must be applied to model correctly a quadrupole with parallel pole faces. 
The magnitude of such corrections is, however, in most cases very small. Equation 
(6.95) shows an enormous complexity which in real beam transport lines, becomes 
very much relaxed due to proper design and careful alignment of the magnets. 
Nonetheless (6.95) and (6.96) for the vertical plane, can be used as a reference 
to find and study the effects of particular perturbation terms. In a special beam 
transport line one or the other of these perturbation terms may become significant 
and can now be dealt with separately. This may be the case where strong multipole 
effects from magnet fringe fields cannot be avoided or because large beam sizes 
and divergences are important and necessary. The possible significance of any 
perturbation term must be evaluated for each beam transport system separately. 
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In most beam transport lines the magnets are built in such a way that different 
functions like bending, focusing etc., are not combined thus eliminating all terms 
that depend on those combinations like «,k,, Kk or mk, etc. As long as the terms 
on the right-hand sides are small we may apply perturbation methods to estimate 
the effects on the beam caused by these terms. It is interesting, however, to try to 
identify the perturbations with aberrations known from light optics. 

Chromatic terms x,(6 — b+ 3), for example, are constant perturbations for off 
momentum particles causing a shift of the equilibrium orbit which ideally is the 
trivial solution x = 0 of the differential equation x” + (k+ K )x = 0. Of course, this 
is not quite true since «, is not a constant but the general conclusion is still correct. 
This shift is equal to Ax = k,(6 — 5° + 6°)/(k + 2) and is related to the dispersion 
function D by D = Ax/6. In light optics this corresponds to the dispersion of colors 
of a beam of white light (particle beam with finite energy spread) passing through 
a prism (bending magnet). We may also use a different interpretation for this term. 
Instead of a particle with an energy deviation 6 in an ideal magnet «, we can interpret 
this term as the perturbation of a particle with the ideal energy by a magnetic field 
that deviates from the ideal value. In this case, we replace x, (6 — 6* — 6°) by — Ak, 
and the shift in the ideal orbit is then called an orbit distortion. Obviously, here and 
in the following paragraphs the interpretations are not limited to the horizontal plane 
alone but apply also to the vertical plane caused by similar perturbations. Terms 
proportional to x* cause geometric aberrations, where the focal length depends on 
the amplitude x while terms involving x’ lead to the well-known phenomenon of 
astigmatism or a combination of both aberrations. Additional terms depend on the 
particle parameters in both the vertical and horizontal plane and therefore lead to 
more complicated aberrations and coupling. 

Terms depending also on the energy deviation 6, on the other hand, give rise 
to chromatic aberrations which are well known from light optics. Specifically, 
the term (k + 2«)xé is the source for the dependence of the focal length on the 
particle momentum. Some additional terms can be interpreted as combinations of 
aberrations described above. 

It is interesting to write down the equations of motion for a pure quadrupole 
system where only k 4 0 in which case (6.95) becomes 


x! + kx = kx(8 — 8 — 8°) (6.97) 
— Dk x(x? + 3y’) — 3k xx? + kx'yy! + k’xyy’ + O(4). 


We note that quadrupoles produce only second order chromatic aberrations and 
geometric perturbations only in third order. 


6.6 Longitudinal Field Devices 165 
6.6 Longitudinal Field Devices 


General field equations have been derived in this chapter with the only restriction 
that there be no solenoid fields, which allowed us to set Ag9 = O in (6.66), and 
concentrate on transverse fields only. Longitudinal fields like those produced in a 
solenoid magnet are used in beam transport systems for very special purposes and 
their effect on beam dynamics cannot be ignored. We assume now that the lowest- 
order coefficient Ago in the potential (6.63) does not vanish 


Aoo(z) # 0. (6.98) 


Longitudinal fields do not cause transverse beam deflection although there can 
be some amplitude dependent focusing or coupling. We may therefore choose a 
cartesian coordinate system along such fields by setting «. = kK, = O, and the 
recursion formula (6.65) reduces to 


Ag2 + A29 = —Ajo - (6.99) 
Again, we have a solution where Ago + Azo = 0, which is a rotated quadrupole 
as derived in (6.25) and can be ignored here. The additional component of the field 
is Aj, + Aj) = —Ajp and describes the endfields of the solenoid. For reasons of 

symmetry with respect to x and y we have Aj, = A}, and 
Ajo = Ady = —5AGo- (6.100) 

With this, the potential (6.63) for longitudinal fields is 

— V(t, ¥52) = Aoo — GAQo@ + ¥°) = Aoo — ZAG?” (6.101) 


where we have made use of rotational symmetry. The longitudinal field component 
becomes from (6.101) in linear approximation 


B, = +A (6.102) 
and the transverse components 
B= —SAgor = —SBir, (6.103) 
By= 0. 


The azimuthal field component obviously vanishes because of symmetry. Radial 
field components appear whenever the longitudinal field strength varies as is the 
case in the fringe field region at the end of a solenoid shown in Fig. 6.19. 

The strength Bo in the center of a long solenoid magnet can be calculated in the 
same way we determined dipole and higher-order fields utilizing Stokes’ theorem. 
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Fig. 6.19 Solenoid field integration path ’ \ 
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The integral ¢ Bdz is performed along a path as indicated in Fig. 6.19. The only 
contribution to the integral comes from the integral along the field at the magnet 
axis. All other contributions vanish because the integration path cuts field lines at 
a right angles, where Bdz = 0 or follows field lines to infinity where B, = 0. We 
have therefore 


fz z= BoAz = pop Az, (6.104) 


where J is the total solenoid current per unit length. The solenoid field strength is 
therefore given by 


By (x = 0, y = 0) = poptyd. (6.105) 


The total integrated radial field { B,dz can be evaluated from the central field 
for each of the fringe regions. We imagine a cylinder concentric with the solenoid 
axis and with radius r to extend from the solenoid center to a region well outside 
the solenoid. In the center of the solenoid a total magnetic flux of 27r7Bo enters this 
cylinder. It is clear that along the infinitely long cylinder the flux will exit the surface 
of the cylinder through radial field components. We have therefore 


CO 
nPBy = | 2rB,(r)dz, (6.106) 
0 


where we have set z = 0 at the center of the solenoid. The integrated radial field per 
fringe field is then 


lo, e) 
iy B,(r)dz = —4Bor. (6.107) 
0 


The linear dependence of the integrated radial fields on the distance r from 
the axis constitutes linear focusing capabilities of solenoidal fringe fields. Such 
solenoid focusing is used, for example, around a conversion target to catch a highly 
divergent positron beam. The positron source is generally a small piece of a heavy 
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metal like tungsten placed in the path of a high energy electron beam. Through 
an electromagnetic cascade, positrons are generated and emerge from a point like 
source into a large solid angle. If the target is placed in the center of a solenoid 
the radial positron motion couples with the longitudinal field to transfer the radial 
particle momentum into azimuthal momentum. At the end of the solenoid, the 
azimuthal motion couples with the radial field components of the fringe field to 
transfer azimuthal momentum into longitudinal momentum. In this idealized picture 
a divergent positron beam emerging from a small source area is transformed or 
focused into a quasi-parallel beam of larger cross section. Such a focusing device is 
called a A /4-lens, since the particles follow one quarter of a helical trajectory in the 
solenoid. 

In other applications large volume solenoids are used as part of elementary 
particles detectors in high energy physics experiments performed at colliding-beam 
facilities. The strong influence of these solenoidal detector fields on beam dynamics 
in a storage ring must be compensated in most cases. In still other applications 
solenoid fields are used just to contain a particle beam within a small circular 
aperture like that along the axis of a linear accelerator. 


6.7 Periodic Wiggler Magnets 


Particular arrays or combinations of magnets can produce desirable results for a 
variety of applications. A specially useful device of this sort is a wiggler magnet 
[7] which is composed of a series of short bending magnets with alternating field 
excitation. Most wiggler magnets are used as sources of high brightness photon 
beams in synchrotron radiation facilities and are often also called undulators. There 
is no fundamental difference between both. We differentiate between a strong field 
wiggler magnet and an undulator, which is merely a wiggler magnet at low fields, 
because of the different synchrotron radiation characteristics. As long as we talk 
about magnet characteristics in this text, we make no distinction between both types 
of magnets. Wiggler magnets are used for a variety of applications to either produce 
coherent or incoherent photon beams in electron accelerators, or to manipulate 
electron beam properties like beam emittance and energy spread. To compensate 
anti-damping in a combined function synchrotron a wiggler magnet including a 
field gradient has been used for the first time to modify the damping partition 
numbers [8]. In colliding-beam storage rings wiggler magnets are used to increase 
the beam emittance for maximum luminosity [9]. In other applications, a very 
small beam emittance is desired as is the case in damping rings for linear colliders 
or synchrotron radiation sources which can be achieved by employing damping 
wiggler magnets in a different way [10]. 

Wiggler magnets are generally designed as flat magnets as shown in Fig. 6.20 
[7] with field components only in one plane or as helical wiggler magnets [11- 
13] where the transverse field component rotates along the magnetic axis. In this 
discussion, we concentrate on flat wigglers which are used in growing numbers to 
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Fig. 6.20 Permanent magnet 
wiggler showing the 
magnetization direction of 
individual blocks (schematic) 
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path 
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generate, for example, intense beams of synchrotron radiation from electron beams, 
to manipulate beam parameters or to pump a free electron laser. 


6.7.1 Wiggler Field Configuration 


Whatever the application may be, the wiggler magnet deflects the electron beam 
transversely in an alternating fashion without introducing a net deflection on the 
beam. Wiggler magnets are generally considered to be insertion devices installed in 
a magnet free straight section of the lattice and not being part of the basic magnet 
lattice. To minimize the effect of wiggler fields on the particle beam, the integrated 
magnetic field through the whole wiggler magnet must be zero 


/ Bidz=0. (6.108) 


wiggler 


Since a wiggler magnet is a straight device, we use a fixed cartesian coordinate 
system (x,y,z) with the z-axis parallel to the wiggler axis to describe the wiggler 
field, rather than a curvilinear system that would follow the oscillatory deflection 
of the reference path in the wiggler. The origin of the coordinate system is placed 
in the middle of one of the wiggler magnets. The whole magnet may be composed 
of N equal and symmetric pole pieces placed along the z-axis at a distance Ap/2 
from pole center to pole center as shown in Fig. 6.21. Each pair of adjacent wiggler 
poles forms one wiggler period with a period length A, and the whole magnet is 
composed of N/2 periods. Since all periods are assumed to be the same and the 
beam deflection is compensated within each period no net beam deflection occurs 
for the complete magnet. 

Upon closer inspection of the precise beam trajectory we observe a lateral 
displacement of the beam within a wiggler magnet. To compensate this lateral beam 
displacement, the wiggler magnet should begin and end with only a half pole of 
length A,/4 to allow the beams to enter and exit the wiggler magnet parallel with 
the unperturbed beam path. 

The individual magnets comprising a wiggler magnet are in general very short 
and the longitudinal field distribution differs considerable from a hard-edge model. 
In fact most of the field will be fringe fields. We consider only periodic fields which 
can be expanded into a Fourier series along the axis including a strong fundamental 
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Fig. 6.21 Field distribution 
in a wiggler magnet 
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component with a period length A, and higher harmonics expressed by the ansatz 
[14] 


By = Bo > bon4i(x,y) cos[(2n + 1kpz] , (6.109) 


n>=0 


where the wave number ky = 277/Ay. The functions b;(x, y) describe the variation 
of the field amplitude orthogonal to the beam axis for the harmonic 7. The content 
of higher harmonics is greatly influenced by the particular design of the wiggler 
magnet and the ratio of the period length to the pole gap aperture. For very 
long periods relative to the pole aperture the field profile approaches that of a 
hard-edge dipole field with a square field profile along the z-axis. For very short 
periods compared to the pole aperture, on the other hand, we find only a significant 
amplitude for the fundamental period and very small perturbations due to higher 
harmonics. 

We may derive the full magnetic field from Maxwell’s equations based on a 
sinusoidal field along the axis. Each field harmonic may be determined separately 
due to the linear superposition of fields. To eliminate a dependence of the magnetic 
field on the horizontal variable x, we assume a pole width which is large compared 
to the pole aperture. The fundamental field component is then 


By(y,z) = Bobi (y) cos kpz. (6.110) 
Maxwell’s curl equation is in the wiggler aperture V x B = 0, we. = os and with 
(6.110) we have : 
OB, OB, . 
= — = —Bob (y)kp sin kpz. (6.111) 


ay az 
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Integration of (6.111) with respect to z gives the vertical field component 
B, = —Bokpbi(y) / “sin kpZ dz. (6.112) 
0 


We have not yet determined the y-dependence of the amplitude function b; (y). From 
VB = 0 and the independence of the field on the horizontal position we get with 
(6.110) 


Og 2a. PE. ng OP) oe kpz.- (6.113) 


Forming the second derivatives 07 B./(dy dz) from (6.111), (6.113) we get for the 
amplitude function the differential equation 


07bi(y) 
dy? 


=Kb(y), (6.114) 


which can be solved by the hyperbolic functions 
bi(y) = acoshkpy + bsinhkpy . (6.115) 


Since the magnetic field is symmetric with respect to y = 0 and b\(0) = 1, the 
coefficients are a = | and b = 0. Collecting all partial results, the wiggler magnetic 
field is finally determined by the components 


By =0 ’ 
By, = Bocoshkpy coskpz, (6.116) 
B, = —Bosinhkpy sinkpyz, 


where B, is obtained by integration of (6.111) with respect to y. 

The hyperbolic dependence of the field amplitude on the vertical position intro- 
duces higher-order field-errors which we determine by expanding the hyperbolic 
functions 


= (ky) (kpy)* (kpy)® (kpy)8 
coshkpy = 1+ T + 4 + 6! + I at nae 5 (6.117) 


(kpy)? (kpy)? (kpy)! 
3! 5! 7! 


sinh kpy = +(kpy) + (6.118) 
Typically the vertical gap in a wiggler magnet is much smaller than the period 
length or y < A, to avoid drastic reduction of the field strength. Due to the 
fast convergence of the series expansions (6.117) only a few terms are required 
to obtain an accurate expression for the hyperbolic function within the wiggler 
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aperture. The expansion (6.117) displays higher-order field components explicitly 
which, however, do not have the form of higher-order multipole fields and we cannot 
treat these fields just like any other multipole perturbation but must consider them 
separately. 

To determine the path distortion due to wiggler fields, we follow the reference 
trajectory through one quarter period starting at a symmetry plane in the middle of a 
pole. At the starting point z = 0 in the middle of a wiggler pole the beam direction 
is parallel to the reference trajectory and the deflection angle at a downstream point 
zis given by 


3(2) = < ['s, (2) az = = By cosh py [ cosky2 dz (6.119) 
PJo — P 0 
el . 
= rae cosh kpy sin kpz. 
Pp 


The maximum deflection angle is equal to the deflection angle for a quarter period 
or half a wiggler pole and is from (6.119) for y = 0 and kpz = 1/2 


P) 
i= oe (6.120) 


This deflection angle is used to define the wiggler strength parameter 


ce 
amc” 


K = py0= ; Body. (6.121) 


where mc” is the particle rest energy and y the particle energy in units of the rest 
energy. In more practical units this strength parameter is 


K = CxBo (T) Ap (cm) © By (T) Ap (cm) , (6.122) 
where 
Cy = —“ ~~ =0.93373 Tem— . 
27 mc2 


The parameter K is a characteristic wiggler constant defining the wiggler strength 
and is not to be confused with the general focusing strength K = x* + k. Coming 
back to the distinction between wiggler and undulator magnet, we speak of a wiggler 
magnet if K >> 1 and of an undulator if K < 1. Of course, many applications 
happen in a gray zone of terminology when K ~ 1. 
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6.8 Electrostatic Quadrupole 


A different focusing device based on electrostatic fields can be designed very much 
along the strategy for a magnetic quadrupole. We pick the first term on the r.h.s. 
of (6.25) and modify the expression to reflect the beam rigidity (6.12) for electric 
fields 


Vo(x, y) = —RpBA205 (0° — y*) = -g50° —y”), (6.123) 


where the field gradient, g = dE/dx . Such a device can be constructed by placing 
metallic surfaces in the form of a hyperbola 


x° —y’ = +R = const. (6.124) 


where R is the aperture radius of the device as shown in Fig. 6.22 (left) 

The potential of the four electrodes is alternately V = + 5 gR*. This design can 
be somewhat simplified by replacing the hyperbolic metal surfaces by equivalently 
sized metallic tubes as shown in Fig. 6.22 (right). Numerical computer simulation 
programs can be used to determine the degradation of the quadrupole field due to 
this simplification. 


Fig. 6.22 Electric field quadrupole, ideal pole profile (/eft), and an example of a practical approach 
with cylindrical metallic tubes (right) 
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Problems 


6.1 (S). Show that the electrical power in the excitation coil is independent of the 
number of turns. Show also that the total electrical power in a copper coil depends 
only on the total weight of the copper used and the current density. 


6.2 (S). Design an electrostatic quadrupole which provides a focal length of 10m 
in the horizontal plane for particles with a kinetic energy of 10 MeV. The device 
shall have an aperture with a diameter of 10cm and an effective length of 0.1 m. 
What is the form of the electrodes, their orientation and potential? 


6.3 (S). In the text, we have derived the fields from a scalar potential. We could 
also derive the magnetic fields from a vector potential A through the differentiation 
B= V xA. For purely transverse magnetic fields, show that only the longitudinal 
component A, ~ O must be non zero. Derive the vector potential for a dipole 
and quadrupole field and compare with the scalar potential. What is the difference 
between the scalar potential and the vector potential? 


6.4 (S). Derive the pole profile (aperture radius r = | cm) for a combined function 
magnet including a dipole field to produce for a particle beam of energy E = 
50GeV a bending radius of p = 300m, a focusing strength k = 0.45m” anda 
sextupole strength of m = 23.0m-°. 


6.5 (S). Strong mechanical forces exist between the magnetic poles when a magnet 
is energized. Are these forces attracting or repelling the poles? Why? Consider a 
dipole magnet £ =1 m long, a pole width w = 0.2m and a field of B = 1.5T. 
Estimate the total force between the two magnet poles ? 


6.6 (S). Following the derivation of (5.7) for a bending magnet, derive a similar 
expression for the electrical excitation current in A-turns of a quadrupole with an 
aperture radius R and a desired field gradient g. What is the total excitation current 
necessary in a quadrupole with an effective length of £ =1m and R = 3cm to 
produce a focal length of f = 50 m for particles with an energy of cp = 500 GeV? 


6.7 (S). Consider a coil in the aperture of a magnet as shown in Fig. 6.14. All n 
windings are made of very thin wires and are located exactly on the radius R. We 
rotate now the coil about its axis at a rotation frequency v. Such rotating coils are 
used to measure the multipole field components in a magnet. Show analytically that 
the recorded signal is composed of harmonics of the rotation frequency v. What is 
the origin of the harmonics? 


6.8 (S). Explain why a quadrupole with finite pole width does not produce a pure 
quadrupole field. What are the other allowed multipole field components ignore 
mechanical tolerances and why? 


6.9 (S). Through magnetic measurements the following vertical magnetic multi- 
pole field components in a quadrupole are determined. At x = 1.79cm and y = 0 
cm: By = 0.3729 T, B3 = 1.25 x 1074 T, By = 0.23 x 10+ T, Bs = 0.36 x 1074 T, 
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Be = 0.726x10~4 T, By = 0.020x10~4 T, Bg = 0.023 10~* T, By = 0.0051x10~* 
T, Bio = 0.0071 x 10~* T. Calculate the relative multipole strengths atx = 1cm 
normalized to the quadrupole field at 1cm. Why do the 12-pole and 20-pole 
components stand out with respect to the other multipole components? 


6.10 (S). Derive the equation for the pole profile of an iron dominated upright 
octupole with a bore radius R. Ignore longitudinal variations. To produce a field 
of 0.2 T at the pole tip (R = 3cm) what total current per coil is required? 


6.11 (S). Calculate and design the current distribution for a pure air coil, super- 
conducting dipole magnet to produce a field of By) = 5T in an aperture of radius 
R = 3cm without exceeding an average current density of 7 = 1,000 A/mm”. 


6.12. Derive an expression for the current distribution in air coils to produce a 
combination of a dipole, quadrupole and sextupole field. Express the currents in 
terms of fields and field gradients. 
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Part III 
Beam Dynamics 


Chapter 7 
Single Particle Dynamics 


The general equations of motion, characterized by an abundance of perturbation 
terms on the right-hand side of, for example, (6.95), (6.96) have been derived in the 
previous chapter. If these perturbation terms were allowed to become significant in 
real beam transport systems, we would face almost insurmountable mathematical 
problems trying to describe the motion of charged particles in a general way. 
For practical mathematical reasons it is therefore important to design components 
for particle beam transport systems such that undesired terms appear only as 
small perturbations. With a careful design of beam guidance magnets and accurate 
alignment of these magnets we can indeed achieve this goal. 

Most of the perturbation terms are valid solutions of the Laplace equation 
describing higher order fields components. Virtually all these terms can be mini- 
mized to the level of perturbations by proper design of beam transport magnets. 
Specifically, we will see that the basic goals of beam dynamics can be achieved 
by using only two types of magnets, bending magnets and quadrupoles which 
sometimes are combined into one magnet. Beam transport systems, based on only 
these two lowest order magnet types, are called linear systems and the resulting 
theory of particle dynamics in the presence of only such magnets is referred to as 
linear beam dynamics or linear beam optics. 

In addition to the higher order magnetic field components, we also find purely 
kinematic terms in the equations of motion due to large amplitudes or due to the 
use of curvilinear coordinates. Some of these terms are generally very small for 
particle trajectories which stay close to the reference path such that divergences are 
small, x’ < 1 and y’ < 1. The lowest order kinematic effects resulting from the use 
of a curvilinear coordinate system, however, cannot generally be considered small 
perturbations. One important consequence of this choice for the coordinate system 
is that the natural bending magnet is a sector magnet which has very different beam 
dynamics properties than a rectangular magnet which would be the natural magnet 
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type for a Cartesian coordinate system. While a specific choice of a coordinate 
system will not change the physics, we must expect that some features are expressed 
easier or more complicated in one or the other coordinate system. We have chosen to 
use the curvilinear system because it follows the ideal path of the particle beam and 
offers a simple direct way to express particle trajectories deviating from the ideal 
path. In a fixed Cartesian coordinate system we would have to deal with geometric 
expressions relating the points along the ideal path to an arbitrary reference point. 
The difference becomes evident for a simple trajectory like a circle of radius r and 
center at (xo, yo) which in a fixed orthogonal coordinate system would be expressed 
by (x — x0)? + (vy — yo)* = 7°. In the curvilinear coordinate system this equation 
reduces to the simple identity x(z) = 0. 


7.1 Linear Beam Transport Systems 


The theory of beam dynamics based on quadrupole magnets for focusing is called 
strong focusing beam dynamics in contrast to the case of weak focusing, which 
utilizes the focusing of sector magnets in combination with a small gradient in 
the bending magnet profile. Such focusing is employed in circular accelerators 
like betatrons or some cyclotrons and the first generation of synchrotrons. The 
invention of strong focusing by Christofilos [1] and independently by Courant et al. 
[2] changed quickly the way focusing arrangements for large particle accelerators 
are determined. One of the main attraction for this kind of focusing was the ability 
to greatly reduce the magnet aperture needed for the particle beam since the stronger 
focusing confines the particles to a much smaller cross section compared to weak 
focusing. A wealth of publications and notes have been written during the fifties 
to determine and understand the intricacies of strong focusing, especially the rising 
problems of alignment and field tolerances as well as those of resonances. Particle 
stability conditions from a mathematical point of view have been investigated by 
Moser [3]. 

In this chapter, we will discuss the theory of linear charged particle beam dynam- 
ics and apply it to the development of beam transport systems, the characterization 
of particle beams, and to the derivation of beam stability criteria. The bending and 
focusing function may be performed either in separate magnets or be combined 
within a synchrotron magnet. The arrangement of magnets in a beam transport 
system, called the magnet lattice, is often referred to as a separated function or 
combined function lattice depending on whether the lattice makes use of separate 
dipole and quadrupole magnets or uses combined function magnets, respectively. 

Linear equations of motion can be extracted from (6.95), (6.96) to treat beam 
dynamics in first or linear approximation. For simplicity and without restricting 
generality we assume the bending of the beam to occur only in one plane, 
the x-plane. The linear magnetic fields for bending and quadrupole magnets are 
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expressed by 


B= —gy, (7.1a) 
By = By + gx, (7.1b) 


where Byo is the dipole field and g the gradient of the quadrupole field. With these 
field components we obtain from (6.95), (6.96) the equations of motion in the 
approximation of linear beam dynamics 


x" + (ko+kG,) x = 0, (7.2a) 
y —ky =0. (7.2b) 


Both, the focusing from the bending magnet and that from a quadrupole may be 
combined into one parameter 


K(z) = ko(z) + Koy (2): (7.3) 


So far no distinction has been made between combined or separated function 
magnets and the formulation of the equations of motion based on the magnet 
strength parameter K as defined in (7.3), is valid for both types of magnets. For 
separated function magnets either ko or Ko, is Set to zero while for combined function 
magnets both parameters are nonzero. 


7.1.1 Nomenclature 


Focusing along a beam transport line is performed by discrete quadrupoles placed to 
meet specific particle beam characteristics required at the end or some intermediate 
point of the beam line. The dependence of the magnet strength on z is, therefore, 
a priori indeterminate and is the subject of lattice design in accelerator physics. 
To describe focusing lattices simple symbols are used to point out location and 
sometimes relative strength of magnets. In this text we will use symbols from 
Fig. 7.1 for bending magnets, quadrupoles, and sextupoles or multipoles. 

All magnets are symbolized by squares along the z-axis and the length of the 
symbol may represent the actual magnetic length. The symbol for pure dipole 
magnets is a square centered about the z-axis while bending magnets with a gradient 
are shifted vertically to indicate the sign of the focusing. Positive squares are 
used to indicate horizontal focusing and negative squares for horizontal defocusing 
quadrupoles. 
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Fig. 7.1 Symbols for 
————E 


magnets in lattice design 


and typical distributions bending magnet focusing quadrupole 
of magnets along a beam 
transport line » 


[| 


sextupole/multipole defocusing quadrupole 


Using such symbols, a typical beam transport line may have general patterns like 
that shown in Fig 7.1. The sequence of magnets and their strength seems random 
and is mostly determined by external conditions to be discussed later. More regular 
magnet arrangements occur for circular accelerators or very long beam transport 
lines composed of periodic sections. 


7.2 Matrix Formalism in Linear Beam Dynamics 


The seemingly arbitrary distribution of focusing parameters in a beam transport 
system makes it impossible to formulate a general solution of the differential 
equations of motion (7.2). To describe particle trajectories analytically through a 
beam transport line composed of drift spaces, bending magnets, and quadrupoles, 
we will derive mathematical tools which consist of partial solutions and can be used 
to describe complete particle trajectories. 

In this section we will derive and discuss the matrix formalism [4] as a method to 
describe particle trajectories. This method makes use of the fact that the magnet 
strength parameters are constant at least within each individual magnet. The 
equations of motion become very simple since the restoring force K is constant 
and the solutions have the form of trigonometric functions. The particle trajectories 
may now be described by analytical functions at least within each uniform element 
of a transport line including magnet free drift spaces. 

These solutions can be applied to any arbitrary beam transport line, where the 
focusing parameter K changes in a step like function as shown in Fig. 7.1. Cutting 
this beam line into its smaller elements so that K = const. in each of these pieces 
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we will be able to follow the particle trajectories analytically step by step through 
the whole transport system. This is the model generally used in particle beam optics 
and is called the hard edge model. 

In reality, however, since nature does not allow sudden changes of physical 
quantities (natura non facit saltus) the hard edge model is only an approximation, 
although for practical purposes a rather good one. In a real magnet the field strength 
does not change suddenly from zero to full value but rather follows a smooth 
transition from zero to the maximum field. Sometimes, the effects due to this smooth 
field transition or fringe field are important and we will derive the appropriate 
corrections later in this section. For now, we continue using the hard edge model 
for beam transport magnets and keep in mind that in some cases a correction may 
be needed to take into account the effects of a smooth field transition at the magnet 
edges. 

Using this approximation, where 1/9 and k are constants, and ignoring pertur- 
bations, the equation of motion is reduced to that of a harmonic oscillator, 


u’ + K,u=0, where K, = kw +k, = const. (7.4) 


The principal solutions have been derived in Sect. 5.5.1 and are expressed in matrix 


formulation by 
u(z) = Cuz) Si(Z) ug 
Gy 7 ico 2) es : (75) 


where u may be used for either x or y. We have deliberately separated the motion 
in both planes since we do not consider coupling. Formally, we could combine the 
two 2 x 2 transformation matrices for each plane into one 4 x 4 matrix describing 
the transformation of all four coordinates 


x(z) C,(z) Sx(z) 0 0 xo 
XQ] _| GOs 0 0 [fm] (7.6) 
y(z) 0 0 Cy, Sy@ ]] yo 
y¥@ 0 0 C@S@O/ \ym 


Obviously the transformations are still completely decoupled but in this form we 
could include coupling effects, where, for example, the x-motion depends also on 
the y-motion and vice versa. This can be further generalized to include any particle 
parameter like the longitudinal position of a particle with respect to a reference 
particle, or the energy of a particle, the spin vector, or any particle coordinate 
that may depend on other coordinates. In the following paragraphs we will restrict 
the discussion to linear (2 x 2) transformation matrices for a variety of beam line 
elements. 


182 7 Single Particle Dynamics 
7.2.1 Driftspace 


In a driftspace of length £ or in a weak bending magnet, where kj, < 1 and ko = 0, 
the focusing parameter K = 0 and the solution of (7.4) in matrix formulation can 


be expressed by 
u(z)\ _ (1é Ug 
(3) = (or) (%)- ao 


A more precise derivation of the transformation matrices for bending magnets of 
arbitrary strength will be described later in this chapter. Any drift space of length 
£ = z—z, therefore, is represented by the simple transformation matrix 


1¢ 
£\0) = : 7.8 
Matto) = (4) 78) 

We recognize the expected features of a particle trajectory in a field free drift 
space. The amplitude u changes only if the trajectory has an original non vanishing 
slope uj) # 0 while the slope itself does not change at all. 


7.2.2 Quadrupole Magnet 


For a pure quadrupole the bending term ko, = 0 and the field gradient or quadrupole 
strength k(z) 4 0 can be positive as well as negative. With these assumptions we 
solve again (7.4) and determine the integration constants by initial conditions. For 
k > 0 we get the transformation for a focusing quadrupole 


( u(z) ) _ cos a sin ( u(Zo) ) (7.9) 
u'(z) —VJksiny cosy u'(zo) J , 


where y = Vk(z — zp. This equation is true for any section within the quadrupole 
as long as both points zp and z are within the active length of the quadrupole. 

For a full quadrupole of length @ and strength k we set gp = Vké and the 
transformation matrix for a full quadrupole in the focusing plane is 


1 


cosp —sing 
Mor (£|0) = vk . 7.10 
or (£10) G4 cos Y ( ) 
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Fig. 7.2. Example of a beam 
transport line (schematic) 


Similarly, we get in the other plane with k < 0 the solution for a defocusing 


quadrupole 
(102) = (ont ashy) (a0 aun 
u'(z) VJ|k|sinhy  coshy u' (zo) J’ , 
where py = “f [kl] (z— zo). The transformation matrix in the defocusing plane 


through a complete quadrupole of length @ with g = ,/|k|€ is therefore 


he 


cosh @ sinh 
Map (£0) = vial 712 
sd (= sinhg cosh aie 


These transformation matrices make it straight forward to follow a particle 
through a transport line. Any arbitrary sequence of drift spaces, bending magnets 
and quadrupole magnets can be represented by a series of transformation matrices 
M,. The transformation matrix for the whole composite beam line is then just equal 
to the product of the individual matrices. For example, by multiplying all matrices 
along the path in Fig. 7.2 the total transformation matrix M for the eight magnetic 
elements of this example is determined by the product 


M = Ms...M4M3M2M, (7.13) 


and the particle trajectory transforms through the whole composite transport line 
like 


ee ) = M (z|z) ( u(zo) ) (7.14) 


u'(z) u' (Zo) 


where the starting point zo in this case is at the beginning of the drift space M, and 
the end point z is at the end of the magnet Ms. 
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7.2.3 Thin Lens Approximation 


As will become more apparent in the following sections, this matrix formalism 
is widely used to calculate trajectories for individual particle or for a virtual 
particle representing the central path of a whole beam. The repeated multiplication 
of matrices, although straightforward, is very tedious and therefore, most beam 
dynamics calculations are performed on digital computers. In some cases, however, 
it is desirable to analytically calculate the approximate properties of a small set of 
beam elements. For these cases it is sufficient to use what is called the thin lens 
approximation. In this approximation it is assumed that the length of a quadrupole 
magnet is small compared to its focal length (¢ « f) and we set 


£>0, (7.15) 
while keeping the focal strength constant, 
f-! = +ki = const. (7.16) 


This result is analogous to geometric light optics, where we assume the glass lenses 
to be infinitely thin. As a consequence gy = Vk& — 0 and the transformation 
matrices (7.10,) (7.12) are the same in both planes except for the sign of the 


focal length 
u(z)\ _{ 1 € Ug 


f | =k£>0 in the focusing plane 
f-!| =k£ <0 in the defocusing plane. 


where 
(7.18) 


The transformation matrix has obviously become very simple and exhibits only the 
focusing property in form of the focal length. Quite generally one may regard for 
single as well as composite systems the matrix element M2; as the element that 
expresses the focal strength of the transformation. 

In thin lens approximation it is rather easy to derive focusing properties of 
simple compositions of quadrupoles. A quadrupole doublet composed of two 
quadrupole magnets separated by a drift space of length L is described by the total 
transformation matrix 


1 0\/1L 1 0 
M 10) =( )( )( (7.19) 
m = lots = 1 


7 ae mere) 
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Fig. 7.3. Reversed lattice 


where we find the well known expression from geometric paraxial light optics 


Lo, 4 2 
fF f 2 Ak 


Such a doublet can be made focusing in both planes if, for example, the quadrupole 
strengths are set such that f, = —f: = f. The total focal length then is f* = 
+L/f? > 0 in both the horizontal and the vertical plane. 

This simple result, where the focal length is the same in both planes, is a valid 
solution only in thin lens approximation. For a doublet of finite length quadrupoles 
the focal length in the horizontal plane is always different from that in the vertical 
plane as can be verified by using the transformations (7.10), (7.12) to calculate the 
matrix M q,. Since individual matrices are not symmetric with respect to the sign of 
the quadrupole field, the transformation matrices for the horizontal plane M gp, and 
the vertical plane M gp,y must be calculated separately and turn out to be different. In 
special composite cases, where the quadrupole distribution is symmetric as shown 
in Fig. 7.3, the matrices for both of the two symmetric half sections are related in a 
simple way. If the matrix for one half of the symmetric beam line is 


_ [ab 
M = (eo) (7.21) 


(7.20) 


then the reversed matrix for the second half of the beam line is 


M, = (‘ °) (7.22) 


ca 


and the total symmetric beam line has the transformation matrix 


(7.23) 


_ _ f(ad+be 2bd 
Muy = MeM = ( 2ac a 
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Fig. 7.4 Symmetric 
quadrupole triplet 


We made no particular assumptions for the lattice shown in Fig.7.3 except for 
symmetry and the relations (7.21), (7.22) are true for any arbitrary but symmetric 
beam line. 

The result for the reversed matrix is not to be confused with the inverse matrix, 
where the direction of the particle path is also reversed. The inverses matrix of 
(7.21) is 


M, = ( d ne (7.24) 


—c a 


Going through an arbitrary section of a beam line and then back to the origin 
again results in a total transformation matrix equal to the unity matrix 


10 
Met = Mi M = & :) , (725) 
These results allow us now to calculate the transformation matrix M,, for a 
symmetric quadrupole triplet. With (7.19), (7.24) the transformation matrix of a 
quadrupole triplet as shown in Fig. 7.4 is 


_ _ (1-21 /f? Ld +L/f) 
My = M.M = ( up oe (7.26) 


where f* is defined by (7.20) with fi = —frp = f. 

Such a triplet is focusing in both planes as long as f > L. Symmetric triplets 
as shown in Fig. 7.4 have become very important design elements of long beam 
transport lines or circular accelerators since such a triplet can be made focusing 
in both planes and can be repeated arbitrarily often to provide a periodic focusing 
structure called a FODO-channel. The acronym is derived from the sequence of 
focusing (F) and defocusing (D) quadrupoles separated by non-focusing elements 
(O) like a drift space or a bending magnet. 
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Fig. 7.5 Field profile in a real quadrupole with a bore radius of R = 3cm and an iron length of 
€iron = 15.9cm 


7.2.4 Quadrupole End Field Effects 


In defining the transformation through a quadrupole we have assumed the strength 
parameter k(z) to be a step function with a constant nonzero value within the 
quadrupole and zero outside. Such a hard edge field distribution is only approx- 
imately true for a real quadrupole. The strength parameter in a real quadrupole 
magnet varies in a gentle way from zero outside the quadrupole to a maximum 
value in the middle of the quadrupole. In Fig. 7.5 the measured gradient of a real 
quadrupole along the axis is shown. 

The field extends well beyond the length of the iron core and the effective 
magnetic length, defined by 


lett = (7.27) 


where go is the field gradient in the middle of the quadrupole, is longer than the iron 
length by about the radius of the bore aperture 


lest x Liron +R. (7.28) 


This is the effective or hard edge magnet length f with strength k. The real 
field distribution can be approximated by a trapezoid such that / g dz is the same 
in both cases (see Fig. 7.5). To define the trapezoidal approximation we assume 
a fringe field extending over a length equal to the bore radius R as shown in 
Fig. 7.5. End field effects must therefore be expected specifically in quadrupoles 
with large bore radii and short iron cores. It is interesting to investigate as to what 
extend the transformation characteristics for a real quadrupole differ from the hard 
edge model. The real transformation matrix can be obtained by slicing the whole 
axial quadrupole field distribution in thin segments of varying strength. Treating 
these segments as short hard edge quadrupoles the full transformation matrix is the 
product of the matrices for all segments. 
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Fig. 7.6 Decomposition of an actual quadrupole field profile into segments of hard edge 
quadrupoles. (ko, &o are for the hard edge model, k, £ for the hard edge model with real fringe 
fields, A and L are used for mathematical evaluation only) 


While it is possible to obtain an accurate transformation matrix this way the 
variations of the matrix elements due to this smooth field distribution turn out to 
be mostly small and in practice, therefore, the hard edge model is used to develop 
beam transport lattices. Nonetheless after a satisfactory solution has been found, 
these real transformation matrices should be used to check the solution and possibly 
make small adjustment to the idealized hard edge model design. 

In this section, we will discuss an analytical estimate of the correction to 
be expected for a real field distribution [5] by looking for the “effective” hard 
edge model parameters (k, 2) which result in a transformation matrix equal to the 
transformation matrix for the corresponding real quadrupole. The transformation 
matrix for the real quadrupole be 


CS 
My = e oe (7.29) 


where the matrix elements are the result of multiplying all “slice” matrices for the 
quadrupole segments as shown in Fig. 7.6 over the length L. 

We assume now that this real quadrupole can be represented by a hard edge 
model quadrupole of length ¢ with adjacent drift spaces A as indicated in Fig. 7.6. 
The transformation through this system for a focusing quadrupole is given by [5] 


(51) cosy sing ico 
01/\-Vksing cosg 01 


7 cosy — VkA sing 2A cosy — ~z sin gp (7.30) 
~ —Jksing cosy — VkA sing 
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with g = ko. This hard edge transformation matrix must be the same as 
the actual matrix (7.29) and we will use this equality to determine the effective 
quadrupole parameters k, £. First, we note that the choice of the total length L = 
£9 + 2A is arbitrary as long as it extends over the whole field profile, and both, 
the “slices” and hard edge matrices extend over the whole length L by employing 
drift spaces if necessary. Equating (7.29) and (7.30) we can compose two equations 
which allow us to determine the effective parameters k, £ from known quantities 


C21, = COS + 49, sing,, (7.31) 

C/ fl, = —9; sing, . 
Here we have added the index + to indicate a focusing quadrupole. The first of these 
equations can be solved for g since the quantities C,, Cc , and L are known. The 
second equation then is solved for ¢, and k, = g? /£,. Two parameters are sufficient 
to equate the 2 x 2 matrices (7.29), (7.30) since two of the four equations are 
redundant for symmetry reasons, M\,; = Mz = C = S’, and the determinant of the 
matrices on both sides must be unity. Similarly, we get for a defocusing quadrupole 


Ci 3h, = cosh p, — 38, sinh g, , (7.32) 

Cll, = —g, sinh g, . 
Equations (7.31) and (7.32) define a hard edge representation of a real quadrupole. 
However, we note that the effective quadrupole length ¢ and strength k are different 
from the customary definition, where kp is the actual magnet strength in the middle 
of the quadrupole and the magnet length is defined by £95 = 7 J k(z) dz. We also 
observe that the effective values ¢ and k are different for the focusing and defocusing 
plane. Since the endfields are not the same for all quadrupoles but depend on the 
design parameters of the magnet we cannot determine the corrections in general. In 
practical cases, however, it turns out that the corrections Ak = k — ko and Af = 
£ — £0 are small for quadrupoles which are long compared to the aperture and are 
larger for short quadrupoles with a large aperture. In fact the differences Ak and 
A¢ turn out to have opposite polarity and the thin lens focal length error Ak Aé is 
generally very small. 

As an example, we use the quadrupole of Fig. 7.5 and calculate the corrections 
due to end field effects. We calculate the total transformation matrix for the real field 
profile as discussed above by approximating the actual field distribution by a series 
of hard edge “slice” matrices in both planes as a function of the focusing strength ko 
and solve (7.31), (7.32) for the effective parameters (ke, €¢) and (ka, £4), respectively. 
In Fig. 7.7 these relative fringe field corrections to the quadrupole strength Ak/ko 
and to the quadrupole length Af/£o are shown as functions of the strength ky. The 
effective quadrupole length is longer and the effective quadrupole strength is lower 
than the pure hard edge values. In addition the corrections are different in both 
planes. Depending on the sensitivity of the beam transport system these corrections 
may have to be included in the final optimization. 
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7.3 Focusing in Bending Magnets 


Bending magnets have been treated so far just like drift spaces as far as focusing 
properties are concerned. This is a good approximation for weak bending magnets 
which bend the beam only by a small angle. In cases of larger deflection angles, 
however, we observe focusing effects which are due to the particular type of magnet 
and its end fields. In Chap. 6 we discussed the geometric focusing term «* which 
appears in sector magnets only. Other focusing terms are associated with bending 
magnets and we will discuss in this section these effects in a systematic way. 
Specifically, the focusing of charged particles crossing end fields at oblique angles 
will be discussed. 

The linear theory of particle beam dynamics uses a curvilinear coordinate system 
following the path of the reference particle and it is assumed that all magnetic fields 
are symmetric about this path. The “natural” bending magnet in this system is one, 
where the ideal path of the particles enters and exits normal to the magnet pole faces. 
Such a magnet is called a sector magnet as shown in Fig. 7.8. The total deflection 
of a particle depends on the distance of the particle path from the ideal path in the 
deflecting plane which, for simplicity, we assume to be in the horizontal x-plane. 
Particles following a path at a larger distance from the center of curvature than the 
ideal path travel a longer distance through this magnet and, therefore, are deflected 
by a larger angle than a particle on the ideal path. Correspondingly, a particle passing 
through the magnet closer to the center of curvature is deflected less. 

This asymmetry leads to a focusing effect which is purely geometric in nature. 
On the other hand, we may choose to use a magnet such that the ideal path of the 
particle beam does not enter the magnet normal to the pole face but rather at an 
angle. Such a configuration has an asymmetric field distribution about the beam 
axis and therefore leads to focusing effects. We will discuss the effects of fringe 
fields in more detail in Sect. 7.3.2. 
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Fig. 7.8 Focusing in a sector magnet, where nn = n. = O 


7.3.1 Sector Magnets 


The degree of focusing in a sector magnet can be evaluated in any infinitesimal 
sector of such a magnet by calculating the deflection angle as a function of the 
particle position x. With the notation from Fig.7.8 we get for the deflection angle 
while keeping only linear terms in x 


dé = Koda = Ko el + KoXx) dz. (7.33) 

The first term on the r.h.s. merely defines the ideal path, while the second x- 

dependent term of the deflection angle in (7.33) describes the particle motion in 

the vicinity of the ideal path. With respect to the curvilinear coordinate system 
following the ideal path we get the additional deflection 


50 = Ko xdz. (7.34) 


This correction is to be included in the differential equation of motion as an 
additional focusing term 


Ax” =-— = Ko x (7.35) 
to the straight quadrupole focusing leading to the equation of motion 

x" +(k+K5)x=0, (7.36) 
which is identical to the result obtained in Sect. 5.3. 


The differential equation (7.36) has the same form as that for a quadrupole and 
therefore the solutions must be of the same form. Using this similarity we replace k 
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by (A+ Ko) and obtain immediately the transformation matrices for a general sector 
magnet. For K = k + xj > 0 and 


O = VKe (7.37) 


we get from (7.10) the transformation matrix 
cos@ —-Lsin@ 
; (7.38) 


Magy (€|0) = VK 
(E10) ae cos @ 


where £ is the arc length of the sector magnet and where both the focusing term 
k and the bending term kp may be nonzero. Such a magnet is called a synchrotron 
magnet since this magnet type was first used for lattices of synchrotrons. 

For the defocusing case, where K = k + Ko <Oand0 = VIKIE, we get from 
(7.12) 


cohO® —L-sinhoO 
: (7.39) 


Msgy a(€|0) = Viki 
va(l|0) ia sinh® cosh® 


Note that the argument © is equal to the deflection angle @ only in the limit 
k — 0 because these transformation matrices include bending as well as focusing in 
the same magnet. Obviously, in the nondeflecting plane kg = O and such a magnet 
acts just like a quadrupole with strength k and length £. 

A subset of general sector magnets are pure dipole sector magnets, where we 
eliminate the focusing by setting k = 0 and get the pure dipole strength K = kj > 0. 
The transformation matrix for a pure sector magnet of length £ and bending angle 
0 = kof in the deflecting plane becomes from (7.38) 


(7.40) 


Mp1) = ( a ane 


—Kosin@ cos@ 


If we also let ko — O we arrive at the transformation matrix of a sector magnet in 
the nondeflecting plane 


M,o(€|0) = @ Oi (7.41) 


which has the form of a drift space. A pure dipole sector magnet therefore behaves 
in the non-deflecting plane just like a drift space of length £. Note that ¢ is the arc 
length of the magnet while the engineering magnet length might be given as the 
straight length between entry and exit point. 


7.3. Focusing in Bending Magnets 193 


B,(z) 
linearized fringe field 


effective hard edge x 
Py 5 i . 


fringe field 


end of iron core fringe field 


<—_—___________——_-+>'! 
' 


Z 


> 


K—2G— > Zr 


Fig. 7.9 End field profile in a dipole magnet and fringe field focusing 


7.3.2 Fringe Field Effects 


The results obtained above are those for a hard edge model and do not reflect 
modifications caused by the finite extend of the fringe fields. The hard edge model 
is again an idealization and for a real dipole we consider the gradual transition of 
the field from the maximum value to zero outside the magnet. The extend of the 
dipole fringe field is typically about equal to the gap height or distance between the 
magnet poles. 

We assume magnet poles which are very wide compared to the gap height 
and therefore transverse field components in the deflecting plane, here B,, can be 
neglected. At the entrance into a magnet the vertical field component B, increases 
gradually from the field free region to the maximum value in the middle of the 
magnet (Fig. 7.9). We will discuss the effects on the particle dynamics caused by 
this fringe field and compare it with the results for a hard edge model magnet. 

For the following discussion we consider both a fixed orthogonal Cartesian 
coordinate system (u, v, w), used in the fringe area, as well as a moving curvilinear 
system (x,y,z). The origin of the fixed coordinate system is placed at the point 
Po where the field starts to rise (Fig. 7.9). At this point both coordinate systems 
coincide. The horizontal field component vanishes for reasons of symmetry 


B, = 0 (7.42) 
and the vertical field component in the fringe region may be described by 
B, = F(w). (7.43) 


With Maxwell’s curl equation 0B,,/dv — 0B,/dw = 0 we get after integration the 
longitudinal field component B,, = [ (0B, /dw) dv or 


dF (w) 


By =y ; 
we Ow 


(7.44) 
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where y = v and where a linear fringe field (see Fig.7.9) was assumed with 
OF (w)/dw = const. These field components must be expressed in the curvilinear 
coordinate system (x, y, z). Within the fringe field B,, (z) can be split into B, and B, 
as shown in Fig. 7.9. The horizontal field component is then B, = B,, sind where 6 
is the deflection angle at the point z defined by 


i= = ['F@ dz. (7.45) 
Po Jo 
With 
__ aFQv) __ aF(w) dz OF) I 
ne i ° Be dw” de cass a 
we get 
B,(z) = y F(z) tané, (7.47) 


where F’(z) = dF/dz. The vertical fringe field component is with 0B,/dy — 
dB, /dx = 0 and integration 


By(z) = By +x F(z) tans. (7.48) 

The longitudinal field component is from (7.46) and with B, = B,, cosé 
Bz) = yF’(2). (7.49) 
The field components of the fringe field depend linearly on the transverse 


coordinates and therefore fringe field focusing [6] must be expected. With the 
definition of the focal length from (7.3) we get 


; = , " K@ dz. (7.50) 


where K(z) is the focusing strength parameter defined in (). In the deflecting plane 
the fringe field focusing is with k(z) = (e/po) 0B,(z)/0x and (7.48) 


1 zt 
= / (k’ tané + x?) dz, (7.51) 


where we have set k(z) = (e/po) F(z). For small deflection angles 6 in the fringe 
fieldtanéd ~ 6 = Hs "« dz and after integration of (7.48) by parts through the full 
fringe field we get the focal length while neglecting higher order terms in dr 


: = kb (7.52) 
= ko or, ais 
; 0 Of 


x 
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where ko = 1/po is the curvature in the central part of the magnet and 4y is the total 
deflection angle in the fringe field region. 

This result does not deviate from that of the hard edge model, where for a small 
deflection angle 6 we have from (7.40) 1/f, + ko 6 agreeing with (7.52). We obtain 
therefore the convenient result that in the deflecting plane of a sector magnet there 
is no need to correct the focusing because of the finite extend of the fringe field. 


7.3.3 Finite Pole Gap 


In the vertical plane this situation is different since we expect vertical focusing from 
(7.47) while there is no focusing in the approximation of a hard edge model. Using 
the definition (7.50) of the focal length in the vertical plane gives with K(z) = —k(z) 
and (7.47) 


1 2f Zf 
= -f «’ tand dz & -| kK’ (z) (2) dz. (7.53) 
0 0 


The fringe field of a sector magnet therefore leads to a defocusing effect which 
depends on the particular field profile. We may approximate the fringe field by a 
linear fit over a distance approximately equal to the pole gap 2G which is a good 
approximation for most real dipole magnets. We neglect the nonlinear part of the 
fringe field and approximate the slope of the field strength by «’ = k9/2G =const. 
The focal length for the full fringe field of length z = 2G is therefore with «(z) = 
k'z,0<z< zand 


8(2) = i KE = Oe (7.54) 
0 
given by 
1 2G 
i =) K'5@) dz = — 39 G = —3K0 dr, ee) 
y 0 
where 
dp = K0G. (7.56) 


This is the focusing due to the fringe field at the entrance of a sector magnet. At 
the exit we have the same effect since the sign change of x’ is compensated by the 
need to integrate now from full field to the field free region which is just opposite 
to the case in the entrance fringe field. Both end fields of a sector magnet provide a 
small vertical defocusing. We note that this defocusing is quadratic in nature, since 
d5¢ & Ko and therefore independent of the sign of the deflection. 


196 7 Single Particle Dynamics 


With these results we may now derive a corrected transformation matrix for a 
sector magnet by multiplying the hard edge matrix (7.41)on either side with thin 


length fringe field focusing 
1 0 1 0 
Jy ty 


and get with (7.55) and 6 = £/po for the transformation matrix in the vertical, 
non-deflecting plane of a sector magnet instead of (7.41) 


(7.58) 


1+10%  ¢ 


25 1 i 
5 9 9 pe 1+ 36 d¢ 


The second order term in the M>;-matrix element can be ignored for practical 
purposes but is essential to keep the determinant equal to unity. 


7.3.4 Wedge Magnets 


In a more general case compared to a sector magnet we will allow the reference path 
of the particle beam to enter and exit the magnet at an arbitrary angle with the pole 
face. Figure 7.10 shows such a wedge magnets and we will derive its transformation 
matrices. First, we note that the fringe field effect is not different from the previous 
case of a sector magnet except that now the angle 6(z) must be replaced by a new 
angle 7 + 5(z) where the pole rotation angle 7 and the sign convention is defined in 
Fig. 7.10. 
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Fig. 7.10 Fringe field focusing in wedge magnets 
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Different from the case of a sector magnet, we cannot replace the tangent 
in (7.51) by its argument since the angle 7 may be large to prohibit such an 
approximation. As a further consequence of a large value of 7, we must take into 
account the actual path length in the fringe field. To calculate the focal length f,, we 
have instead of (7.51) 


_— [ [«’ tan (n + 8) + x | dz (7.59) 
0 


Expanding for small angles 6 < 1 we get tan(n +6) ®% tann + 6. This 
approximation is true only as long as dtann < 1 or for entrance angles 7 not 
too close to 90° and the argument in the integral (7.59) becomes x’ tan + k’5 + k?. 
In addition to the terms for a sector magnet, a new term (x’ tan 7) appears and the 
focal length of the fringe field is 


1 e 
Z = } K’ tann dz + Kod¢ = Ko tann + Kor, (7.60) 
x 0 
where the integral extends over the whole fringe field. Since to first order the path 


length through the fringe field is 


2G 
z= : (7.61) 
cos 


where 2G is the pole gap height, we have 


2G/ cos n 
fo / dz. (7.62) 
0 


The term «0dr describes again the well-known focusing of a sector magnet in the 
deflecting plane while the term ko tan 7 provides the correction necessary for non- 
normal entry of the beam path into the magnet. For the case shown in Fig. 7.10, 
where 7 > 0, we obtain beam focusing in the deflecting plane from the fringe field. 
Similarly, we get a focusing or defocusing effect at the exit fringe field depending on 
the sign of the pole rotation. The complete transformation matrix of a wedge magnet 
in the horizontal deflecting plane is obtained by multiplying the matrix of a sector 
magnet with thin lens matrices to take account of edge focusing. For generality, 
however, we must assume that the entrance and the exit angle may be different. We 
will therefore distinguish between the edge focusing for the entrance angle 7 = no 
and that for the exit angle 7 = 7. and get for the transformation matrix in the 
deflecting plane 


7 1 0 cos@ po sind 1 0 
Myo (£,0) = ze i [3 sin@ cos@ —;, tan no 1|° oo) 
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In the vertical plane the focal length is similar to (7.53) and for not too large angles 


1 


2f af 
—= -| x’ tan(n + 6) dz © —xo tan -{ Kb dz. (7.64) 
fy 0 0 


Again we have the additional focusing term which is now focusing in the vertical 
plane for n < 0. For a linear fringe field the focal length is in analogy to (7.55) 


i 1 
= = —ko tann + 3kodr, (7.65) 
Sh 3 
where 
2G/cosn 2G? G 
bi -| dz = «/ = (7.66) 
0 cos*n cos? n 


since kK (z) ~ k’z and k’ = ko/ (G/ cos). The complete transformation matrix in 
the vertical plane for a horizontally deflecting wedge magnet becomes then 


1 O;s12 1 0 
w. £,0 = . 
Mw (€,0) . (tan ne + +6;,) [ot] = (tan yo + 46%) | 
(7.67) 


Equations (7.63) and (7.67) are for bending magnets with arbitrary entrance 
and exit angles 79 and 7. We note specifically that the transformation in the 
nondeflecting plane becomes different from a simple drift space and find a focusing 
effect due to the magnet fringe fields which depends on the entrance and exit angles 
between particle trajectory and pole face. 

This general derivation of the focusing properties of a wedge magnet must be 
taken with caution where the pole face rotations are very large. In spite of the finite 
pole rotation angles we have assumed that the particles enter the fringe field at the 
same location z along the beam line independent of the transverse particle amplitude 
x. Similarly, the path length of the trajectory in such a wedge magnet depends on 
the particle amplitude x and slope x’. Obviously these are second order effects but 
may become significant in special cases. 


7.3.5 Rectangular Magnet 


A particular case of a wedge magnet is the rectangular magnet which has parallel 
end faces. If we install this magnet symmetrically about the intended particle 
trajectory the entrance and exit angles equal to half the bending angle as shown 
in Fig. 7.11. 
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Fig. 7.11 Rectangular magnet 


For a deflection angle @ , 70 = je = —6/2 and the transformation matrix in the 
deflecting plane is from (7.63) 


1 O cos 6 sin 0 1 O 
Mz.p (€|0) = e ) (“i pee (sin ) (7.68) 
po 


Po PO 
_ (1 posin@ 
~\o ol ; 


A rectangular dipole magnet transforms in the deflecting plane like a drift space 
of length po sin @ and does not focus the beam. Note, that the “magnet length” ¢ 
defined by the deflection angle 6 = £/po is the arc length and is related to the 
straight magnet length L by 


6 £ 
L = 20 sin = = 29 sin —. (7.69) 
2 20 


In the vertical plane we observe a focusing with the focal length 
1 1 a) 
= tan +}, (7.70) 
2 3 


From (7.66) 6g/2 = G/[po cos(@/2)] and with (7.69) 59/2 = 2G tan(@/2)/L. 
Inserting this in (7.70), we obtain for the transformation matrix of a rectangular 
bending magnet in the nondeflecting plane 


LO) 12) f 10 i=) 2 
M,.o(€|0) -( rl ( ) ( 1 -( 2, 2 2|> (7-71) 
poy Ae ae aes 
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where 


1 _ 1 1 2G ‘ 6 172 
i ral ap) (5). ae 


In a rectangular dipole magnet we find just the opposite edge focusing properties 
compared to a sector magnet. The focusing in the deflecting plane of a sector magnet 
has shifted to the vertical plane in a rectangular magnet and focusing is completely 
eliminated in the deflecting plane. Because of the finite extend of the fringe field, 
however, the focusing strength is reduced by the fraction 2G/ [3Lcos (6/2)] where 
2G is the gap height and L the straight magnet length. 


7.3.6 Focusing in a Wiggler Magnet 


The derivation of fringe field focusing in ordinary dipole magnets as discussed in 
previous sections can be directly applied to wiggler magnets. The beam path in a 
wiggler magnet is generally not parallel to the reference trajectory z because of the 
transverse deflection in the wiggler field and follows a periodic sinusoidal form 
along the reference path. For this reason the field component B, appears to the 
particle partially as a transverse field Bs = B,tand? ~ B,0, where we use for a 
moment & as an auxiliary transverse coordinate normal to and in the plane of the 
actual wiggling beam path. We also assume that the wiggler deflection angle is 
small, } < 1. The field component B; can be expressed with (6.116), (6.117) more 
explicitly by 


2 . 
e [+ diy je) sinh (kpy) cosh (kpy) 7.73) 


“Bes = 
P ; Kp 

where 1/p9 = Bo is the inverse bending radius in the center of a wiggler pole at 

which point the field reaches the maximum value Bo. With the expansions (6.119) 

we have finally 


I 2 
“B =-— E sin (3) (y+ okey” +...). (7.74) 


The linear y-dependence is similar to that found to produce vertical focusing in 
wedge magnets. Since the wiggler field appears quadratically in (7.73) Bg(z) = 
Bg(—z) and Bg(Bo) = Be(—Bo). In other words, the transverse field has the same 
sign along all wiggler poles independent of the polarity of the vertical main wiggler 
field. The integrated field gradient per wiggler half pole is from (7.74) 


1 p/4 vi 
kt =-— sin” kyz dz = —i— (7.75) 
Po 0 Po 
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Fig. 7.12 Wiggler magnet 
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where ¢ is the effective length of the focusing element and kp = =. The integrated 
equivalent quadrupole strength or inverse focal length for each half pole is 


I 1 (eBo\* A 
il ==, (=) Re, (7.76) 
: Po 8/9 


For N wiggler poles we have 2N times the focusing strength and the focal length 
of the total wiggler magnet of length Ly = SNAp expressed in units of the wiggler 
strength parameter K becomes 


: ce L 7.77 
a of0 (7.77) 

Tacitly, a rectangular form of the wiggler poles has been assumed (Fig. 7.12) and 
consistent with our sign convention, we find that wiggler fringe fields cause focusing 
in the nondeflecting plane. Within the approximation used there is no corresponding 
focusing effect in the deflecting plane. This is the situation for most wiggler magnets 
or poles except for the first and last half pole where the beam enters the magnetic 
field normal to the pole face. 

A reason to possibly use wiggler magnets with rotated pole faces like wedge 
magnets originates from the fact that the wiggler focusing is asymmetric and 
not part of the lattice focusing and may therefore need to be compensated. For 
moderately strong wiggler fields the asymmetric focusing in both planes can mostly 
be compensated by small adjustments of lattice quadrupoles. The focusing effect 
of strong wiggler magnets, however, may generate a significant perturbation of the 
lattice focusing structure or create a situation where no stable solution for betatron 
functions exists anymore. The severity of this problem can be reduced by designing 
the wiggler poles as wedge magnets in such a way as to split the focusing equally 
between both the horizontal and vertical plane. In this case local correction can be 
applied efficiently in nearby lattice quadrupoles. 

We will therefore discuss the focusing and transformation matrix through a 
wiggler pole in the case of arbitrary entry and exit angles. To derive the complete 
and general transformation matrices, we note that the whole wiggler field can be 
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treated in the same way as the fringe field of ordinary magnets. The focal length of 
one half pole in the horizontal deflecting plane is from (7.60) 


I Ap/4 
~=| ide + Kade, (7.78) 
0 


where the pole face rotation angle 7 has been assumed to be small and of the order 
of the wiggler deflection angle per pole (Fig. 7.13). With k, = ky cos kpz the field 
slope is 


x! = Kyokp Sin kpz (7.79) 


and after integration of (7.78), the focal length for the focusing of a wiggler half 
pole is 


1 
p= kot), (7.80) 


x 


where dr is given by (7.56) and in the case of a wiggler magnet is equal to the 
deflection angle of a half pole. In the case of a rectangular wiggler pole 7 = —d¢ 
and the focusing in the deflecting plane vanishes as we would expect. In the 
nondeflecting plane (7.53) applies and the focal length is for small angles 7 and 6 


1 Ap/4 
= -| «In + 8(@] dz. (7.81) 
fy 0 

The focal length per wiggler half pole is after integration 
: Gaia eek (7.82) 
7 = Kx) f) — —K xo Of . : 
ne 4 

Here again setting 7 = —é; restores the result obtained in (7.77). 


The focusing in each single wiggler pole is rather weak and we may apply thin 
lens approximation to derive the transformation matrices. For this we consider the 
focusing to occur in the middle of each wiggler pole with drift spaces of length A, /4 
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on each side. With 2/f being the focal length of a full pole in either the horizontal 
plane (7.80) or vertical plane (7.82) the transformation matrix for each wiggler pole 


is finally 
— [1LaAp/4 1 0 1A,/4 
soe= (V4 )( 4) (0% os 
Ap A A 
_(1-3 #( - #) ~( 1 ae 
a Pace 


where the approximation A, « f was used. For a wiggler magnet of length Ly = 
5N Ap: we have N poles and the total transformation matrix is 


Mie dte = Ms (7.84) 


This transformation matrix can be applied to each plane and any pole rotation 
angle 7. Specifically, we set 7 = —K/y for a rectangular pole shape and n = 0 for 
pole rotations orthogonal to the path like in sector magnets. 


7.3.7 Hard-Edge Model of Wiggler Magnets 


Although the magnetic properties of wiggler magnets are well understood and easy 
to apply it is nonetheless often desirable to describe the effects of wiggler magnets 
in the form of hard-edge models. This is particularly true when numerical programs 
are to be used which do not include the feature of properly modeling a sinusoidal 
wiggler field. On the other hand accurate modeling is important since frequently 
strong wiggler magnets are to be inserted into a beam transport lattice. 

For the proper modeling of linear wiggler magnet properties we choose three 
conditions to be fulfilled. The deflection angle for each pole should be the same 
as that for the equivalent hard-edge model. Similarly the edge focusing must be 
the same. Finally, like any other bending magnet in an electron circular accelerator, 
a wiggler magnet also contributes to quantum excitation and damping of the beam 
emittance and beam energy spread. The quantum excitation is in first approximation 
proportional to the third power of the curvature while the damping scales like the 
square of the curvature similar to focusing. 

Considering now a wiggler field 


B(z) = Bo sinkpz, (7.85) 
we try to model the field for a half pole with parallel endpoles by a hard-edge 


magnet. Three conditions should be met. The deflection angle of the hard-edge 
model of length € and field B must be the same as that for a wiggler half pole, 
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Here we use p, for the bending radius of the equivalent hard-edge model and po 
for the bending radius at the peak wiggler field By. The edge focusing condition can 
be expressed by 


U 1 A 
= = = za sin? kpz dz = —e (7.87) 
Ph Po Jhalfpole 8/5 


1 

: 

Modeling a wiggler field by a single hard-edge magnet requires in linear beam 

optics only two conditions to be met which can be done with the two parameters 

B(z) and £ available. From (7.86), (7.87) we get therefore the hard-edge magnet 
parameters (Fig. 7.14) 


Pr = + po and Ly = Ap. (7.88) 

For a perfect modeling of the equilibrium energy spread and emittance due to 

quantum excitation in electron storage rings we would also like the cubic term to be 
the same 


Ly, 2 I 


+3 Ap 
5 =a sin” kypzdz = ——z (7.89) 
Ph Po JShalfpole 


37 py 


Since we have no more free parameters available, we can at this point only 
estimate the mismatch. With (7.87), (7.88) we get from (7.89) the inequality 


1 a 
— ~— 7.90 
3x 7 32 ( ) 
which indicates that the quantum excitation from wiggler magnets is not correctly 
treated although the error is only about 8 %. 
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Similarly, one could decide that the quadratic and cubic terms must be equal 
while the deflection angle is let free. This would be a reasonable assumption since 
the total deflection angle of a wiggler is compensated anyway. In this case the 
deflection angle would be underestimated by about 8 %. Where these mismatches 
are not significant, the simple hard-edge model (7.89) can be applied. For more 
accuracy the sinusoidal wiggler field must be segmented into smaller hard-edge 
magnets. 


7.4 Elements of Beam Dynamics 


The most basic elements of a beam transport line are drift spaces, bending magnets 
and focusing magnets or quadrupoles. Obviously, in a drift space of length ¢ 
the electric or magnetic field vanishes. Bending magnets act as beam guidance 
devices while quadrupoles will focus the beam. In the following section, we will 
discuss building blocks made up of bending magnets and quadrupoles, which exhibit 
features known from light optics thus justifying our extensive use of terminology 
from optics in particle beam dynamics. 


7.4.1 Building Blocks for Beam Transport Lines 


With special arrangements of bending and focusing magnets it is possible to 
construct lattice sections with particular properties. We may desire a lattice section 
with specific chromatic properties, achromatic or isochronous sections. In the next 
paragraphs we will discuss such lattice elements with special properties. 


General Focusing Properties 


The principal solutions and some elements of transformation matrices through an 
arbitrary beam transport line can reveal basic beam optical properties of this beam 
line. A close similarity to paraxial light optics is found in the matrix element C’(z). 
As shown schematically in Fig.7.15, parallel trajectories (uy = 0) are deflected 
by the focusing system through the matrix element C’(z) and emerge with a slope 
u'(z) = C'(z) uo. 

From light optics we know that —uo/u’(z) is defined as the focal length of the 
system. In analogy, we define therefore also a focal length f for a composite focusing 
system by setting 


f' =C@. (7.91) 


The focal point is defined by the condition u (zp) = 0 and is, therefore, located 
where the cosine like solution becomes zero or C(zp) = 0. 
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Fig. 7.15 Focusing in a quadrupole doublet 
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Fig. 7.16 Point to point imaging 


More similarities with paraxial light optics can be identified. Point to point 
imaging, for example, is defined in particle beam optics by the sine like function 
S(z), starting at the object plane at z = zo. The image point is located where the 
sine-like function crosses again the reference axis or where S(z; + z)) = 0 as shown 
in Fig. 7.16. 

By definition such a section of a beam transport system has a betatron phase 
advance of 180°. The beam size or object size Ho at zo is transformed by the 
cosine like function to become at the image point H(z;) = | C(z; + zo)| Ho and 
the magnification of the beam optical system is given by the absolute value of the 
cosine like function at the image point 


M = |C(z% + 20)I- (7.92) 


Chromatic Properties 


Very basic features can be derived for the chromatic characteristics of a beam 
transport line. In (5.81), we have already derived the dispersion function 


D(z) = S(z) [@ C(Z) dz — C(z) [x S(z) dz. (7.93) 
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From this expression we conclude that there is dispersion only if at least one 
of the two integrals in (7.93) is nonzero. That means only dipole fields can cause 
a dispersion as a consequence of the linear chromatic perturbation term «6. All 
other perturbation terms in (6.95), (6.96) are of higher order in 5 or depend on 
the transverse particle coordinates and therefore contribute only to higher order 
corrections of the dispersion function. 

Specifically, we find from (5.26) the lowest order chromatic quadrupole pertur- 
bation to be kxé. Since any arbitrary particle trajectory is composed of an energy 
independent part xg and an energy dependent part Dé, expressed by x = xg + D6, 
we find the lowest chromatic quadrupole perturbation to the dispersion function to 
be the second order term kD8* which does not contribute to linear dispersion. 

While some dispersion cannot be avoided in beam transport systems where 
dipole magnets are used, it is often desirable to remove this dispersion at least in 
some parts of the beam line. As a condition for that to happen at say z = Za, we 
require that D(zq) = 0. From ( 7.93) this can be achieved if 


Sa) _ So K OS@ & 
Cla) fo’ K@OC@ az’ 


(7.94) 


a condition that can be met by proper adjustments of the focusing structure. 


Achromatic Lattices 


A much more interesting case is the one, where we require both the dispersion 
and its derivative to vanish, D(zqg) = 0 and D’(zg) = 0. In this case we have no 
dispersion function downstream from the point z = zg up to the point, where the 
next dipole magnet creates a new dispersion function. The conditions for this to 
happen are 


D(Za) = 0 = —S(Za) Le + Clza) Is 5 


7.95 
Di (za) = 0 = —S' (za) fo + C’(za) Io, heme 


where we have set , = i KC dz and I, = AP «S dz. We can solve (7.95) for J, or 
I, and get 


[C(za) S’(Za) — Sa) C"(Za)] Te = 0, 


(7.96) 
[C(za) S’(za) — S(za) C’(Za)] Is = 0. 
Since C(za)S’(za) — S(za)C’(za) = 1, the conditions for a vanishing dispersion 
function are 
= (4 "(3) C() dz = 
l=], kOC@) da =0, (7.97) 


Is = fo’ k@ S(@ dz = 0. 
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A beam line is called a first order achromat or short an achromat if and only if 
both conditions (7.97) are true. The physical characteristics of an achromatic beam 
line is that at the end of the beam line, the position and the slope of a particle 
trajectory is independent of the energy. 


7.4.2 Isochronous Systems 


For the accelerating process we will find that the knowledge of the path length is 
of great importance. The path length L of any arbitrary particle trajectory can be 
derived by integration to give 


Lo ds Lo 
t= fa=f saa | x2 + yi? + (1+ eyxrde, (7.98) 
0 z 0 


where Lp is the length of the beam line along the ideal reference path. For simplicity 
we have ignored a vertical deflection of the beam. The path length variation due to 
a vertical bend would be similar to that for a horizontal bend and can therefore be 
easily derived form this result. Since x’, y’ and x,x are all small compared to unity, 
we may expand the square root and get in keeping only second order terms 


Lo 


L= |] [ltext 4107+ y¥* +122) dz + 00). (7.99) 
0 


Utilizing (5.83) we get from (7.99) for the path length difference 
Lo Lo 
(L —_ Lo) sector= XO if K,(Z)C(Z) dz + x, i K(Z)S(Z) dz (7.100) 
0 0 


Io 
+6 / K,(Z)D(Z) dz. 
0 


The variation of the path length has two contributions. For 6 = 0 the path length 
varies due to the curvilinear coordinate system, where dipole fields exist. This is a 
direct consequence of the coordinate system which selects a sector magnet as its 
natural bending magnet. The ideal path enters and exits this type of dipole magnet 
normal to its pole face as shown in Fig.7.17. It becomes obvious from Fig. 7.17 
that the path length difference depends on the particle position with respect to the 
reference path and is in linear approximation 


dé = £— Ly = (m +x) dy— podg. (7.101) 
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Fig. 7.18 Path length in a wedge magnet 


Figure 7.18 displays the general situation for a wedge magnet with arbitrary 
entrance and exit pole face angles. The path length differs from that in a sector 
magnet on either end of the magnet. The first integral in (7.100) therefore must 
be modified to take into account the path length elements in the fringe field. For a 
wedge magnet we have therefore instead of (7.100) 


Lo 
(L = Lo) wedge= XO / Ky(Z) C(Z) dz 
0 
+ [C(Zo)x0 + polno + [CZe)x0 + pole 
— xoC(zo) tan no — Xo C(Ze) tan Ne 
Lo Io 
+X) i Ky(Z)S(Z) dz + 6 / K,(Z)D(Z) dz (7.102) 
0 0 
~ (L = Lo) sector + O(2) . 


Here [C(z) x9 + po] 7 is the arc length through the wedge-like deviations from a 
sector magnet which must be compensated by the decrease or increase C(z) x tan 7 
in the adjacent drift space. For small edge angles both terms compensate well and 
the total path length of a wedge magnet is similar to that of a sector magnet. In 
general we therefore ignore path length variations in wedge magnets with respect to 
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sector magnets as well as those in the adjacent drift spaces. For large edge angles, 
however, this assumption should be reconsidered. 

Equation (7.100) imposes quite severe restrictions on the focusing system if the 
path length is required to be independent of initial condition and the energy. Since 
the parameters xo,xj and 6 are independent parameters for different particles, all 
three integrals in (7.100) must vanish separately. An isochronous beam transport 
line must therefore be a first order achromat (7.97) with the additional condition 
that f «,DdzZ = 0. 

For highly relativistic particles (68 ~ 1) and this condition is equivalent to being 
an isochronous beam line. In general, any beam line becomes isochronous if we 
require the time of flight rather than the path length to be equal for all particles. 
In this case we have to take into account the velocity of the particles as well as its 
variation with energy. The variation of the particle velocity with energy introduces 
in (7.100) an additional chromatic correction and the variation of the time of flight 
becomes 


Bc(T —To) = xole + xp Is + 8a —y”). (7.103) 


In straight beam lines, where no bending magnets are involved, (7.103) vanishes 
and higher than linear terms must be considered. From (7.99) it is obvious that 
the bending independent terms are quadratic in nature and therefore isochronicity 
cannot be achieved exactly since 


Lo 
pear = | (x? + y')dz > 0. (7.104) 
0 


This integral is positive for any particle oscillating with a finite betatron amplitude. 
A straight beam transport line is therefore an isochronous transport system only in 
first order. 


Problems 


7.1 (S). Sketch a quadrupole doublet and draw the sine- and cosine-like trajectories 
through the quadrupole doublet to the focal point for the horizontal and vertical 
plane. Verify that (7.20) is indeed true. (hint: first define from where to where you 
need to measure the combined focal length f ). 


7.2 (S). Consider a thin quadrupole doublet with a drift space of 1 m between them. 
The quadrupole strengths are to be adjusted to make a focal point in both planes 
at a point 5m from the second quadrupole. Determine the quadrupole strengths 
and calculate the combined doublet focal length in both planes. Sketch the doublet 
focusing and define in this sketch the calculated combined focal lengths. 
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7.3 (S). Consider a quadrupole doublet made of thin lenses. a) Calculate the focal 
length of a quadrupole doublet with |fi| = [f:| = 5m and a distance between the 
magnets of d = | m. Plot for this doublet the focal length as a function of particle 
momentum —5% < Ap/p < 5%. b) Use a parallel beam of radius ro and maximum 
divergence r’ and calculate the beam radius r at the focal point of this doublet. c) 
Plot the magnification r/ro as a function of momentum —5 % < Ap/p < 5%. What 
is the chromatic aberration (r — ro) /ro of the spot size? 


7.4 (S). Sector and rectangular magnets have opposite focusing properties. Deter- 
mine the geometry of a wedge magnet with equal focusing in both planes (ignore 
the gap effect). 


7.5 (S). In an arbitrary open beam transport line, we assume that at the point zp the 
particle beam is kicked in the horizontal or vertical plane by the deflection angle 
0. What is the betatron amplitude for the beam at any point z downstream from zo? 
To maximize the betatron amplitude at z how should the lattice functions, betatron 
function and/or phase, be chosen at zp and z? 


7.6 (S). Design a beam bump within three cells of a symmetric FODO lattice $QF 17 
QD) -QF>-QD2-QF3-QD3-4QF4 with a betatron phase advance We = 90° per cell. 
Further assume there are special coils in the quadrupoles to produce dipole fields 
which can be used to deflect the beam. a) Construct a symmetric beam bump which 
starts at QF;, ends at QF, and reaches an amplitude Ay = 2cm in the center of 
QD». How many trim coils need to be activated? b) Derive the relative kick angles 
required to construct the beam bump and calculate the beam displacement in each 
quadrupole. Is Ay the maximum amplitude of the beam bump? Why? Why not? 
(hint: do not use betatron and phase functions, but use thin lens approximation) 


7.7. a) Design a symmetric thin lens triplet with a focal point for both planes at the 
same point z = zr. b) Calculate and plot the betatron function for the quadrupole 
triplet and some drift space extending beyond the focal point. The value for the 
betatron function be 8 = 8m at the entrance to the triplet z = 0 where we also 
assume a@ = 0 c) Derive the phase advance in one plane between z = 0 and z = z 
both from the elements of the transformation matrix and by integrating the betatron 
function. Both method should give the same results. (note: do the integration roughly 
from the drawing of the betatron function with linear interpolation). 
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Chapter 8 
Particle Beams and Phase Space 


The solution of the linear equations of motion allows us to follow a single charged 
particle through an arbitrary array of magnetic elements. Often, however, it is 
necessary to consider a beam of many particles and it would be impractical to 
calculate the trajectory for every individual particle. We, therefore, look for some 
representation of the whole particle beam. 

To learn more about the collective motion of particles, we observe their dynamics 
in phase space. Each particle at any point along a beam transport line is represented 
by a point in six-dimensional phase space with coordinates (x, p;. y, py. 8, E) where 
Py © pox’ and py ® poy are the transverse momenta with cpp = Eo, s the 
coordinate along the individual trajectory, Eo the ideal particle energy and E the 
particle energy. Instead of the energy E often the momentum cp or the momentum 
deviation from the ideal momentum Ap = p — po or the relative momentum 
deviation Ap/po may be used. We use the momentum to study particle dynamics in 
the presence of magnetic field. In accelerating systems, like linear accelerators, the 
use of the particle’s kinetic energy is more convenient. Similarly, when the beam 
energy stays constant, we use instead of the transverse momenta rather the slope 
of the trajectories x’, y’ which are proportional to the transverse momenta and are 
generally very small so we may set sinx’ ~ 1’, etc. 

The coupling between the horizontal and vertical plane is being ignored in linear 
beam dynamics or treated as a perturbation as is the coupling between transverse 
and longitudinal motion. Only the effect of energy errors on the trajectory will be 
treated in this approximation. First, however, we set AE = 0 and represent the beam 
by its particle distribution in the horizontal (x,x’) or vertical (y, y’)-phase space 
separately. Because of the absence of coupling between degrees of freedom in this 
approximation we may split the six-dimensional phase space into three independent 
two-dimensional phase planes. 
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214 8 Particle Beams and Phase Space 
8.1 Beam Emittance 


Particles in a beam occupy a certain region in phase space which is called the 
beam emittance and we define three independent two-dimensional beam emittances. 
Their numerical values multiplied by z are equal to the area occupied by the beam 
in the respective phase plane. The beam emittance is a measure of the transverse 
or longitudinal temperature of the beam and depends on the source characteristics 
of a beam or on other effects like quantized emission of photons into synchrotron 
radiation and its related excitation and damping effects. 

A simple example of a beam emittance and its boundaries is shown in Fig. 8.1, 
where particles emerge from a disk with radius w and where the direction of the 
particle trajectories can be anywhere within +90° with respect to the surface of the 
source. The proper phase space representation of this beam at the surface of the 
source is shown in Fig. 8.1 (left). All particles are contained in a narrow strip within 
the boundaries Xmax = + w but with a large distribution of transverse momenta 
(px = po tanx’). 

Any real beam emerging from its source will be clipped by some aperture 
limitations of the vacuum chamber. We assume a simple iris as the aperture 
limitation located at a distance d from the source and an opening with a radius 
of R = w. The fact that we choose the iris aperture to be the same as the size of the 
source is made only to simplify the arguments. Obviously many particles emerging 
from the source will be absorbed at the iris. The part of the beam which passes the 
iris occupies a phase space area at the exit of the iris like the shaded area shown 
in Fig. 8.1 (right). Among all particles emerging from the source with an amplitude 
x = tw only those will pass the iris for which the slope of the trajectory is between 
x = Oand x = +2w/é. This beam now has a measurable beam emittance as 
determined by the source and iris aperture. 
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Fig. 8.1 Beam from a diffuse source in real space and in phase space (/eft). Reduction of phase 
space (shaded area) due to beam restriction by an iris aperture (right) 
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The concept of describing a particle beam in phase space will become very 
powerful in beam dynamics since we can prove that the density of particles in 
phase space does not change along a beam transport line, where the forces acting 
on particles can be derived from macroscopic electric and magnetic fields. In other 
words particles that are within a closed boundary in phase space at one point of 
the beam line stay within that boundary. This is Liouville’s theorem which we will 
prove for the fields used in beam dynamics. 


8.1.1 Liouville’s Theorem* 


In Chap. 7 we have learned to follow individual particles through an arbitrary beam 
transport line made up of drift spaces, dipole and quadrupole magnets. Since this is 
true for any particle with known initial parameters (xo, x4, yo, Yo) it is in principle 
possible to calculate trajectories along a beam line for a large number of particles 
forming a particle beam. This is impractical, and we are therefore looking for more 
simple mathematical methods to describe the beam as a whole. To this end, we make 
use of methods in statistical mechanics describing the evolution of a large number 
of particles forming a particle beam. 

Liouville’s theorem is of specific importance in this respect and we will use it 
extensively to describe the properties of a particle beam as a whole. This theorem 
states that under the influence of conservative forces the particle density in phase 
space stays constant. Since ((7.1), (7.2)) is equivalent to the equation of a free 
harmonic oscillator, we know that the motion of many particles in phase space 
follow Liouville’s theorem. A more direct proof of the validity of Liouville’s 
theorem in particle beam dynamics can be obtained by observing the time evolution 
of an element in the six-dimensional phase space. 

If W is the particle density in phase space, the number of particles within a six- 
dimensional, infinitesimal element is 


W(X, YZ, Px» Py» Pz) Ax dy dz dp, dpy dp;. (8.1) 
The phase space current created by the motion of these particles is 
J= (Wx, Vy, YZ, WV py, V Dy, Y pz), (8.2) 


where the time derivatives are to be taken with respect to a time t measured along 
the trajectory of the phase space element. This time is to be distinguished from the 
reference time f along the reference orbit in the same way as we distinguish between 
the coordinates s and z. We set therefore x = dx/dt, etc. The phase space current 
must satisfy the continuity equation 


. ow 
Vit+t—=0. (8.3) 
OT 
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From this, we get with (8.2) and the assumption that the particle location does not 
depend on its momentum and vice versa 


ow ; : 
ar eld Cas as 2 (8.4) 
=v, W+w (V,r)+p Vp +W (VP), 


—-(2# 39 a —-(2 39 90 i ivati 
where V, = (4. iy? a) and V, = (4. es ). The time derivative of the space 


vector r 


r cp 


ci Jap me’ (8.5) 
does not depend on the location r, and we have therefore 
Vi.r=0. (8.6) 
From the Lorentz force equation we get 
V,p = eV,|r x B] = eB(V, xr) —er(V, x B). (8.7) 


The magnetic field B does not depend on the particle momentum p and therefore 
the second term on the right hand side of (8.7) vanishes. For the first term, we find 


—_ Ba OE OY OE, 20 Dz = CPy Pz 
V,xr = 0 because (V, xr), = dg bs and 


c z = 

apy apy af p2+m?c2 (p2-+m2¢2)*/ " 
=, where we have used p? = p: + p, + pz. We get a similar result for the other 
components and have finally for (8.7) 


Vp = 0. (8.8) 


With these results, we find from (8.4) the total time derivative of the phase space 
density W to vanish 


ow ; . dw 
—+VWr+V,Wp = — =0, (8.9) 
OT dt 

proving the invariance of the phase space density W. 

Independent from general principles of classical mechanics we have shown 
the validity of Liouville’s theorem for the motion of charged particles under the 
influence of Lorentz forces. This is obviously also true for that part of the Lorentz 
force that derives from an electrical field since 


V,p=eV,E = 0 (8.10) 


because the electric field E does not depend on the particle momentum. 
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The same result can be derived in a different way from the property of the 
Wronskian in particle beam dynamics. For that, we assume that the unit vectors 
U,,U2...,U6 form a six-dimensional, orthogonal coordinate system. The determi- 
nant formed by the components of the six vectors x;,X2,...,X6 in this system is 
equal to the volume of the six-dimensional polygon defined by the vectors x;. The 
components of the vectors x; with respect to the base vectors u; are x; and the 
determinant is 


X11 X12 X13 X14 X15 X16 
X21 X22 Xz ct ee 
X31 X32 ce ct tt ee 
= = |x], X2, X3, X4, 5, Kol. (8.11) 
Ray See ese Sw yeas: wea 
XG] He ee eae se eae 


XG PL ae cet Nee 


We will derive the transformation characteristics of this determinant considering a 
transformation 


y, = Mx;, (8.12) 


where M = (ajj) and the determinant (8.11) then transforms like 


6 6 6 
IV1,¥2--- Yol= ) teas > ee bjs Xj 


i=l j=l j=l 
6 
= ) Qj, A2jr +--+ Abje | xXj,. Xjz, se Xe. (8.13) 
The determinant |x;,, Xj, ... Xj, | 1s equal to zero if two or more of the indices j; 


are equal and further the determinant changes sign if two indices are interchanged. 
These rules lead to 


6 
lV. ¥2--- Ye | — ) Ejujz.i6 Aj, A2j2 - ++ Moje | X1,X2,---,X6 |, (8.14) 
ji=l 
where 


+1 for even permutations of the indices j; 
€j,,j...6 = ) —1 for odd permutations of the indices j; (8.15) 
0 if any two indices are equal. 
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The sum a Ejrjo-i¢ Uj, Q2jr «++ G6jg 18 just the determinant of the transformation 
matrix M and finally we get 


IVisYo-++ Yoel = |M| |x1,x2,..-,X6l- (8.16) 


For a particle beam transport line, however, we know that |M| is the Wronskian 
with 


W=|M|=1. (8.17) 


If we now identify this six-dimensional space with the six-dimensional phase space, 
we see from (8.16) and (8.17) that the phase space under the class of transformation 
matrices considered in beam dynamics is constant. Conversely, if W # 1, we would 
get a change in phase space. 


8.1.2 Transformation in Phase Space 


Liouville’s theorem provides a powerful tool to describe a beam in phase space. 
Knowledge of the area occupied by particles in phase space at the beginning of a 
beam transport line will allow us to determine the location and distribution of the 
beam at any other place along the transport line without having to calculate the 
trajectory of every individual particle. 

In the previous paragraph, we found that the phase space density is a constant 
under the assumed forces. There are three space and three momentum coordinates. 
In beam dynamics, we often use trajectory slopes instead of transverse momenta. 
Similar relations exist for other coordinates. Using slopes instead of momenta 
preserves the phase space density only as long as po is a constant, which is true 
in most beam dynamics calculations. We distinguish therefore two definitions of 
the beam emittance, the normalized emittance €, based on space-momentum phase 
space and the geometric emittance € based on space-slope phase space. Both are 
related by 


én = Bye, 


where y is the relativistic factor and 6 = v/c the relative particle velocity. 
In beam dynamics it has become customary to surround all particles of a beam 
in phase space by an ellipse called the phase ellipse (Fig. 8.2) described by 


yx + 2axx’ + Bx? = «, (8.18) 


where a, f, y and € are ellipse parameters. This seemingly arbitrary boundary will 
soon gain physical significance. The area enclosed by the ellipse is called the 
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Fig. 8.2 Phase space ellipse 
-ave/y 


tan2@ = 2a/(y-B) 


aN 
> 
VER 
geometric beam emittance €! defined by 
dx dx’ = ze, (8.19) 


ellipse 


while the parameters w, 6 and y determine the shape and orientation of the ellipse. 
This characterization of the beam emittance by the area of an ellipse seems at 
first arbitrary although practical. Later in Sect. 8.2, we will see that all particles 
travel along their individual ellipses in phase space. If we now choose that or those 
particles on the largest phase ellipse within a particular beam, we know that all 
other particles within that ellipse will stay within that ellipse. We are thereby able 
to describe the collective behavior of a beam formed by many particles by the 
dynamics of a single particle. 

Since all particles enclosed by a phase ellipse stay within that ellipse, we only 
need to know how the ellipse parameters transform along the beam line to be able 
to describe the whole particle beam. Let the equation 


YoXo + 2ax0Xp + Box> =€E (8.20) 


be the equation of the phase ellipse at the starting point z = 0 of the beam line. 
Any particle trajectory transforms from the starting point z = 0 to any other point 
. x (z) C(z) S(z) a) . 
z # 0 by the transformation = . Solving for xo and 
me (a) = (Case) (2 pai 
Xp and inserting into (8.20), we get after sorting of coefficients and stopping to show 


'The literature is not always uniform in the representation of numerical values for the beam 
emittance. Often the beam emittance is quoted in units of z-mm-mrad and it is not always clear if 
the factor z is included in the numerical value or not. We define in this book the beam emittance 
as the beam phase space area divided by z in accordance with Hamiltonian dynamics. 


220 8 Particle Beams and Phase Space 


explicitly the (z)-dependence 


€ = (C7 By — 2S'Cay + S79) x" (8.21) 
+ 2(—CC'Bo + S'C ap + SC’ao — SS’ yo) xx’ 
+ (C’By —2S Cay + S*y9) x”. 
This equation can be brought into the form (8.18) by replacing the coefficients in 
(8.21) with 
y = C” Bo _ 25'C' ao + SV, 
a = —CC'Bo + (S'C + SC’)an — SS'yo, (8.22) 
B = iC Bo — 28 Cao + 5’ yp. 

The resulting ellipse equation still has the same area z € as we would expect, but 
due to different parameters y,a, 8, the new ellipse has a different orientation and 
shape. During a transformation along a beam transport line the phase ellipse will 
continuously change its form and orientation but not its area. In matrix formulation 


the ellipse parameters, which are also called Twiss parameters [11], transform from 
(8.22) like 


B() C2 -2c8_— 8? Bo 
a(z) | =| -cc’ cs’ +C’s —ss' | | a |. (8.23) 
y (2) Cc? —C9 7} x 


The orientation, eccentricity and area of an ellipse is defined by three parameters, 
while (8.20) includes four parameters a, 6, y and €. Since the area is defined by € 
we expect the other three parameters to be correlated. From geometric properties of 
an ellipse we find that correlation to be 


By—o? =1. (8.24) 


So far, we have used only the (x, x’)-phase space, but the results are valid also for the 
(y, )-phase space. Equation (8.23) provides the tool to calculate beam parameters 
anywhere along the beam line from the initial values Bo, ao, yo. 

The phase ellipse in a drift space, for example, becomes distorted in a clock 
wise direction without changing the slope of any particle as shown in Fig. 8.3. If the 
drift space is long enough a convergent beam transforms eventually into a divergent 
beam, while the angular envelope A = x/,,, = ,/€V stays constant. The point z,, at 
which the beam reaches its minimum size is determined by a(zy) = 0 and we get 
from (8.23) for the location of a beam waist in a drift section. 


ao 
Yo 


(8.25) 
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z=0 Z=Z Z=Zy 


diverging converging beam diverging 
beam beam waist beam 


Fig. 8.4 Transformation of a phase ellipse due to a focusing quadrupole. The phase ellipse is 
shown at different locations along a drift space downstream from the quadrupole 


This point of minimum beam size is up or downstream of z = zo depending on the 
sign of a being negative or positive, respectively. 
More formally, the transformation through a simple drift space of length ¢ is 


B (6 1-2¢ ?\ (Bo 
a(é)}=[0 1 -€]}] a). (8.26) 
y (£) 00 1 YO 


which describes, for example, the transition of a convergent phase ellipse to a 
divergent phase ellipse as shown in Fig. 8.4. Particles in the upper half of the 
phase ellipse move from left to right and particles in the lower half from right to 
left. During the transition from the convergent to divergent phase ellipse we find 
an upright ellipse which describes the beam at the location of a waist. The form 
and orientation of the phase ellipse tells us immediately the characteristics beam 
behavior. Convergent beams are characterized by a rotated phase ellipse extending 
from the left upper quadrant to the lower right quadrant while a divergent beam 
spreads from the left lower to the right upper quadrant. A symmetric phase ellipse 
signals the location of a waist or symmetry point. 


222 8 Particle Beams and Phase Space 


A divergent beam fills, after some distance, the whole vacuum chamber aperture 
and in order not to lose beam a focusing quadrupole must be inserted. During the 
process of focusing a diverging beam entering a focusing quadrupole reaches a 
maximum size and then starts to converge again. This transformation, generated by 
a focusing quadrupole is shown in Fig. 8.4, where we recognize slopes of particle 
trajectories to reverse signs thus forming a convergent beam. 

After this step, the beam may develop as shown for a drift space until the next 
focusing quadrupole is required. In reality this focusing scenario is complicated 
by the fact that we need also vertical focusing which requires the insertion of 
defocusing quadrupoles as well. 


8.1.3 Beam Matrix 


Particle beams are conveniently described in phase space by enclosing their 
distribution with ellipses. Transformation rules for such ellipses through a beam 
transport system have been derived for a two-dimensional phase space and we 
expand here the discussion of phase space transformations to more dimensions. The 
equation for an n-dimensional ellipse can be written in the form 


uo u=ti, (8.27) 


where the symmetric matrix o is still to be determined, u’ is the transpose of the 
coordinate vector u defined by 


be 


ae 


< 


=~ 


(8.28) 


i 
II 


aa 


The volume of this n-dimensional ellipse is 


2 
gt! 2 


= aaa Tap eet (8.29) 


Vn 


where J” is the gamma function. Applying (8.27) to the two dimensional phase 
space, we get for the ellipse equation 


Ox +20nxx tonx" =1 (8.30) 
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and comparison with (8.18) defines the beam matrix with well known beam 


parameters as 
o= & 7) =e B ao) (8.31) 
021 022 —a y 


Since only three of the four parameters in the beam matrix o are independent, we 
find that 02; = 012. This identification of the beam matrix can be expanded to six 
or arbitrary many dimensions including, for example, spin or coupling terms which 
we have so far neglected. The two-dimensional “volume” or phase space area is 


V2 = aVdeto = 14/041 022 — 07, = TE (8.32) 


consistent with the earlier definition of beam emittance, since By — a? = 1. 

The definition of the beam matrix elements are measures of the particle distri- 
bution in phase space. As such, we would expect different definitions for different 
distributions. Since most particle beams have a Gaussian or bell shaped distribution, 
however, we adopt a uniform definition of beam matrix elements. The betatron 
oscillation amplitude for a particular particle and its derivative is described by 


x; = aiv/B cos (Ww + Wi), (8.33) 
ae Soe 
igen ae aE 


We form now average values of all particles within a well defined fraction of a beam 
and get 


sin(w + Wj). (8.34) 


(3) = (a3 cos? (W + vid) B = 5 (al) B = eB. (8.35) 
2 2 

(i) = (oA) 54 + (08) 53 = bel) =e, (836) 

(x;x}) = —(a7)a5 = —ea, (8.37) 


where we have assumed a Gaussian particle distribution and a beam emittance 
defined by € = (a? sin? (Wy — Vi))- This definition describes that part of the beam 
which is within one standard deviation of the distribution in multidimensional phase 
space. The beam matrix elements are finally defined by 


On = (x) = ey, (8.38) 
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With this definition the beam emittance can be expressed by 


€ = 01107) — 07) = (x7) a) = (x;x4)? . (8.39) 

This definition is generally accepted also for any arbitrary particle distribution. 
Specifically, beams from linear accelerators or proton and ion beams can have 
arbitrary distributions. 

Similar to the two-dimensional case, we look for the evolution of the n 
dimensional phase ellipse along a beam transport line. With M(P;|P2) the n x n 
transformation matrix from point Po to P; we get uy = M(P;|Po) uo and the 
equation of the phase ellipse at point P, is 


(M7!uy)'o9 (Mla) = uloy lu, = 1. (8.40) 
With (Mr) 051M! = [Moo.M"'] |! the beam matrix transforms therefore like 
0, = MoyoM’. (8.41) 


This formalism will be useful for the experimental determination of beam emit- 
tances. 


Measurement of the Beam Emittance 


The ability to manipulate in a controlled and measurable way the orientation and 
form of the phase ellipse with quadrupoles gives us the tool to experimentally 
determine the emittance of a particle beam. Since the beam emittance is a measure 
of both the beam size and beam divergence, we cannot directly measure its value. 
While we are able to measure the beam size with the use of a fluorescent screen, for 
example, the beam divergence cannot be measured directly. If, however, the beam 
size is measured at different locations or under different focusing conditions such 
that different parts of the ellipse will be probed by the beam size monitor, the beam 
emittance can be determined. 
Utilizing the definition of the beam matrix in (8.31) we have 


011 022 — is =< (8.42) 


and the beam emittance can be measured, if we find a way to determine the beam 
matrix. To determine the beam matrix oo at point Po, we consider downstream 
from Pp a beam transport line with some quadrupoles and beam size monitors like 
fluorescent screens at three places P; to P3. From (8.23) and (8.31) we get for the 
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beam sizes 0;,;; at locations P; three relations of the form? 
Oi = C0011 + 28iCio0,12 + $200.22 (8.43) 


which we may express in matrix formulation by 


O11 Cr 208i 8; 00,11 00,11 
O21 | = | Cy 2C2S2 SF 0012 |} = Mao | oo.12 |. (8.44) 
03,11 C2 2C383 S2 00,22 00,22 


where C; and S$; are elements of the transformation matrix from point Pp to P; and 
6;;« are elements of the beam matrix at P;. Equation (8.44) can be solved for the 
beam matrix elements 0;,;, at Po 


00,11 O111 

1h —-l1, 4T 
00,12 = (M,M.,) M, 02,11 7 (8.45) 
00,22 03.11 


where the matrix MM, is known from the parameters of the beam transport line 
between Pp and P; and Mt is the transpose of it. The solution vector can be used in 
(8.42) to calculate finally the beam emittance. 

This procedure to measure the beam emittance is straight forward but requires 
three beam size monitors at appropriate locations such that the measurements can 
be conducted with the desired resolution. A much simpler procedure makes use of 
only one beam size monitor at P; and one quadrupole between Pp and P|. We vary 
the strength of the quadrupole and measure the beam size at P; as a function of the 
quadrupole strength. These beam size measurements as a function of quadrupole 
strength are equivalent to the measurements at different locations discussed above 
and we can express the results of n beam size measurements by the matrix equation 


O11 Cr 2015) 87 
02.11 C 2028> Se 90,11 90,11 
_ ‘ ‘ 00,12 = Mon 00.12 | - (8.46) 
: , . ‘ 00,22 00,22 
On J Cc 2C,8n s 


Note: the sign of the cross term is different from (8.23) because 012 = —a. 
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This method of emittance measurement is also known as quad scan. Like in (8.45) 
the solution is from simple matrix multiplications 


O11 
90,11 On11 
T —1l,y4T , 
00,12 = (M5.Mo.n) Mon (8.47) 
00,22 
On,11 


An experimental procedure has been derived which allows us to determine the 
beam emittance through measurements of beam sizes as a function of focusing. 
Practically, the evaluation of (8.47) is performed by measuring the beam size 
01,11(k) at P; as a function of the quadrupole strength k and comparing the results 
with the theoretical expectation 


oak) = C?(k)oo11 + 2C(K)S(k)o0,12 + S?(k)o0.22. (8.48) 


By fitting the parameters 00,11, 00,12 and 0,22 to match the beam size measure- 
ments, one can determine the beam emittance from (8.42). However, this procedure 
does not guarantee automatically a measurement with the desired precision. To 
accurately fit three parameters we must be able to vary the beam size considerably 
such that the nonlinear variation of the beam size with quadrupole strength becomes 
quantitatively significant. An analysis of measurement errors indicates that the beam 
size at Po should be large and preferable divergent. In this case variation of the 
quadrupole strength will dramatically change the beam size at P; from a large value 
when the quadrupole is off, to a narrow focal point and again to a large value by 
over focusing. 

A most simple arrangement consists of a single quadrupole and a screen at a 
distance d. Assuming that the length €, of the quadrupole is £4 < d, we can use 
thin lens approximation and the total transformation matrix is then 


Cai 1) = (01) art): a 
Equation (8.48) becomes 
oai(k) = (1- d lyk)” 9011 + 2 (1 —d gk) doo12 + d°o0. 
or after reordering 


o1n(®) = (@€00.1) 1 + (—2d £400.11 — 2d?Lq00,12) k (8.50) 


+ (0,11 + 2d 00,12 + d’0029) . 
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Fitting 0),1;(k) with a parabola (ak? + bk+ c) will determine the whole beam 
matrix 09 by 


a 
00.11 = Zz? 

Ss 

—b— 2dl400 11 

= —__——_, 8.51 

99,12 2PL, (8.51) 

¢ — 00,11 — 2doo,12 
00,22 = . 


d2 


The beam matrix not only defines the beam emittance but also the betatron 
functions at the beginning of the quadrupole in this measurement. We gain with 
this measurement a full set of initial beam parameters (ao, Bo, ¥6. €) and may now 
calculate beam parameters at any point along the transport line. 


8.2. Betatron Functions 


The trajectory of a particle through an arbitrary beam transport system can be 
determined by repeated multiplication of transformation matrices through each of 
the individual elements of the beam line. This method is convenient especially for 
computations on a computer but it does not reveal many properties of particle 
trajectories. For deeper insight, we attempt to solve the equation of motion 
analytically. The differential equation of motion is 


u" +k(z)u=0, (8.52) 


where u stands for x or y and A(z) is an arbitrary function of z resembling the 
particular distribution of focusing along a beam line. For a general solution of (8.52) 
we apply the method of variation of integration constants and use an ansatz with a 
z-dependent amplitude and phase 


u(z) = Je V B(z) cos[w(z) — Wol, (8.53) 


which is similar to the solution of a harmonic oscillator with a constant coefficient k. 
The quantities « and Wo are integration constants. From (8.53) we form first and 
second derivatives with the understanding that 6B = B(z), ¥ = W(z), etc. 


1 eB 
VaR 

B iid ae p? Bp’ 
ae cos(w — Yo) — Ven 


—e/B sin(w — Wo) Ww" — Ve VB cos(w — Wo) W”, 


cos(w — Wo) — Vev/B sin(h — Wo) W’. 


u"= Je sin(y — Yo) w’ (8.54) 
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and insert into (8.52). The sum of all coefficients of the sine and cosine terms 
respectively must vanish separately to make the ansatz (8.53) valid for all phases w. 
From this, we get the two conditions: 


3(BB" — 58°) — Bw? + Bk =0 (8.55) 
and 
Bly’ + By" =0. (8.56) 
Equation (8.56) can be integrated immediately since B’y + Bw” = (B w’)’' for 
Bw’ =const = 1, (8.57) 
where a specific normalization of the phase function has been chosen by selecting 


the integration constant to be equal to unity. From (8.57) we get for the phase 
function 


4 dz 
va =f 2@ + (8.58) 


Knowledge of the function 6(z) along the beam line obviously allows us to 
compute the phase function. Inserting (8.57) into (8.55) we get the differential 
equation for the function 6(z) 


5BB" — 48° + Bk =1, (8.59) 


which becomes with a = —} ‘and y = (1+ a’)/B 


Bp" + 2kB-—2y =0. (8.60) 

The justification for the definition of y becomes clear below, when we make the 
connection to ellipse geometry and (8.24).With a’ = —5f” this is equivalent to 

a’ =kB-y. (8.61) 


Before we solve (8.60) we try to determine the physical nature of the functions 
B(z), a(z), and y(z). To do that, we note first that any solution that satisfies (8.60) 
together with the phase function y(z) can be used to make (8.53) a real solution 
of the equation of motion (8.52). From that solution and the derivative (8.54) we 
eliminate the phase (w — yo) and obtain a constant of motion which is also called 
the Courant-Snyder invariant [4] 


yur + 2auu' + Bu? =e. (8.62) 
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This invariant expression is equal to the equation of an ellipse with the area zre 
which we have encountered in the previous section and the particular choice of 
the letters 6,a, y,¢ for the betatron functions and beam emittance becomes now 
obvious. The physical interpretation of this invariant is that of a single particle 
traveling in phase space along the contour of an ellipse with the parameters 6, a, 
and y. Since these parameters are functions of z however, the form of the ellipse 
is changing constantly but, due to Liouville’s theorem, any particle starting on that 
ellipse will stay on it. The choice of an ellipse to describe the evolution of a beam in 
phase space is thereby more than a mathematical convenience. We may now select 
a single particle to define a phase ellipse and know that all particles with lesser 
betatron oscillation amplitudes will stay within that ellipse. The description of an 
ensemble of particles forming a beam have thereby been reduced to that of a single 
particle. 

The ellipse parameter functions or Twiss parameters 6,a,y and the phase 
function w are called the betatron functions or lattice functions or Twiss functions 
and the oscillatory motion of a particle along the beam line (8.53) is called the 
betatron oscillation. This oscillation is quasi periodic with varying amplitude and 
frequency. 

To demonstrate the close relation to the solution of a harmonic oscillator, we use 
the betatron and phase function to perform a coordinate transformation 


(u, z) —- (w, v) (8.63) 
by setting 
u(z) 2 dz 
ee 7c : I B®) os 


where u(z) stands for x(z) and y(z) respectively. The new coordinates (w, W) are 
called normalized coordinates and equation of motion (8.52) transforms to 


=p), (8.65) 


which indeed is the equation of a harmonic oscillator with angular frequency one. 
This identity will be very important for the treatment of perturbing driving terms 
that appear on the right hand side of (8.65) which will be discussed in more detail 
in Sect. 8.3.1. 

So far, we have tacitly assumed that the betatron function 6(z) never vanishes or 
changes sign. This can be shown to be true by setting g(z) = ./ A(z) and inserting 
into (8.59). With B’ = 2qq/ and B” = 2 (q? +qq") we get the differential equation 


1 
q’ +kq- a 0. (8.66) 
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The term 1/g? prevents a change of sign of g(z). Letting g > 0 vary toward 
zero q" ~ 1/q> — o. This curvature, being positive, will become arbitrarily large 
and eventually turns the function g(z) around before it reaches zero. Similarly, the 
function q(z) stays negative along the whole beam line if it is negative at one point. 
Since the sign of the betatron function is not determined and does not change, it has 
became customary to use only the positive solution. 

The beam emittance parameter € appears as an amplitude factor in the equation 
for the trajectory of an individual particle. This amplitude factor is equal to the beam 
emittance only for particles traveling on an ellipse that just encloses all particles in 
the beam. In other words, a particle traveling along a phase ellipse with amplitude 
./é€ defines the emittance of that part of the total beam which is enclosed by this 
ellipse or for all those particles whose trajectories satisfy 


Bu? +2auu' + yu < &. (8.67) 


Since it only leads to confusion to use the letter € as an amplitude factor we will 
from now on use it only when we want to define the whole beam and set /e = a 
for all cases of individual particle trajectories. 


8.2.1 Beam Envelope 


To describe the beam and beam sizes as a whole, a beam envelope equation can 
be defined. All particles on the beam emittance defining ellipse follow trajectories 
described by 


xi(z) = Ve V B(z) cos[y(z) + 4], (8.68) 


where 4; is an arbitrary phase constant for the particle i. By selecting at every point 
along the beam line that particle i for which cos[y(z) + 6;] = £1, we can construct 
an envelope of the beam containing all particles 


E(2) = + Ve/ BQ. (8.69) 


Here the two signs indicate only that there is an envelope an either side of the 
beam center. We note that the beam envelope is determined by the beam emittance 
€ and the betatron function 6(z). The beam emittance is a constant of motion and 
resembles the transverse “temperature” of the beam. The betatron function reflects 
exterior forces from focusing magnets and is highly dependent on the particular 
arrangement of quadrupole magnets. It is this dependence of the beam envelope 
on the focusing structure that lets us design beam transport systems with specific 
properties like small or large beam sizes at particular points. 
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8.3. Beam Dynamics in Terms of Betatron Functions 


Properties of betatron functions can now be used to calculate the parameters of 
individual particle trajectories anywhere along a beam line. Any particle trajectory 
can be described by 


u(z) =avVB cosy +b VB sin y (8.70) 


and the amplitude factors a and b can be determined by setting at z = 0 


wv =0, B = Bo, u(0) = uo, (8.71) 


— ae 4 
a =a, u'(0) = up. 


With these boundary conditions we get 


a= gi (8.72) 
b= VE, lo + V Bo Mo, , 


and after insertion into (8.70) the particle trajectory and its derivative is 


u(z) = | Ecos + a sin) uy + VB Bo sin up, 
0 


1 


ul(z) = Tepe —a) cosy — (1 + aap) sin y] uo (8.73) 
0 
Bo : ; 
+ Fs w—asinw) up, 
or in matrix formulation 
( C2) S(2) ) _[ Vie Cos + ao sin) /BBo sin w 
C'(z) S"(2) VEBs cos y — ae sin yw (2 (cosy —asinw) 


(8.74) 


Knowledge of the betatron functions along a beam line allows us to calculate 
individual particle trajectories. The betatron functions can be obtained by either 
solving numerically the differential equation (8.59) or by using the matrix formalism 
(8.23) to transform phase ellipse parameters. Since the ellipse parameters in (8.23) 
and the betatron functions are equivalent, we have found a straightforward way 
to calculate their values anywhere once we have initial values at the start of the 
beam line. This method is particularly convenient when using computers to perform 
matrix multiplication. 
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Transformation of the betatron functions becomes very simple in a drift space 
where the transformation matrix is 


CZ) S() _ Lz 
fe =) 7 ({ :) : (8.75) 


The betatron functions at the point z are from (8.26) 


B(z) = Bo — 2a0z+ yor’, 
a(Z) = Qo — YoZ, (8.76) 
y(z) = Yo, 


with initial values Bo, ao, Yo taken at the beginning of the drift space. 

We note that y(z) = const. in a drift space. This result can be derived also 
from the differential equation (8.60) which for k = 0 becomes 6” = 2y and the 
derivative with respect to zis B’” = 2y’. On the other hand, we calculate from the 
first equation (8.76) the third derivative of the betatron function with respect to z to 
be B’”” = 0. Obviously both results are correct only if the y-function is a constant in 
a drift space where k = 0. 

The location of a beam waist is defined by a = 0 and occurs from (8.76) at 
Zw = Qo/Yo. The betatron function increases quadratically with the distance from 
the beam waist (see Fig. 8.5) and can be expressed by 


= 2 
B(z—2w) = Bw + oo (8.77) 


where fy is the value of the betatron function at the waist and z — zy is the distance 
from the waist. From (8.77) we note that the magnitude of the betatron function 
away from the waist reaches large values for both large and small betatron functions 
at the waist. We may therefore look for conditions to obtain the minimum value for 
the betatron function anywhere in a drift space of length 2L. For this we take the 
derivative of 6 with respect to By and get from (d6/d6, = 0) 


Bw,opt =L. (8.78) 


Fig. 8.5 Betatron function in 
a drift space 
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At either end of the drift space we have then 


B(L) =2 Bw,opt- (8.79) 


This is the optimum solution for the betatron function on either side of a drift 
space with length 2Z resulting in a minimum aperture requirement along a drift 
space of length L. The phase advance in a drift space is from (8.77) 


w(L) _ [ dz/ Bw = Seiad L _ IU ae L =a (8 80) 
Jo 14+ @/Bw)? By 2 Bw , , 


The phase advance through a drift space of length 2Z is therefore never larger 
than z and actually never quite reaches that value 


A Waitt <7. (8.81) 


8.3.1 Beam Dynamics in Normalized Coordinates 


The form and nomenclature of the differential equation (8.52) resembles very much 
that of a harmonic oscillator and indeed this is not accidental since in both cases the 
restoring force increases linearly with the oscillation amplitude. In particle beam 
dynamics we find an oscillatory solution with varying amplitude and frequency 
and by a proper coordinate transformation we are able to make the motion of a 
particle look mathematically exactly like that of a harmonic oscillator. This kind of 
formulation of beam dynamics will be very useful in the evaluation of perturbations 
on particle trajectories since all mathematical tools that have been developed for 
harmonic oscillators will be available for particle beam dynamics. 

We introduce Floquet’s coordinates, or normalized coordinates through the 
transformation 


(8.82) 


l* 


and 


z dz 
9 (2) = ae (8.83) 
o VB) 
Note, that we used in here a different normalization than that selected in (8.57) to 
adapt more appropriately to the issues to be discussed here. With this transformation 
we get for the first derivative 


! vB B’ 1 a 


= vB ew: = aa VB" (8.84) 


Uu 
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and for the second derivative 


oe Z 2 
n_ WwW a a 


~ y2p3/2 "TB ~ B3/2 


(8.85) 


where dots indicate derivatives with respect to the phase w = dw/dg, etc. We insert 
these expressions into (8.52) and get the general equation of motion expressed in 
normalized coordinates 


ui +ku= w+(5BB" —a* + kB’)v-w | = p(x. y,2), (8.86) 
———— 


=1 


1 
2 B3/2 


where the right-hand side represents a general perturbation term p(x, y, z) which 
was neglected so far. The square bracket is equal to unity according to (8.59) and 
the equation of motion takes the simple form of a harmonic oscillator with some 
perturbation 


Ww + v?w— v?B?/2n(x, y,z) = 0. (8.87) 


This nonlinear equation of motion can be derived from the Hamiltonian 
k=l Dk 
H = pW + pv?w? — vB? SB? rae (8.88) 
k=l 
where coupling has been ignored and 


n = n ket 7 
P@.2) = ope! = eB Ew, (8.89) 
k=1 k=1 


where px is a perturbation of order k. Later, we will perform another canonical 
transformation to action-angle variables, which brings the Hamiltonian into a 
convenient form to exhibit effects of perturbations. 

Since the parameter v is constant, we have in the case of vanishing perturbations 
Pn = 0 the exact equation of a harmonic oscillator and particles perform in this 
representation periodic sine-like oscillations with the frequency v 


w = wo cos(w + 4). (8.90) 


The transformation matrix in these variables is given by 


_ (CW) SH) _ ( cos) sin) 
MEeld= (Soy sey) = (singe costes) si 


as can easily be derived from (8.90). 
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Fig. 8.6 Ideal phase ellipse 
in normalized coordinates dw/dy 


The use of normalized coordinates not only allows us to treat particle beam 
dynamics equivalent to a harmonic oscillator but is also convenient in the discus- 
sions of perturbations or aberrations. In phase space each particle performs closed 
trajectories in the form of an ellipse which we called the phase ellipse. In Cartesian 
coordinates this ellipse, however, continuously changes its shape and orientation and 
correlations between two locations are not always obvious. If we use normalized 
coordinates, the unperturbed phase ellipse becomes an invariant circle as shown in 
Fig. 8.6. 


From (8.82) we get with u(z) = a,/ B(z) cos W(z) where w(z) = v¢(z) 


=acosy, (8.92) 


wy) = 
"VB 


d 
OW = Vp + u=-asiny, (8.93) 


dy JB 


and after elimination of the phase the Courant-Snyder invariant becomes 
dw? 
ipa. (=) age (8.94) 


where a is the betatron oscillation amplitude. 

The equation of motion (8.87) is now ready to be transformed into action-angle 
variables. The constancy of the action J is now synonymous with the Courant- 
Snyder invariant (5.59) or the constancy of the beam emittance. 


J = tv (yw? +2auu' + B uw’) = tve. (8.95) 


In (W,J) phase-space, the particle moves along a circle with radius J at a 
revolution frequency v. The motion is uniform, periodic and stable. Including the 
independent variable g to form a three-dimensional phase-space, we find a particle 
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trajectory 
in phase space 


reference path 


Fig. 8.7 Unperturbed particle trajectories in (w, J, ¢) phase-space 


to spiral along the surface of a torus as shown in Fig. 8.7. The ensemble of all 
particles oscillating with the same amplitude J follow spirals occupying the full 
surface of the torus. 

This result is not particularly interesting in itself since it only corroborates what 
we have found earlier for harmonic oscillators with simpler mathematical tools. The 
circle in (wy, J)-phase space, however, provides us with a reference against which to 
compare perturbed motions and derive stability criteria. Indeed, we will later use 
canonical transformations to eliminate well-known linear motions, like the circular 
motion of an unperturbed harmonic oscillator in (y,J)-space to exhibit more 
clearly the effects of perturbation only. Including perturbations into the Hamiltonian 
(5.57) allows the determination of perturbed tunes and study resonance phenomena. 
Having defined canonical variables for the system, we also will be able to study 
the evolution of particle beams by applying Vlasov’s equation in Sect. 12.1. The 
Fokker-Planck equation finally will allow us to determine beam parameters even in 
the presence of statistical events. 

We have chosen the betatron phase w as the independent variable and the 
particles cover one full turn along the phase “ellipse” for each betatron oscillation. 
This is a convenient way of representation in beam transport systems, yet, for 
circular accelerators we find it more useful to define g = y/v as the independent 
variable in which case the particle rotation frequency in phase space is the same as 
that in the ring. This is particularly convenient when we discuss field and alignment 
perturbations which occur periodically in a ring and allow the application of Fourier 
techniques. 


8.4 Dispersive Systems 


Beam guidance and focusing is performed by applying Lorentz forces and the 
effects of these fields on particle trajectories depend on the momentum of the 
particles. So far, we have derived beam dynamics for particles with ideal momenta 
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for which the beam transport system is designed. To properly describe the dynamics 
of a real particle beam we must include chromatic effects caused by an error in the 
beam energy or by a spread of energies within the particle beam. In Sect. 5.5.4; the 
perturbation due to a momentum error has been derived and expressed in terms of a 
dispersion. Continuing the formulation of beam dynamics in terms of transformation 
matrices we derive in this section transformation matrices for particles with a 
momentum error. 


8.4.1 Analytical Solution 


The dispersion function has been derived as a special solution to a chromatic 
perturbation term in (5.81) where 


D2 = / “K@ [S@ C@ —C@) S@] az (8.96) 


describes the dispersion function in a beam transport line. There is no contribution 
to the dispersion function unless there is at least one bending magnet in the beam 
line. Knowledge of the location and strength of bending magnets, together with 
the principal solutions of the equations of motion, we may calculate the dispersion 
anywhere along the beam transport line by integration of (8.96). 

Similar to the matrix formalism for betatron oscillations we would also like to 
apply the same formalism for the dispersion function. For this we note that the 
particle deviation u from the reference path is composed of the betatron motion and 
a displacement due to an energy error u = ug + us. The transformation matrix is 
therefore a composite of both contributions and can be expressed by 


u(z) up (Zo) us (Zo) 
ul(z) |} =M] we (zo) | +M | us (zo) |. (8.97) 
5 5 8 


where M is the 3 x 3 transformation matrix, 6 the relative momentum error and 
us(z) = D(z)é and us(z) = D’(z)é the displacement and slope, respectively, 
of the reference path for particles with a momentum error 6. Equation (8.97) can 
also be applied to the dispersion function alone by setting the betatron oscillation 
amplitudes to zero and the momentum error 6 = 1 for 


D(z) D(zo) 
D(z) |} =M|{ D(a) |. (8.98) 
1 1 
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By determining the transformation matrices for individual bending magnets, we 
are in a position to calculate in matrix formulation the dispersion function anywhere 
along a beam transport line. 

In the deflecting plane of a pure sector magnet the principal solutions are with 
K=«? =1/p’ 


C(z) S(z) \ _ ( cos(kz)_ psin (Kz) 
ee ae = (: sin (kz) cos (kz) ) ; (8.99) 


With p = const we get from (8.96) and (8.99) for the dispersion function within the 
magnet 


D(z) = sin (kz) , cos (kz) dz — cos (kz) fF sin (Kz) dz 
0 0 


= po [1 — cos (kz)] (8.100) 
D'(z) = sin (kz). 


Particles with momentum error 6 follow an equilibrium path given by D(z) 6 
which can be determined experimentally by observing the beam path for two 
different values of the beam momentum 6; and 6. The difference of the two paths 
divided by the momentum difference is the dispersion function D(z) = Au/(52—41). 
In practical applications this is done either by changing the beam energy or by 
changing the strength of the bending magnets. In circular electron accelerators, 
however, only the first method will work since the electrons always adjust the 
energy through damping to the energy determined by the magnetic fields. In circular 
electron accelerators, we determine the dispersion function by changing the rf- 
frequency which enforces a change in the particle energy as we will discuss later 
in Chap. 9. 


8.4.2 3 x 3-Transformation Matrices 


From (8.99) and (8.100) we may form now 3 x 3-transformation matrices. In the 
deflecting plane of a pure sector magnet of arc length € such a transformation 
matrix is 


cos@ psin@ p(1—cos@) 
Map (€|0) = —; sin 6 cos 0 sin 0 (8.101) 
0 0 1 
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where 6 = ¢/p is the deflection angle of the magnet. In the non deflecting plane, 
the magnet behaves like a drift space with , = 0,k = 0 and arc length ¢ 


C(z) S(z) 0 
Mego (€|0) = | C’(z) S'(z) 0 (8.102) 
0 oOo 1 


For a synchrotron magnet of the sector type we get from (7.38) in analogy to 
(8.100), replacing « by VK and with © = Jk + x? and x = 1/p for the case of a 
focusing synchrotron magnet 


a sinO 1—cos@ 

cos O VK pK 
Moye (€|0) = —/K sin © cos @ we (8.103) 

0 0 1 
and for a defocusing synchrotron magnet 
a sinhO coshO—1 

cosh @ WIKI Po V *T 

Moya (€|0) = | V|K| sinh © coshO viel (8.104) 
0 0 1 


where © = ,/ |k + Ke. 


In case of a rectangular magnet without field gradient, we multiply the matrix for 
a sector magnet by the transformation matrices for endfield-focusing. Since these 
end effects act like quadrupoles we have no new contribution to the dispersion and 
the transformation matrices for each endfield are 


1 00 
M,. = | «tan(6/2) 10]. (8.105) 
0 01 


With these endfield matrices the chromatic transformation matrix for a rectangular 
bending magnet in the deflecting plane is obtained from (8.103) with M,, = 
Me Mey,p Me for k = 0 


1 psin@ p (1 —cos8@) 
M,,(€[0) =| 0 1  2tan(@/2) |. (8.106) 
0 O 1 


Similarly, we can derive the transformation matrices for rectangular synchrotron 
magnets. 
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Only bending magnets create a dispersion. Therefore the transformation matrices 
of other magnets or drift spaces are extended to 3 x 3 matrices by adding a third 
column and row with all elements equal to zero and M33 = 1. 


8.4.3 Linear Achromat 


Frequently it is necessary in beam transport systems to deflect a particle beam. If this 
is done in an arbitrary way an undesirable finite dispersion function will remain at 
the end of the deflecting section. Special magnet arrangements exist which allow 
to bend a beam without generating a residual dispersion. Such magnet systems 
composed of only bending magnets and quadrupoles are called linear achromats. 

Consider, for example, an off momentum particle travelling along the ideal 
path of a straight beam line. At some location, we insert a bending magnet and 
the off-momentum particle will be deflected by a different angle with respect to 
particles with correct momenta. The difference in the deflection angle appears as 
a displacement in phase space from the center to a finite value AW = 6D(z)/ VB : 
From here on, the off momentum reference path follows the dispersion function 
D(z) 6 and the particle performs betatron oscillations in the form of circles until 
another bending magnet further modifies or compensates this motion (Fig. 8.8). 

In case a second bending magnet is placed half a betatron oscillation downstream 
from the first causing the same deflection angle the effect of the first magnet can be 
compensated completely and the particle continues to move along the ideal path 
again. A section of a beam transport line with this property is called an achromat. 

Figure 8.9 displays an achromatic section proposed by Panofsky [10] which may 
be used as a building block for curved transport lines or circular accelerators. This 
section is composed of a symmetric arrangement of two bending magnets with a 


motion in phase space 


D(z)8/p'? 


motion in real space : Bossa he 
P dispersion function: D(z)6 


Z 


bending magnet 


Fig. 8.8 Trajectory of an off momentum particle through a chromatic beam transport section 
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Fig. 8.9 Double bend achromat [3, 10] 


quadrupole in the center and is also know as a double bend achromat or a Chasman- 
Green lattice [3]. 

General conditions for linear achromats have been discussed in Sect. 7.4 and we 
found that the integrals 


I= [ c@seee = 0, (8.107) 
0 
and 
I, = [ ace dz = 0, (8.108) 
0 


must vanish for a lattice section to become achromatic. For a double bend achromat 
this can be accomplished by a single parameter or quadrupole if adjusted such that 
the betatron phase advance between the vertex points of the bending magnet is 180°. 
A variation of this lattice, the triple bend achromat [5, 8], is shown in Fig. 8.10, 
where a third bending magnet is inserted for practical reasons to provide more 
locations to install sextupoles for chromatic corrections. Magnet arrangements as 
shown in Figs. 8.9 and 8.10 are dispersion free deflection units or linear achromats. 
This achromat is focusing only in the deflecting plane but defocusing in the 
nondeflecting plane which must be compensated by external quadrupole focusing 
or, since there are no special focusing requirements for the nondeflecting plane, by 
either including a field gradient in the pole profile of the bending magnets [6] or 
additional quadrupoles between the bending magnets. In a beam transport line this 
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Fig. 8.10 Triple bend achromat [5] 


Fig. 8.11 Achromatic beam translation 


achromat can be used for diagnostic purposes to measure the energy and energy 
spread of a particle beam as will be discussed in more detail in Sect. 8.4.5 

A further variation of the lattice in Fig. 8.9 has been proposed by Steffen [10] 
to generate an achromatic beam translation as shown in Fig. 8.11. In this case, 
the total phase advance must be 360° because the integral J, would not vanish 
anymore for reasons of symmetry. We use therefore stronger focusing to make {, 
vanish because both the bending angle and the cosine like function change sign. 
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Achromatic properties are obtained again for parameters meeting the condition [10] 


1 dVkcosy +2 sing 


tan(0/2) +A = ; 
pene Vk dVk sing — 2 cosy 


(8.109) 


where y = \/ké and k, ¢ the quadrupole strength and length, respectively. The need 
for beam translation occurs frequently during the design of beam transport lines. 
Solutions exist to perform such an achromatic translation but the required focusing 
is much more elaborate and may cause significantly stronger aberrations compared 
to a simple one directional beam deflection of the double bend achromat type. 

Utilizing symmetric arrangements of magnets, deflecting achromats can be 
composed from bending magnets only [10]. One version has become particularly 
important for synchrotron radiation sources, where wiggler magnets are used to 
produce high intensity radiation. Such triple bend achromat are composed of a row 
of alternately deflecting bending magnets which do not introduce a net deflection 
on the beam. Each unit or period of such a wiggler magnet (Fig. 8.12) is a linear 
achromat. 

The transformation of the dispersion through half a wiggler unit is the super- 
position of the dispersion function from the first magnet at the end of the second 
magnet plus the contribution of the dispersion from the second magnet. In matrix 
formulation and for hard edge rectangular magnets the dispersion at the end of half 
a wiggler period is 


Dy —po (1 — cos @) 1 2, po (1 — cos 8) 
= : 8.110 
ee Eas 7 01 2 tan (6/2) ( ) 
where p > 0,60 = yp and fy the length of one half wiggler pole (see Fig. 8.12). 
Evaluation of (8.110) gives the simple result 


Dy = 2€y tan(@/2), 
ee (8.111) 


The dispersion reaches a maximum in the middle of the wiggler period and 
vanishes again for reasons of symmetry at the end of the period. For sector magnets 
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we would have obtained the same results. Each full wiggler period is therefore from 
a beam optics point of view a linear achromat. Such an arrangement can also be 
used as a spectrometer by placing a monitor in the center, where the dispersion is 
large. For good momentum resolution, however, beam focusing must be provided 
in the deflecting plane upstream of the bending magnets to produce a small focus at 
the beam monitors as will be discussed in the next section. 

The examples of basic lattice designs discussed in this section are particularly 
suited for analytical treatment. In practice, modifications of these basic lattices are 
required to meet specific boundary conditions making, however, analytical treatment 
much more complicated. With the availability of computers and numerical lattice 
design codes, it is prudent to start with basic lattice building blocks and then use a 
fitting program for modifications to meet particular design goals. 


8.4.4 Spectrometer 


Although the dispersion has been treated as a perturbation it is a highly desired 
feature of a beam line to determine the energy or energy distribution of a particle 
beam. Such a beam line is called a spectrometer for which many different designs 
exist. A specially simple and effective spectrometer can be made with a single 180° 
sector magnet [2, 9]. For such a spectrometer, the transformation matrix is from 
(8.101) 


25 3p 
M (180 ) =| 0 <1 0 1, (8.112) 
i 4 


In this spectrometer all particles emerging from a small target (Fig. 8.13) are 


focused to a point again at the exit of the magnet. The focal points for different 
energies, however, are separated spatially due to dispersion. Mathematically, this is 


Fig. 8.13 Hundred and 
eighty degree spectrometer 


5>0 5=0 5 <0 target 


2: 
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evident since the particle trajectories at the end of the magnet are given by 
x = —X9 + 296, (8.113) 


where xo is the starting point of a particle within the target, and showing different 
positions x for different energies 5. The energy dispersion of the secondary particles 
is Ax = 206. 

The image point is independent of x, and only proportional to 5 with a 
large proportionality factor which allows a large energy resolution. While this 
spectrometer seems to have almost ideal features it is also an example of the 
limitations of perturbation methods. For larger values of 5 of the order of several 
percent higher order terms cannot be neglected anymore. Inclusion of such terms, 
for example, will first tilt and then bend the focal plane at the end of the magnet. 

More sophisticated spectrometers including focusing to accept large emittance 
beams have been devised with special efforts to reduce the effects of aberrations. It 
is not the intend of this text to discuss in detail such designs. More comprehensive 
overviews for spectrometers with further references can be found for example in 
[1, 10]. In the treatment of this spectrometer we have ignored the nondeflecting 
plane. Since there is no focusing, particles are widely spread out in this plane at 
the end of the magnet. Practical versions of this spectrometer, therefore, include a 
focusing term in the nondeflecting plane in such a way that the resulting focusing is 
the same in both planes [7]. 


8.4.5 Measurement of Beam Energy Spectrum 


Frequently it is desirable to determine experimentally the particle energy and energy 
spread. Basically only one bending magnet is needed to perform this experiment. 
The finite beam size of the monochromatic part of the beam will greatly influence 
the resolution of the energy measurement. Optimum resolution is achieved if some 
focusing is included and the measurement is performed at a location, where the 
beam size is small while the dispersion is large. In Fig. 8.14 particle beams at two 
different energies are shown in phase space, where both beam centers are separated 
by the dispersion and its slope. 

In reality no such separation exists since we have a spread of energies rather 
than two distinct energies. This energy spread is mixed with the spread in phase 
space of the beam emittance and beams of different energies can only be separated 
completely if the relative energy difference is at least 


oO ja, (8.114) 
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2 sin0/2 


bending magnet S=Sy 
quadrupole 
Fig. 8.15 Measurement of the momentum spectrum 
where Ey, = 1/€f is the beam envelope. To maximize the energy resolution the 


beam size Ey should be small and the dispersion D(z) large. From Fig. 8.14 we note 
therefore, that for a given beam emittance and dispersion the energy resolution can 
be improved significantly if the measurement is performed at or close to a beam 
waist, where 8 reaches a minimum. 

To derive mathematical expressions for the energy resolution and conditions for 
the maximum energy resolution 1 /dmin we assume a beam line as shown in Fig. 8.15 
with the origin of the coordinate system z = 0 in the center of the bending magnet. 
The salient features of this beam line is the quadrupole followed by a bending 
magnet. With this sequence of magnets we are able to focus the particle beam in 
the deflection plane while leaving the dispersion unaffected. In case of a reversed 
magnet sequence the dispersion function would be focused as well compromising 
the energy resolution. 
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Transforming the dispersion (8.100) back from the end of the sector bending 
magnet to the middle of the magnet we get the simple result 


Do\ _ ( cos$ —posin§ \ ( po (1—cos6))\ _ 0 
(3) 7 (i cos 4 sin 0 =o une |? (8.115) 
ry) 5 


The dispersion appears to originate in the middle of the magnet with a slope 
Di) = 2sin6/2. At a distance z from the middle of the bending magnet the 
betatron function is given by B(z) = Bo — 2a0z + yoz where (Bo, a, Yo) are 
the Twiss functions in the middle of the bending magnet, and the dispersion is 
D(z) = 2 sin(@/2)z. Inserting these expressions into (8.114) we can find the 
location z,, for maximum momentum resolution by differentiating d,in with respect 
to z. Solving démin/dz = 0 for z we get 


i= Po (8.116) 


ao 


and the maximum momentum resolution is 


= Borin?) (8.117) 


The best momentum resolution for a beam with emittance € is achieved if both the 
bending angle @ and the betatron function Bo in the middle of the bending magnet 
are large. From condition (8.116), we also find a > 0 which means that the beam 
must be converging to make a small spot size at the observation point downstream of 
the bending magnet. With (8.76) we find that z,, = Bo/ao = —f,,/o,, and from the 
beam envelope E; = €f,, at z = zy, we get the derivative 2E,E), = «Bi, = —2ea,. 
With this and D/D’ = z, the optimum place to measure the energy spread of a 
particle beam is at 


_ DZ) = Ev (Zn) 
Diy) = EG) 


<M (8.118) 


It is interesting to note that the optimum location z,, is not at the beam waist, where 
A(z) reaches a minimum, but rather somewhat beyond the beam waist, where D/ af B 
is maximum. 

At this point we may ask if it is possible through some clever beam focusing 
scheme to improve this resolution. Analogous to the previous derivation we look 
for the maximum resolution 651, = D(z)/[2 /€B(2)]. The dispersion is expressed 
in terms of the principal solution D(z) = S(z) D'(0) and D’(z) = S’(z) D'(0) since 
D(0) = 0. The betatron function is given by B(z) = C?(z) Bo — 2 C(z)S(z) a + 
S?(z) yo and the condition for maximum resolution turns out to be a/B = —D’/D. 
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With this, we get the resolution 


a = _ es 


Sain = 8.119 
and finally with S(z) = ./ BoB(z) sin w(z) 
Sinin = vhosin) sin w(z) < —_— (8.120) 


which is at best equal to result (8.117) for y(z) = 90°. The momentum resolution is 
never larger than in the simple setup of Fig. 8.15 no matter how elaborate a focusing 
lattice is employed. 

If more than one bending magnet is used the resolution may be increased if the 
betatron phases between the magnets y(z;) and the place of the measurement w(z,, ) 
are chosen correctly. The resolution then is 


Sain = ae V Boi sin(6;/2) sin[y (a) — vz]. (8.121) 


where the sum is taken over all magnets 7. Such an energy resolving system is 
often used in beam transport lines to filter out a small energy band of a particle 
beam with a larger energy spread. In this case a small slit is placed at the place for 
optimum momentum resolution (z = z,,). Of course, for highly relativistic electrons 
the momentum spectrum is virtually equal to the energy spectrum. 

This discussion is restricted to linear beam optics which does not address prob- 
lems caused by nonlinear effects and geometric as well as chromatic aberrations. 


8.4.6 Path Length and Momentum Compaction 


The existence of different reference paths implies that the path length between 
two points of a beam transport line may be different as well for different particle 
momenta. We will investigate this since the path length is of great importance as 
will be discussed in detail in Chap. 9. In preparation for this discussion, we derive 
here the functional dependencies of the path length on momentum and focusing 
lattice. 

The path length along a straight section of the beam line depends on the angle 
of the particle trajectory with the reference path. In this chapter we are interested 
only in linear beam dynamics and may neglect such second order corrections to the 
path length. The only linear contribution to the path length comes from the curved 
sections of the beam transport line. The total path length is therefore given by 


= / aed. (8.122) 
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We evaluate (8.122) along the reference path, where x = D(z) 6. First we find the 
expected result Lo = f dz for 5 = 0, which is the ideal design length of the beam 
line or the design circumference of a circular accelerator. The deviation from this 
ideal length is then 


AL=6 / K (z) D(z) dz. (8.123) 


The variation of the path length with momentum is determined by the momentum 
compaction factor, defined by 


eee ee Pe a (8.124) 


er 
) Pp 


Its numerical value can be calculated with (8.123) and is 


Lo 
A = a K (z) D(z) dz = (Po). (8.125) 


In this approximation the path length variation is determined only by the dispersion 
function in bending magnets and the path length depends only on the energy of the 
particles. To prepare for the needs of longitudinal phase focusing in Chap. 9, we will 
not only consider the path length but also the time it takes a particle to travel along 
that path. If Z is the path length, the travel time is given by 


L 
tS 8.126 
cB ( ) 
Here 6 = v/c is the velocity of the particle and is not to be confused with the 


betatron function. The variation of t gives by logarithmic differentiation 


At _ AL _ AB (8.127) 
Ce a , 


With AL/L = a.é and cp = BE we get dp/p = dB/B + dE/E and with dE/E = 


B?dp/p we can solve for dB/B = (1/y?) dp/p, where y = E/mc? is the relativistic 
factor. From (8.127) we have then 


A 1 d d 
Ar (4 =) nec (8.128) 


and call the combination 


Ne = (= =) (8.129) 
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the momentum compaction. The energy 


1 
Jae 


for which the momentum compaction vanishes is called the transition energy which 
will play an important role in phase focusing. Below transition energy the arrival 
time is determined by the actual velocity of the particles while above transition 
energy the particle speed is so close to the speed of light that the arrival time of 
a particle with respect to other particles depends more on the path length than on 
its speed. For a circular accelerator we may relate the time t, a particle requires to 
follow a complete orbit to the revolution frequency w, and get from (8.128) 


u= (8.130) 


; t P 


For particles above transition energy this quantity is negative which means a particle 
with a higher energy needs a longer time for one revolution than a particle with a 
lower energy. This is because the dispersion function causes particles with a higher 
energy to follow an equilibrium orbit with a larger average radius compared to the 
radius of the ideal orbit. 

By special design of the lattice one could generate an oscillating dispersion 
function in such a way as to make the momentum compaction 7, to vanish. Such 
a transport line or circular accelerator would be isochronous to the approximation 
used here. Due to higher order aberrations, however, there are nonlinear terms in 
the dispersion function which together with an energy spread in the beam cause a 
spread of the revolution frequency compromising the degree of isochronicity. These 
higher order corrections are discussed later in Chap. 9.4.1. 


Problems 


8.1 (S). Particle trajectories in phase space follow the shape of an ellipse. Derive a 
transformation of the phase space coordinates (u, u’) to coordinates (w, w) such that 
the particle trajectories are circles with the radius Be. 


8.2 (S). Use (8.18) for the phase ellipse and prove that the area enclosed by the 
ellipse is indeed equal to ze. 


8.3 (S). Show that the transformation of the beam matrix (8.41) is consistent with 
the transformation of the lattice functions. 


8.4 (S). Sometimes two FODO channels of different parameters must be matched. 
Show that a lattice section can be designed with a phase advance of Ay, = Ay = 
zt/2, which will provide the desired matching of the betatron functions from the 
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symmetry point of one FODO channel to the symmetry point of the other channel. 
Such a matching section is also called a quarter wavelength transformer. Does this 
transformer also work for curved FODO channels, where the dispersion is finite? 


8.5. Construct a beam bump like in problem 7.6 but now use betatron and phase 
functions for the solution. What are the criteria for either Ay being the maximum 
displacement or not? For which phase wy would the dipole fields be minimum? Is 
there a more economic solution for a symmetric beam bump with an amplitude Ay 
in the center of QD2? 


8.6. Consider a ring made from an even number of FODO cells. To provide 
component free space we cut the ring along a symmetry line through the middle 
of two quadrupoles on opposite sides of the ring and insert a drift space of length 
£4. Derive the transformation matrix for this ring and compare with that of the 
unperturbed ring. What is the tune change of the accelerator. The betatron functions 
will be modified. Derive the new value of the horizontal betatron function at the 
symmetry point in units of the unperturbed betatron function. Is there a difference 
to whether the free section is inserted in the middle of a focusing or defocusing 
quadrupole? How does the 7-function change? 


8.7. Consider a regular FODO lattice, where some bending magnets are eliminated 
to provide magnet free spaces and to reduce the 7-function in the straight section. 
How does the minimum value of the 7-function scale with the phase per FODO cell. 
Show if conditions exist to match the 7-function perfectly in the straight section of 
this lattice? 
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Chapter 9 
Longitudinal Beam Dynamics 


In previous chapters we have concentrated the discussion on the interaction of 
transverse electrical and magnetic fields with charged particles and have derived 
appropriate formalisms to apply this interaction to the design of beam transport 
systems. The characteristics of these transverse fields is that they allow to guide 
charged particles along a prescribed path but do not contribute directly to the 
energy of the particles through acceleration. For particle acceleration we must 
generate fields with nonvanishing force components in the direction of the desired 
acceleration. Such fields are called longitudinal fields or accelerating fields. In a 
very general way we describe in this section the interaction of longitudinal electric 
fields with charged particles to derive the process of particle acceleration, its scaling 
laws, and its stability limits. 

The usefulness and application of electric fields to accelerate charged particles 
depends greatly on the temporal variations of these fields. Accelerating fields 
can be static or pulsed or they may be electromagnetic fields oscillating at high 
frequencies. Conceptually, the most simple way to accelerate charged particles is 
through a static field applied to two electrodes as shown in Fig.9.1. In this case, 
the total kinetic energy a particle can gain while traveling from one electrode to 
the other is equal to the product of the particle charge and the voltage between the 
electrodes. 

Electric breakdown phenomena, however, limit the maximum applicable voltage 
and thereby the maximum energy gain. Nonetheless, this method is intriguingly 
simple and efficient compared to other accelerating methods and therefore still 
plays a significant role among modern particle accelerators, for example, in particle 
sources. Electrostatic acceleration schemes are specifically useful for low energy 
particles for which other methods of acceleration would be inefficient. Higher 
voltages and particle energies can be reached if the electric fields are applied in the 
form of very short pulses. Application of electro-static high voltages to accelerate 
particles is limited to some 10 million volts due to high voltage breakdown. 
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Fig. 9.1 Principle of 
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For higher particle energies different acceleration methods must be used. The 
most common and efficient way to accelerate charged particles to high energies 
is to use high frequency electromagnetic fields in specially designed accelerating 
structures. Acceleration to high energies occurs while charged particles either pass 
once through many or many times through one or few accelerating structures each 
excited to electric field levels below the break down threshold. In this section, we 
concentrate the discussion on charged particle acceleration by electromagnetic radio 
frequency fields. 


9.1 Longitudinal Particle Motion 


Application of radio frequency in short rf-fields has become exceptionally effective 
for the acceleration of charged particles. Both, fields and particle motion can be 
synchronized in an effective way to allow the acceleration of charged particles in 
principle to arbitrary large energies were it not for other limitations. 

The first idea and experiment for particle acceleration with radio frequency 
fields has been published by Ising [1] although he did not actually succeed to 
accelerate particles due to an inefficient approach to rf-technology. Later Wideroe 
[2] introduced the concept of generating the accelerating fields in resonating rf- 
cavities and was able to accelerate heavy ions. Original papers describing these 
and other early developments of particle acceleration by rf-fields are collected in a 
monogram edited by Livingston [3]. 

To study the interaction of electromagnetic rf-fields with charged particles, we 
assume a plane electromagnetic wave of frequency w propagating in the z-direction. 
A free electromagnetic wave does not have a longitudinal electric field component 
and therefore a special physical environment, called the accelerating structure, 
must be provided to generate accelerating field components in the direction of 
propagation. As will be discussed later in Sect. 18.1 this is achieved by proper 
choice of boundary conditions. To study particle dynamics in longitudinal fields, 
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we assume that we were able to generate rf-fields with an electric field component 
along the path of the particles expressed by v + c 


E(z,t) = Eg? @® = Ene’ , (9.1) 


where the phase y = wt — kz. The particle momentum changes at a rate equal to 
the electric force exerted on the particle by the rf-field 


dp d 
4 7 CE) = glume). (9.2) 
Multiplying this with the particle velocity we get the rate of change of the 
kinetic energy, dExin = cB dp. Integration of (9.2) with respect to the time becomes 
unnecessarily complicated for general fields because of the simultaneous variation 
of the electric field and particle velocity with time. We therefore integrate (9.2) with 
respect to the longitudinal coordinate and obtain instead of the momentum gain the 
increase in the kinetic or total energy for the complete accelerating structure 


AE = (y—m)me? =e f BQpee, (9.3) 


where yo mc” is the energy of the particle before acceleration. Of course, the trick 
to integrate the electric field through the accelerating section rather than over time 
following the particle is only a conceptual simplification and the time integration 
will have to be executed at some point. Generally this is done when the particular 
accelerating section, the fields, and the synchronization is known. 

Travelling electromagnetic waves are used in linear accelerators and the acceler- 
ating structure is designed such that the phase velocity of the wave is equal to the 
velocity of the particles to be accelerated. In this case, the particle travels along the 
structure in synchronism with the wave and is therefore accelerated or decelerated 
at a constant rate. Maximum acceleration is obtained if the particles ride on the crest 
of the wave. 

In a standing wave accelerating section the electric field has the form 


E(z,t) = Eo(e"*? (9.4) 


where 6 is the phase at the moment the particle enters the accelerating section at 
t = 0. When we refer to an accelerating voltage V in a standing wave cavity we 
mean to say a particle traveling close to the speed of light through the cavity will 
gain a maximum kinetic energy of eV while passing the cavity center at the moment 
the field reaches its crest. Such a particle would enter the cavity some time before 
the field reaches a maximum and will exit when the field is decaying again. For 
slower particles the energy gain would be lower because of the longer transit time. 
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Successful particle acceleration depends on stable and predictable interaction of 
charged particles and electromagnetic fields. Because oscillating rf-fields are used, 
special criteria must be met to assure systematic particle acceleration rather than ran- 
dom interaction with rf-fields producing little or no acceleration. The constructive 
interaction of particles and waves have been investigated in 1945 independently by 
Veksler [4] and McMillan [5] leading to the discovery of the fundamental principle 
of phase focusing. In this subsection, we will derive the physics of phase focusing 
and apply it to the design of particle accelerators. 

The degree of acceleration depends on the momentary phase w of the field 
as seen by the particle while travelling through or with an electromagnetic field. 
Straight superposition of an electromagnetic wave and charged particle motion will 
not necessarily lead to a net acceleration. In general, the particles are either too 
slow or too fast with respect to the phase velocity of the wave and the particle 
will, during the course of interaction with the electromagnetic wave, integrate over 
a range of phases and may gain little or no net energy from the electric fields. 
Therefore, special boundary conditions for the accelerating rf-wave must be met 
such that maximum or at least net acceleration can be achieved. This can be done by 
exciting and guiding the electromagnetic waves in specially designed accelerating 
structures designed such that the phase velocity of the electromagnetic wave is equal 
to the particle velocity. Only then can we choose a specific phase and integration of 
(9.3) becomes straightforward for particles travelling in the direction of propagation 
of the electromagnetic waves. 

For practical reasons, specifically in circular accelerators, particle acceleration 
occurs in short, straight accelerating sections placed along the particle path. In 
this case no direct traveling wave exists between adjacent accelerating sections and 
specific synchronicity conditions must be met for the fields in different accelerating 
sections to contribute to particle acceleration as desired. For the purpose of 
developing a theory of stable particle acceleration we may imagine an rf-wave 
traveling along the path of the particle with a phase velocity equal to the particle 
velocity and an amplitude which is zero everywhere except in discrete accelerating 
cavities. 

To ensure proper synchronization one could assume that every rf-cavity is 
powered by its own microwave source. This is done often in high power rf-cavities, 
but is, for example, impractical in linear accelerators. For the case of individual 
power sources the phase of the rf-field can be chosen in each cavity such that its 
voltage reaches the desired value at the moment the particles pass through. The 
synchronisation for many cavities fed by one power source is more complicated and 
we will discuss in the following paragraphs how to accomplish this. 

We consider a number of rf-cavities powered by a single microwave source. 
To derive the synchronicity conditions, we consider first two accelerating sections 
separated by the distance L as shown in Fig.9.2. Once the proper operating 
conditions are known for two sections a third section may be added by applying 
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the same synchronicity condition between each pair of cavities. The successive 
accelerating sections need not necessarily be a physically different sections but 
could be the same section or the same sections passed through by the particles at 
periodic time intervals. For example, the distance L between successive accelerating 
sections may be equal to the circumference of a circular accelerator. 

For systematic acceleration the phase of the rf-fields in each of the accelerating 
sections must reach specific values at the moment the particles arrive. If the phase 
of the fields in each of N accelerating sections is adjusted to be the same at the time 
of arrival of the particles, the total acceleration is N times the acceleration in each 
individual section. This phase is called the synchronous phase y, defined by 


W, = wt — kz = const, (9.5) 


where w is the oscillating frequency of the electromagnetic field. The time derivative 
of (9.5) vanishes and the synchronicity condition is 


VWs =ao—kBc=0, (9.6) 


since dz/dt = Bc. This condition can be met if we set 


i (9.7) 
= : 
and the frequency of the electromagnetic field is then from (9.6) 
21 20 
OQ, = ky Be = Fad = AT Fi (9.8) 


where @, is the lowest frequency satisfying the synchronicity condition and AT 
is the time needed for particles with velocity Bc to travel the distance L. This 
equation relates the time of travel between successive accelerating sections with 
the frequency of the accelerating rf-fields in a conditional way to assure systematic 
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particle acceleration and the relation (9.8) is therefore called the synchronicity 
condition. 

However, any integer multiple of the frequency a, satisfies the synchronicity 
condition as well and we may instead of (9.8) define permissible frequencies of the 
accelerating rf-fields by 


20 20 
On = hoy = ky Be = ~~ hBe = Toh, (9.9) 


where / is an integer called the harmonic number with k, = hk. 

The synchronicity condition must be fulfilled for any spatial arrangement of the 
accelerating structures which are powered by a single microwave source to get the 
maximum acceleration. To illuminate the principle, we assume here, for example, 
a series of short, equidistant accelerating gaps or accelerating sections along the 
path of a particle. Let each of these gaps be excited by its own power source to 
produce an accelerating rf-field at some random phase. The synchronicity condition 
(9.8) is fulfilled if the rf-frequency is the same in each of these gaps, which are 
separated by an integer multiple of rf-wavelength. However, it does not require each 
accelerating gap to have the same rf-phase at the arrival time of the particles. Each 
cavity in a set of accelerating cavities oscillating at the same frequency may be tuned 
to an arbitrary rf-phase and the synchronicity condition still would be met. From a 
practical point of view, however, it is inefficient to choose arbitrary phases and it is 
more reasonable to adjust the phase in each cavity to the optimum phase desired for 
maximum acceleration. 

The assumption that the rf-frequency of all cavities be the same is unneces- 
sarily restrictive considering that any harmonic of the fundamental frequency is 
acceptable. Therefore, a set of accelerating cavities in a circular accelerator, for 
example, may include cavities resonating at any harmonic of @,. This is sometimes 
done to achieve specific effects (e.g. bunch lengthening), but in the absence of such 
requirements makes only complicates the Rf-system. 

A straightforward application of the synchronicity condition can be found in 
the design of the Wideroe linear accelerator structure [2] as shown in Fig. 9.3. 
Here the fields are generated by an external rf-source and applied to a series of 
metallic drift tubes. Accelerating fields build up at gaps between the tubes while 
the tubes themselves serve as a field screens for particles during the time the 
electric fields is changing sign and would be decelerating. The length of the field 
free drift tubes is determined by the velocity of the particles and is L = cB Ty 
where 7; is the period of the rf-field. As the particle energy increases so does the 
velocity cB and the length L of the tube must increase too. Only when the particles 
become highly relativistic will the distance between field free drift sections become 
a constant together with the velocity of the particles. Structures with varying drift 
lengths are generally found in low energy proton or ion accelerators based on the 
Alvarez structure [6], which is a technically more efficient version of the Wideroe 
structure. 
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Fig. 9.3 Wideroe linac structure 


For electrons it is much easier to reach relativistic energies where the velocity is 
sufficiently constant such that in general no longitudinal variation of the accelerating 
structure is needed. In circular accelerators, we cannot adjust the distance between 
cavities or the circumference as the particle velocity 6 increases. The synchronicity 
condition therefore must be applied differently. From (9.9) we find the rf-frequency 
to be related to the particle velocity and distances between cavities. Consequently 
we have the relation 


pAyh = L, (9.10) 


which requires that the distance between any pair of accelerating cavities be an 
integer multiple of BA,r. Since L and h are constants, this condition requires 
that the rf-frequency be changed during acceleration proportional to the particle 
velocity 6. Only for particles reaching relativistic energies, when 6 ~ 1, will the 
distance between cavities approach an integer multiple of the rf-wave length and the 
circumference C must then meet the condition 


C= Bhi. (9.11) 


9.2 Equation of Motion in Phase Space 


So far, we have assumed that both the particle velocity 6 and the wave number 
k are constant. This is not a valid general assumption. For example, we cannot 
assume that the time of flight from one gap to the next is the same for all 
particles. For low energy particles we have a variation of the time of flight due 
to the variation of the particle velocities for different particle momenta. The wave 
number k or the distance between accelerating sections need not be the same 
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for all particles either. A momentum dependent path length between accelerating 
sections exists if the lattice between such sections includes bending magnets. As 
a consequence, the synchronicity condition must be modified to account for such 
chromatic effects. 

Removing the restriction of a constant wave number k, we obtain by a variation 
of (9.6) 


: : ‘ k 
Aw =w—w.= —A(kBc) = —ckAB — pee OP (9.12) 
, Op ot 
where 
20 20 wo 
k=k=h = =/ ; 9.13 
h ce la h Be (9.13) 


and Lo is the distance between accelerating gaps along the ideal path. The 
synchronous phase is kept constant yy, = const or We = 0 and serves as the 
reference phase against which all deviations are measured. 

The momentum dependence of the wave number comes from the fact that the 
path length L between accelerating gaps may be different from Lo for off momentum 
particles. The variation of the wave number with particle momentum is therefore 


_ 0k OL 
»  oLop 


ok 
dp 


ky OL 
0 Lo Op 


=-—a, (9.14) 
0 Po 


where a, is the momentum compaction factor. We evaluate the momentum com- 
paction factor starting from the path length L = ; dA + f) dz. For transverse 
particle motion x = xg +n (Ap/po) and employing average values of the integrands 
the integral becomes 


x A 
L=lo+{2)t9+ (2) ro, (9.15) 

p PI Po 
Because of the oscillatory character of the betatron motion (k,xg) = 0. The 
relative path length variation is 7 = (2) 22 a 04. <P and the momentum 

compaction factor becomes 
—_ (2). (9.16) 
p 


The momentum compaction factor increases only in curved sections where p # 0 
and the path length is longer or shorter for higher energy particles depending on the 
dispersion function being positive or negative, respectively. For a linear accelerator 
the momentum compaction factor vanishes since the length of a straight line does 


9.2 Equation of Motion in Phase Space 261 


not depend on the momentum. With (dp/dt)At = Ap and mcy*AB = Ap we get 
finally for (9.12) with (9.14) and after some manipulation 


2 A 
w = —Beky(y~ — 0) (9.17) 


The term y~” in (9.17) appears together with the momentum compaction factor 
a, and therefore has the same physical relevance. This term represents the variation 
of the particle velocity with energy. Therefore, even in a linear accelerator where 
a, = 0, the time of flight between accelerating gaps is energy dependent as long as 
particles are still nonrelativistic. 

After differentiation of (9.17) with respect to the time, we get the equation 
of motion in the longitudinal direction describing the variation of the phase 
with respect to the synchronous phase yw, for particles with a total momentum 
deviation Ap 


b+ (Actun. <= } =0. (9.18) 
CPo 


In most practical applications, parameters like the particle velocity 6 or the 
energy vary only slowly during acceleration compared to the rate of change of the 
phase and we consider them for the time being as constants. The slow variation of 
these parameters constitutes an adiabatic variation of external parameters for which 
Ehrenfest’s theorem holds. The equation of motion in the potential of the rf-field 
becomes in this approximation 


is kino O 
fp PO nyc (9.19) 
cpo ot 


Integration of the electrical fields along the accelerating sections returns the kinetic 
energy gain per turn 


e i E(W)dz = eV(W), (9.20) 
L 


where V(y) is the total particle accelerating voltage seen by particles along the 
distance L. For particles with the ideal energy and following the ideal orbit the 
acceleration is eV(w,) where y, is the synchronous phase. 

Acceleration, however, is not the only source for energy change of particles. 
There are also gains or losses from, for example, interaction with the vacuum 
chamber environment, external fields like free electron lasers, synchrotron radiation 
or anything else exerting longitudinal forces on the particle other than accelerating 
fields. We may separate all longitudinal forces into two classes, one for which the 
energy change depends only on the phase of the accelerating fields V(y) and the 
other where the energy change depends only on the energy of the particle U(E) 
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itself. The total energy gain AE per unit time or per turn is the composition of both 
types of external effects 


AE = eV() — U(E), (9.21) 


where U(E) is the energy dependent loss per turn due, for example, to synchrotron 
radiation. 


9.2.1 Small Oscillation Amplitudes 


For arbitrary variations of the accelerating voltage with time we cannot further 
evaluate the equation of motion unless the discussion is restricted to small variations 
in the vicinity of the synchronous phase. While the ideal particle arrives at the 
accelerating cavities exactly at the synchronous phase 7, most other particles in 
a real beam arrive at slightly different phases. For small deviations g from the 
synchronous phase, 


g=V-Vs, (9.22) 


we can expand the accelerating voltage into a Taylor series at yy = yw, and get for 
the average rate of change of the particle energy with respect to the energy of the 
synchronous particle from (9.20) 


d 1 dv du 
di To . (Hs) te dy)” (£0) — aE 


at] ; (9.23) 


where the particle energy E = Ey + AE and Tp is the time of flight between adjacent 
cavities for the reference particle 


(i (9.24) 
At equilibrium eV(Ws) = U(Eo), and since B Acp = AE, we get with (9.23) and 
~ = w from (9.19) the equation of motion or phase equation 


cknne dV 1 dU 


cPo To dw Ws " Ps dE 


Acp _ 


Eo cPo 


(9.25) 


With (9.17) and w = Ws; + @ Eq. (9.25) becomes the differential equation of 
motion for small phase oscillations 


G + 20,0 + 2°9 =0, (9.26) 
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where the damping decrement a, is defined by 


1 dU 
=-— — (9.27) 
2T) dE |x, 
and the synchrotron frequency by 
ky ne dV 
gs eh, = (9.28) 


cpoTo dW |y, , 


Particles orbiting in a circular accelerator perform longitudinal oscillations with 
the frequency S2. These phase oscillations are damped or antidamped depending on 
the sign of the damping decrement. Damping occurs only if there is an energy loss 
which depends on the particle energy itself as in the case of synchrotron radiation. In 
most cases of accelerator physics we find the damping time to be much longer than 
the phase oscillation period and we may therefore discuss the phase equation while 
ignoring damping terms. Whenever damping becomes of interest, we will include 
this term again. 

This phase equation is valid only for small oscillation amplitudes because only 
the linear term has been used in the expansion for the rf-voltage. For larger 
amplitudes this approximation cannot be made anymore and direct integration of the 
differential equation is necessary. The small amplitude approximation, however, is 
accurate to describe most of the fundamental features of phase oscillations. At large 
amplitudes, the nonlinear terms will introduce a change in the phase oscillation 
frequency and finally a limit to stable oscillations to be discussed later in this 
chapter. 

The phase equation has the form of the equation of motion for a damped 
harmonic oscillator and we will look for conditions leading to a positive frequency 
and stable phase oscillations. Because the phase equation was derived first for 
synchrotron accelerators the oscillations are also called synchrotron oscillations and 
are of fundamental importance for beam stability in all circular accelerators based 
on rf-acceleration. For real values of the oscillation frequency we find that particles 
which deviate from the synchronous phase are subjected to a restoring force leading 
to harmonic oscillations about the equilibrium or synchronous phase. From the 
equation of motion (9.25) it becomes clear that phase focusing is proportional to the 
derivative of the accelerating voltage rather than to the accelerating voltage itself 
and is also proportional to the momentum compaction 77¢.. 

To gain further insight into the phase equation and determine stability criteria, 
we must make an assumption for the waveform of the accelerating voltage. In most 
cases, the rf-accelerating fields are created in resonant cavities and therefore the 
accelerating voltage can be expressed by a sinusoidal waveform 


Viv) = Vosinw (9.29) 
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and expanded about the synchronous phase to get with Ww = Ww + @ 
Viv + 9) = Vo (sin y, cos gy + sing cos Wy). (9.30) 
Keeping only linear terms in g the phase equation is 
@ + 229 =0, (9.31) 
where the synchrotron oscillation frequency becomes now 


kutle « 
Q? = She 2%, cos vy. (9.32) 


cpoTo 


A particle passing periodically through localized and synchronized accelerating 
fields along its path performs synchrotron oscillations with the frequency 2 about 
the synchronous phase. 

In circular accelerators we have frequently the situation that several rf-cavities 
are employed to provide the desired acceleration. The reference time 7p is most 
conveniently taken as the revolution time and the rf-voltage Vo is the total 
accelerating voltage seen by the particle while orbiting around the ring once. The 
rf-frequency is an integer multiple of the revolution frequency, 


Sut = h frev (9.33) 


where the integer / is the harmonic number and the revolution frequency is with the 
circumference C 


1 Cc 
v= > =: 9.34 
aa eae, (9.34) 
From (9.32) the synchrotron frequency is in more practical units 
hncev , 
R= Pre aed (9.35) 


2x Bcpo 


Similar to the betatron oscillation tunes, we define the synchrotron oscillation 
tune or short the synchrotron tune as the ratio 


Q 


rev 


Vs = 


(9.36) 


For real values of the synchrotron oscillation frequency the phase equation assumes 
the simple form 


y = § cos(2t+ yi), (9.37) 
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Fig. 9.4 Synchrotron oscillations in phase space for stable motion (2? > 0) (/eft) and for unstable 
motion (2? < 0) (right) 


where yj; is an arbitrary phase function for the particle i at time tf = 0. With W =@ 
we find from (9.17), (9.32) the relation between the momentum and phase deviation 
for real values of the synchrotron oscillation frequencyv, ~ 0.001 — 0.01 


A 


5 Acp _ Q QOD 


CPo 7 hOrey Ne = hOyey Ne 


sin (2t + yi). (9.38) 


The particle momentum deviation, being the conjugate variable to the phase, 
also oscillates with the synchrotron frequency about the ideal momentum. Both, 
the phase and momentum oscillations describe the particle motion in longitudinal 
phase spaceas shown in Fig. 9.4 for stable and unstable synchrotron oscillations, 
respectively. At the time fo when in (9.38) the phase {2% + x; = 0 and we expect 
the momentum deviation to be zero while the phase reaches the maximum value @. 
Thus both oscillations are 90° out of phase. Particles with a negative momentum 
compaction 7, < 0 move clockwise in phase space about the reference point while 
a positive momentum compaction causes the particles to rotate counter clockwise. 

The same process that has led to phase focusing will also provide the focusing 
of the particle momentum. Any particle with a momentum different from the 
ideal momentum will undergo oscillations at the synchrotron frequency which are 
described by 6 = —§ sin (2t+ xi), where the maximum momentum deviation is 
related to the maximum phase excursion @ by 


@. (9.39) 


By inverse deduction we may express the momentum equation similar to the 
phase equation (9.31) and get with Ap/po = 6 the differential equation for the 
momentum deviation 


da 
: +28 =0. (9.40) 
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Similar to the transverse particle motion, we eliminate from (9.37), (9.38) the 
argument of the trigonometric functions to obtain an invariant of the form 


§2 


yp” * 
+5 =1 with $= @, (9.41) 
Fe 


ret 
i) 


where the sign is chosen to indicate stable or unstable motion depending on whether 
the synchrotron oscillation frequency {2 is real or imaginary respectively. The 
trajectories for both cases are shown in Fig.9.4. Clearly, the case of imaginary 
values of the synchrotron oscillation frequency leads to exponential growth in the 
oscillation amplitude. 


9.2.2 Phase Stability 


The synchrotron oscillation frequency must be real and the right-hand side of (9.32) 
must therefore be positive to obtain stable solutions for phase oscillations. All 
parameters in (9.32) are positive quantities except for the momentum compaction 
N¢ and the phase factor cos y,. For low particle energies the momentum compaction 
is in general positive because y~” > a, but becomes negative for higher particle 
energies. The energy at which the momentum compaction changes sign is called the 
transition energydefined by 


1 
Ye = Va, 


Since the momentum compaction factor for circular accelerators is approxi- 
mately equal to the inverse horizontal tune a, ~ v~*, we conclude that the transition 
energy Yr is of the order of the tune and therefore in general a small number 
reaching up to the order of a hundred for very large accelerators. For electrons, the 
transition energy is of the order of a few MeV and for protons in the GeV regime. In 
circular electron accelerators the injection energy always is selected to be well above 
the transition energy and no stability problems occur during acceleration since the 
transition energy is not crossed. Not so for protons. Proton linear accelerators with 
an energy of the order of 10 GeV or higher are very costly and therefore protons and 
ions in general must be injected into a circular accelerator below transition energy. 

The synchronous rf-phase must be selected depending on the particle energy 
being below or above the transition energy. Stable phase focusing can be obtained 
in either case if the rf-synchronous phase is chosen as follows 


(9.42) 


0<W<F for y<YVer, 


9.43 
5< Ws <7 for y> Vr. ( ) 
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In a proton accelerator with an injection energy below transition energy the rf- 
phase must be changed very quickly when the transition energy is being crossed. 
Often the technical difficulty of this sudden change in the rf-phase is ameliorated by 
the use of pulsed quadrupoles [7, 8], which is an efficient way of varying momen- 
tarily the momentum compaction factor by perturbing the dispersion function. A 
sudden change of a quadrupole strength can lower the transition energy below the 
actual energy of the particle. This helpful “perturbation” lasts for a small fraction of 
a second while the particles are still being accelerated and the rf-phase is changed. 
By the time the quadrupole pulse terminates, the rf-phase has been readjusted and 
the particle energy is now above the unperturbed transition energy. 

In general, we find that a stable phase oscillation for particles under the influence 
of accelerating fields can be obtained by properly selecting the synchronous phase 
Ws in conjunction with the sign of the momentum compaction such that 


2? > 0. (9.44) 


This is the principle of phase focusing [5] and is a fundamental process to obtain 
stable particle beams in circular high-energy accelerators. An oscillating acceler- 
ating voltage together with a finite momentum compaction produces a stabilizing 
focusing force in the longitudinal degree of freedom just as transverse magnetic or 
electric fields can produce focusing forces for the two transverse degrees of freedom. 
With the focusing of transverse amplitudes we found a simultaneous focusing of its 
conjugate variable, the transverse momentum. The same occurs in the longitudinal 
phase where the particle energy or the energy deviation from the ideal energy is 
the conjugate variable to the time or phase of a particle. Both variables are related 
by (9.17) and a focusing force not only exists for the phase or longitudinal particle 
motion but also for the energy keeping the particle energy close to the ideal energy. 

Focusing conditions have been derived for all six degrees of freedom where the 
source of focusing originates either from the magnet lattice for transverse motion 
or from a combination of accelerating fields and a magnetic lattice property for 
the energy and phase coordinate. The phase stability can be seen more clearly by 
observing the particle trajectories in phase space. Equation (9.31) describes the 
motion of a pendulum with the frequency §2 which, for small amplitudes sing ~ @ 
becomes equal to the equation of motion for a linear harmonic oscillator and can be 
derived from the Hamiltonian 


H = $@° + 452°¢°. (9.45) 


Small amplitude oscillations in phase space are shown in Fig. 9.4 and we note 
the confinement of the trajectories to the vicinity of the reference point. In case of 
unstable motion the trajectories quickly lead to unbound amplitudes in energy and 
phase (Fig. 9.4 right). 
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Large Oscillation Amplitudes 


For larger oscillation amplitudes we cannot anymore approximate the trigonometric 
function sing * @ by its argument. Following the previous derivation for the 
equation of motion (9.31) we get now 


g= —Q? sin Q, (9.46) 
which can be derived from the Hamiltonian 
H = }¢° — 2’ cosy (9.47) 


being identical to that of a mechanical pendulum. As a consequence of our ability to 
describe synchrotron motion by a Hamiltonian and canonical variables, we expect 
the validity of the Poincaré integral 


= [soa0 = const (9.48) 


under canonical transformations. Since the motion of particles during synchrotron 
oscillations can be described as a series of canonical transformations [9], we find 
the particle density in the (¢g, g ) phase space to be a constant of motion. The same 
result has been used in transverse phase space and the area occupied by this beam in 
phase space has been called the beam emittance. Similarly, we define an emittance 
for the longitudinal phase space. Different choices of canonical variables can be 
defined as required to emphasize the physics under discussion. Specifically we find 
it often convenient to use the particle momentum instead of ¢ utilizing the relation 
(9.17). 

Particle trajectories in phase space can be derived directly from the Hamiltonian 
by plotting solutions of (9.47) for different values of the “energy” H. of the system. 
These trajectories, well known from the theory of harmonic oscillators, are shown 
in Fig. 9.5 for the case of a synchronous phase yw, = 7. 

The trajectories in Fig. 9.5 are of two distinct types. In one type the trajectories 
are completely local and describe oscillations about equilibrium points separated 
by 27 along the abscissa. For the other type the trajectories are not limited to 
a particular area in phase and the particle motion assumes the characteristics of 
libration. This phenomenon is similar to the two cases of possible motion of a 
mechanical pendulum or a swing. At small amplitudes we have periodic motion 
about the resting point of the swing. For increasing amplitudes, however, that 
oscillatory motion could become a libration when the swing continues to go over 
the top. The lines separating the regime of libration from the regime of oscillation 
are called separatrices. 

Particle motion is stable inside the separatrices due to the focusing properties of 
the potential well which in this representation is just the cos g-term in (9.47). The 
area within separatrices is commonly called an rf-bucket describing a place where 
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separatrices 


Fig. 9.5 Phase space diagrams for a synchronous phase yy, = 7 


Fig. 9.6 Potential well for stationary rf buckets, y, = a 


particles are in stable motion. In Fig. 9.6 the Hamiltonian (9.47) is shown in a three- 
dimensional representation with contour lines representing the equipotential lines. 
The stable potential wells, within the separatrices, keeping the particles focused 
toward the equilibrium position, are clearly visible. 

Inside the separatrices the average energy gain vanishes due to oscillatory phase 
motion of the particles. This is obvious from (9.30) which becomes for Ww, = z 


Vw) = Vosinw = Vosin(y, +) = Vosing (9.49) 


averaging to zero since the average phase (y) = 0. 

The area within such separatrices is called a stationary rf-bucket. Such buckets, 
while not useful for particle acceleration, provide the necessary potential well to 
produce stable bunched particle beams in facilities where the particle energy need 
not be changed as for example in a proton or ion storage ring where bunched 
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beams are desired. Whenever particles must receive energy from accelerating fields, 
may it be for straight acceleration or merely to compensate for energy losses 
like synchrotron radiation, the synchronous phase must be different from zero. 
As a matter of fact, due to the principle of phase focusing, particles within the 
regime of stability automatically oscillate about the appropriate synchronous phase 
independent of their initial parameters. 

In the discussion of large amplitude oscillations we have tacitly assumed that 
the synchrotron oscillation frequency remains constant and equal to (9.32) yet, we 
also note that the frequency is proportional to the variation of the rf-voltage with 
phase and we have included in the definition of the synchrotron frequency only 
linear terms so far. Specifically, we note in Fig.9.5 that the trajectories in phase 
space are elliptical only for small amplitudes but are periodically distorted for 
larger amplitudes. This distortion leads to a spread of the synchrotron oscillation 
frequency. 


9.2.3. Acceleration of Charged Particles 


In the preceding paragraph we have arbitrarily assumed that the synchronous phase 
be zero y, = O and as a result of this choice we obtained stationary, non-accelerating 
rf-buckets. No particle acceleration occurs since the particles pass through the 
cavities when the fields are zero. Whenever particle acceleration is required a finite 
synchronous phase must be chosen. The average energy gain per revolution is then 


AE = V(wWs) = Vo sin Ws. (9.50) 


Beam dynamics and stability becomes much different for yy, 4 0. From (9.19), 
we get with (9.21), (9.30), (9.32) a phase equation more general than (9.46) 


2 


= [sin(y, + g) — siny.] = 0, (9.51) 
COs Ws 


g+ 


or after expanding the trigonometric term into its components 
Q2 
g + ——(sin py, cosy + sing cos fy, — sin Ws) = 0. (9.52) 
cos Ws 


This equation can also be derived directly from the Hamiltonian for the dynamics 
of phase motion 


2 


2 
COS Ws 


ip — [cos(W, + vy) —cosW, + gysin Ws] = H. (9.53) 


9.2 Equation of Motion in Phase Space 271 


T 7 7 T T : T 1 
-3.1416 -1.5708 0 1.5708 3.1416 -3.1416 -1.5708 0 1.5708 3.1416 


-3.1416 -1.5708 ie) 1.5708 3.1416 


Fig. 9.7 Phase space diagrams for particles above transition energy (y > ¥.), synchronous phases 
of W; = 2/3 (top left), 5/6 (top right), 27/3 (bottom) 


The phase space trajectories or diagrams differ now considerably from those in 
Fig. 9.5 depending on the value of the synchronous phase wx. In Fig. 9.7 phase space 
diagrams are shown for different values of the synchronous phase and a negative 
value for the momentum compaction 7. 

We note clearly the reduction in stable phase space area as the synchronous phase 
is increased or as the particle acceleration is increased. Outside the phase stable 
areas the particles follow unstable trajectories leading to continuous energy loss or 
gain depending on the sign of the momentum compaction. Equation (9.53) describes 
the particle motion in phase space for arbitrary values of the synchronous phase and 
we note that this equation reduces to (9.45) if we set yy, = a. The energy gain for 
the synchronous particle at yy = w, becomes from (9.18) 


Ape / E(y)dz. (9.54) 


We obtain a finite energy gain or loss whenever the synchronous phase in 
accelerating sections is different from an integer multiple of 180° assuming that all 
accelerating sections obey the synchronicity condition. The form of (9.54) actually 
is more general insofar as it integrates over all fields encountered along the path of 
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Fig. 9.8 3D rendition of a potential well for moving rf buckets y, # 0 


the particle. In case some accelerating sections are not synchronized, the integral 
collects all contributions as determined by the phase of the rf-wave at the time the 
particle arrives at a particular section whether it be accelerating or decelerating. 
The synchronicity condition merely assures that the acceleration in all accelerating 
sections is the same for each turn. 

Particle trajectories in phase space are determined by the Hamiltonian (9.53), 
which is similar to (9.47) except for the linear term in g. Due to this term, the 
potential well is now tilted (Fig.9.8) compared to the stationary case (Fig. 9.6). 
We still have quadratic minima in the potential well function to provide stable 
phase oscillations, but particles escaping over the maxima of the potential well 
will be lost because they continuously loose or gain energy as can be seen by 
following such trajectories in Fig. 9.9. This is different from the case of stationary 
buckets where such a particle would just wander from bucket to bucket while 
staying close to the ideal energy at the center of the buckets. Phase stable 
regions in case of finite values of the synchronous phase are called moving 
rf-buckets. 

The situation is best demonstrated by the three diagrams in Fig. 9.9 showing the 
accelerating field, the potential, and the phase space diagram as a function of the 
phase for different synchronous phases. 
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Fig. 9.9 Phase space focusing for moving rf buckets displaying the phase relationship of 
accelerating field, potential, and rf bucket 


In this particular case we have assumed that the particle energy is above transition 
energy and that the synchronous phase is such that cosy, < 0 to obtain stable 
synchrotron oscillations. The center of the bucket is located at the synchronous 
phase y, and the longitudinal stability range is limited by the phases yy and . In 
the next section we will derive analytical expressions for the longitudinal stability 
limit and use the results to determine the momentum acceptance of the bucket as 
well. 

While both phases, w, as well as 2 — Ws, would supply the desired energy gain 
only one phase provides stability for the particles. The stable phase is easily chosen 
by noting that the synchrotron oscillation frequency {2 must be real and therefore 
Ne COS Ws > 0. Depending on such operating conditions the rf-bucket has different 
orientations as shown in Fig. 9.10. 

We still can choose whether the electric field should accelerate or decelerate 
the beam by choosing the sign of the field. For the decelerating case which, for 
example, is of interest for free electron lasers, the “fish” like buckets in the phase 
space diagram are mirror imaged. 
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Fig. 9.10 Relationship between rf phase and orientation of moving rf buckets for accelerating as 
well as decelerating fields 


9.3. Longitudinal Phase Space Parameters 


We will here investigate in more detail specific properties and parameters of 
longitudinal phase space motion. From these parameters it will be possible to define 
stability criteria. 


9.3.1 Separatrix Parameters 


During the discussions of particle dynamics in longitudinal phase space we found 
specific trajectories in phase space, called separatrices which separate the phase 
stable region from the region where particles follow unstable trajectories leading 
away from the synchronous phase and from the ideal momentum. Within the phase 
stable region particles perform oscillations about the synchronous phase and the 
ideal momentum. This “focal point” in the phase diagram is called a stable fixed 
point (sfp). The unstable fixed point (ufp) is located where the two branches of 
the separatrix cross. The location of fixed points can be derived from the two 
conditions: 

i = 0 and a = 0. (9.55) 

aw dy 


From the first condition, we find with (9.53) that We = 0 independent of any 
other parameter. All fixed points are therefore located along the y-axis of the phase 
diagram as shown in Fig. 9.11. 

The second condition leads to the actual location of the fixed points yp on the 
w-axis and is with wy = W, + @ 


sin We — siny, = 0. (9.56) 
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Fig. 9.11 Characteristic 5 rf-bucket separatrices 
bucket and separatrix \ 
parameters 


te 


This equation can be solved for wp = Wz or Wp = mw — Wz and the coordinates of 
the fixed points are 


(Wet, Wes) = (WW, 0) for the stable fixed point, sfp, and (9.57) 
(War, War) = (2 — Ws,0) for the unstable fixed point, ufp. ‘ 

The distinction between a stable and unstable fixed point is made through the 
existence of a minimum or maximum in the potential at these points respectively. In 
Fig. 9.9, this distinction becomes obvious where we note the stable fixed points in 
the center of the potential minima and the unstable fixed points at the saddle points. 
The maximum stable phase elongation or bunch length is limited by the separatrix 
and the two extreme points yw, and wW2 which we will determine in Sect. 9.3.3. 


9.3.2 Momentum Acceptance 


Particles on trajectories just inside the separatrix reach maximum deviations in 
phase and momentum from the ideal values in the course of performing synchrotron 
oscillations. A characteristic property of the separatrix therefore is the definition of 
the maximum phase or momentum deviation a particle may have and still undergo 
stable synchrotron oscillations. The value of the maximum momentum deviation 
is called the momentum acceptance of the accelerator. To determine the numerical 
value of the momentum acceptance, we use the coordinates of the unstable fixed 
point (9.57) and calculate the value of the Hamiltonian for the separatrix which is 
from (9.53) with Wur = Ws + Gur = 1 — Ws and Wir = 0 


2 
Hr = 
t cos Ws 


Following the separatrix from this unstable fixed point, we eventually reach the 
location of maximum distance from the ideal momentum. Since ¢ is proportional 


[2 cos ¥, — (1 — 2Y,)sin W] . (9.58) 
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to Ap/po, the location of the maximum momentum acceptance can be obtained 
through a differentiation of (9.53) with respect to @ 


dp _ 9? sin Ws — sin(Ws + 9) - 


[ 0. : 
? IO) cos Ws O22) 


At the extreme points where the momentum reaches a maximum or minimum, 
dg /d~ = 0 which occurs at the phase 


sin(s + gy) = sin Ys or g=0. (9.60) 


This is exactly the condition we found in (9.56) for the location of the stable fixed 
points and is independent of the value of the Hamiltonian. The maximum momen- 
tum deviation or momentum acceptance ¢,,- occurs therefore for all trajectories at 
the phase of the stable fixed points y = w,. We equate at this phase (9.58) with 
(9.53) to derive an expression for the maximum momentum acceptance 


$ Pace = 27[2 — (x — 2) tan Ys]. (9.61) 


In accelerator physics it is customary to define an over voltage factor. This factor 
is equal to the ratio of the maximum rf-voltage in the cavities to the desired energy 
gain in the cavity Up 


eVo _ 1 
Uo sins 


q= (9.62) 


and can be used to replace trigonometric functions of the synchronous phase. To 
solve (9.61), we use the expression 


1 
57 — W, = arccos F (9.63) 


derived from the identity cos (37 - VWs) = sin fs, replace the synchrotron oscilla- 


tion frequency S§2 by its representation (9.35) and get with (9.17) the momentum 
acceptance for a moving bucket 


Ap\* Vo sin Ws I 
(=) = eveon, (v gq? — 1 —arccos -) ; (9.64) 
q 


PO / ace h|nc|cpo 


The function 
1 
F(q) =2 (ve — 1 —arccos -) (9.65) 
q 


is shown in Fig. 9.12 as a function of the over voltage factor gq. 


9.3 Longitudinal Phase Space Parameters 277 
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The synchronous phase is always different from zero or 2 when charged particles 
are to be accelerated. In circular electron and very high-energy proton accelerators 
the synchronous phase must be nonzero even without acceleration to compensate for 
synchrotron radiation losses. In low and medium energy circular proton or heavy ion 
storage rings no noticeable synchrotron radiation occurs and the synchronous phase 
is either yy, = 0 or m depending on the energy being below or above the transition 
energy. In either case sin wy, = 0 which, however, does not necessarily lead to a 
vanishing momentum acceptance since the function F(g) approaches the value 2q 
and the factor sin; F(q) — 2 in (9.64) while g — ov. Therefore stable buckets 
for protons and heavy ions can be produced with a finite energy acceptance. The 
maximum momentum acceptance for such stationary buckets is from (9.64) 


Ap\” eV 
( 2 = aie (9.66) 
PO / max,stat. 1h|nNc|cpo 


Note that this expression for the maximum momentum acceptance appears to be 
numerically inconsistent with (9.39) for @ = 7, because (9.39) has been derived for 
small oscillations only (¢@ « z). From Fig.9.11, we note that the aspect ratios of 
phase space ellipses change while going from bucket center towards the separatrices. 
The linear proportionality between maximum momentum deviation and maximum 
phase of (9.39) becomes distorted for large values of @ such that the acceptance of 
the rf-bucket is reduced by the factor 2/2 from the value of (9.39). 

The momentum acceptance is further reduced for moving buckets as the syn- 
chronous phase increases. In circular accelerators, where the required energy gain 
for acceleration or compensation of synchrotron radiation losses per turn is Uo, the 
momentum acceptance is 


2 2 
(=) a r= 52 (2 ) . (9.67) 


PO / max.moving th | Nc Icpo 2 q PO / max,static 


The reduction F(q)/2q in momentum acceptance is solely a function of the 
synchronous phase and is shown in Fig. 9.13 for the case y > Yr. 
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Fig. 9.13 Reduction factor — 
of the momentum acceptance = F(q)/ 2q 
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Overall, the momentum acceptance depends on lattice and rf-parameters and 
scales proportional to the square root of the rf-voltage in the accelerating cavities. 
Strong transverse focusing decreases the momentum compaction thereby increasing 
the momentum acceptance while high rf-frequencies diminish the momentum 
acceptance. Very high frequency accelerating systems based, for example, on high 
intensity lasers to produce high accelerating fields are expected to have a rather 
small momentum acceptance and work therefore best with quasi-monoenergetic 
beams. 

It is often customary to use other parameters than the momentum as the 
coordinates in longitudinal phase space. The most common parameter is the particle 
energy deviation AE/w,- together with the phase. In these units, we get for the 
stationary bucket instead of (9.66) 


AE 


rf 


(9.68) 


max,stat. 


which is measured in eV-sec. Independent of the conjugate coordinates used, 

the momentum acceptance for moving rf-buckets can be measured in units of 
a stationary rf-bucket, where the proportionality factor depends only on the 
synchronous phase. 


9.3.3 Bunch Length 


During the course of synchrotron oscillations, particles oscillate between extreme 
values in momentum and phase with respect to the reference point and both 
modes of oscillation are out of phase by 90°. All particles of a beam perform 
incoherent phase oscillations about a common reference point and generate thereby 
the appearance of a steady longitudinal distribution of particles, which we call a 
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particle bunch. The total bunch length is twice the maximum longitudinal excursion 
of particles from the bunch center defined by 


Q, (9.69) 


where ¢@ is the maximum phase deviation. 

In circular electron accelerators the rf-parameters are generally chosen to gen- 
erate a bucket which is much larger than the core of the beam. Statistical emission 
of synchrotron radiation photons generates a Gaussian particle distribution in phase 
space and therefore the rf-acceptance is adjusted to provide stability far into the tails 
of this distribution. To characterize the beam, however, only the core (one standard 
deviation) is used. In the case of bunch length or energy deviation we consider 
therefore only the situation for small oscillation amplitudes. In this approximation 
the bunch length becomes with (9.39) 


£ | A 
£ _, el"el Ap (9.70) 
2 Q PO |max 
or with (9.35) 
t V2 ¢ A 
a cV20 u CPo P (9.71) 
2 rey heV cos Ws PO \max 


The bunch length in a circular electron accelerator depends on a variety of rf-and 
lattice parameters. It is inversely proportional to the square root of the rf-voltage 
and frequency. A high frequency and rf-voltage can be used to reduce the bunch 
length of which only the rf-voltage remains a variable once the system is installed. 
Practical considerations, however, limit the range of bunch length adjustment this 
way. The momentum compaction is a lattice function and theoretically allows the 
bunch length to adjust to any small value. For high-energy electron rings 7. ~ —Q¢ 
and by arranging the focusing such that the dispersion functions changes sign, 
the momentum compaction factor of a ring can become zero or even negative. 
Rings for which 7, = 0 are called isochronous rings [10]. By adjusting the 
momentum compaction to zero, phase focusing is lost similar to the situation going 
through transition in proton accelerators and total beam loss may occur. In this case, 
however, nonlinear, higher order effects become dominant which must be taken into 
consideration. If on the other hand the momentum compaction is adjusted to very 
small values, beam instability may be avoidable. [11] The benefit of an isochronous 
or quasi-isochronous ring would be that the bunch length in an electron storage ring 
could be made very small. This is important, for example, to either create short 
synchrotron radiation pulses or maximize the efficiency of a free electron laser by 
preserving the micro bunching at the laser wavelength as the electron beam orbits 
in the storage ring. 
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Fig. 9.14 Maximum phases limiting the extend of moving buckets 


In a circular proton or ion accelerator we need not be concerned with the 
preservation of Gaussian tails and therefore the whole rf-bucket could be filled with 
the beam proper at high density. In this case, the bunch length is limited by the 
extreme phases yw, and wW2 of the separatrix. Because the longitudinal extend of the 
separatrix depends on the synchronous phase, we expect the bunch length to depend 
also on the synchronous phase. One limit is given by the unstable fixed point at 
Wi = a — Ws. The other limit must be derived from (9.53), where we replace H. 
by the potential of the separatrix from (9.58). Setting @ = 0, we get for the second 
limit of stable phases the transcendental equation 


cos Wi2 + Wi2 sin Ws = (a — Ws) sin Ws — cos Wg. (9.72) 


This equation has two solutions mod(2z) of which y is one solution and the 
other is wz. Both solutions and their difference are shown in Fig. 9.14 as functions 
of the synchronous phase. 

The bunch length of proton beams is therefore determined only by 


fp = 5 2 = Ui). (9.73) 


Different from the electron case, we find the proton bunch length to be directly 
proportional to the rf-wavelength. On the other hand, there is no direct way of 
compressing a proton bunch by raising or lowering the rf-voltage. This difference 
stems from the fact that electrons radiate and adjust by damping to a changed rf- 
bucket while non-radiating particles do not have this property. However, applying 
adiabatic rf-voltage variation we may modify the bunch length as will be discussed 
in Sect. 9.3.5. 


9.3.4 Longitudinal Beam Emittance 


Separatrices distinguish between unstable and stable regions in the longitudinal 
phase space. The area of stable phase space in analogy to transverse phase space 
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is called the longitudinal beam emittance; however, it should be noted that the 
definition of longitudinal emittance as used in the accelerator physics community 
often includes the factor z in the numerical value of the emittance and is therefore 
equal to the real phase space area. To calculate the longitudinal emittance, we 
evaluate the integral ¢ p dq where p and q are the conjugate variables describing 
the synchrotron oscillation. 

Similar to transverse beam dynamics we distinguish again between beam 
acceptance and beam emittance. The acceptance is the maximum value for the beam 
emittance to be able to pass through a transport line or accelerator components. In 
the longitudinal phase space the acceptance is the area enclosed by the separatrices. 
Of course, we ignore here other possible acceptance limitations which are not 
related to the parameters of the accelerating system. The equation for the separatrix 
can be derived by equating (9.53) with (9.58) which gives with (9.17) and (9.35) 


Acp \* eVo : 
= [cose +1+ 2QW,+9—7)siny,]. (9.74) 
cPo h|Ne|cpo 


We define a longitudinal beam emittance by 


AE 
€g = dg, (9.75) 
S Ort 


where the integral is to be taken along a path S tightly enclosing the beam in phase 
space. Only for Ww; = nz can this integral be solved analytically. The maximum 
value of the beam emittance so defined is the acceptance of the system. Numerically, 
the acceptance of a stationary bucket can be calculated by inserting (9.74) into (9.75) 
and integration along the enclosing separatrices resulting in 


2eVo E 
Epace = 8 sc tEeOL (9.76) 
mh|ne| Ort 


Comparison with the momentum acceptance (9.75) shows the simple relation 
that the longitudinal acceptance is eight times the energy acceptance 


AE 
Ey,ace = 8 ie 
rf 


(9.77) 


max,stat 


For moving rf-buckets, the integration (9.75) must be performed numerically 
between the limiting phases w; and w2. The resulting acceptance in percentage of 
the acceptance for the stationary rf-bucket is shown in Fig. 9.15 as a function of the 
synchronous phase angle. 

The acceptance for yw, < 180° is significantly reduced imposing some practical 
limits on the maximum rate of acceleration for a given maximum rf-voltage. During 
the acceleration cycle, the magnetic fields in the lattice magnets are increased 
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Fig. 9.15 Acceptance of moving rf buckets in units of the acceptance of a stationary rf bucket 


consistent with the available maximum rf-voltage and by virtue of the principle of 
phase focusing the particles will keep close to the synchronous phase whenever the 
rate of energy increase is slow compared to the synchrotron oscillation frequency 
which is always the case. In high-energy electron synchrotrons or storage rings the 
required “acceleration” is no more a free parameter but is mainly determined by the 
energy loss due to synchrotron radiation and a stable beam can be obtained only if 
sufficient rf-voltage is supplied to provide the necessary acceptance. 


9.3.5 Phase Space Matching 


In transverse phase space a need for matching exists while transferring a beam 
from one accelerator to another accelerator. Such matching conditions exist also 
in longitudinal phase space. In the absence of matching part of the beam may be 
lost due to lack of overlap with the rf-bucket or severe phase space dilution may 
occur if a beam is injected unmatched into a too large rf-bucket. In the case of 
electrons a mismatch generally has no detrimental effect on the beam unless part 
or all of the beam exceeds rf-bucket limitations. Because of synchrotron radiation 
and concomitant damping, electrons always assume a Gaussian distribution about 
the reference phase and ideal momentum. The only matching then requires that the 
rf-bucket is large enough to enclose the Gaussian distribution far into the tails of 
7-10 standard deviations. 

In proton and heavy ion accelerators such damping is absent and careful phase 
space matching during the transfer of particle beams from one accelerator to 
another is required to preserve beam stability and phase space density. A continuous 
monochromatic beam, for example, being injected into an accelerator with too large 
an rf-bucket as shown in Fig. 9.16 will lead to a greatly diluted emittance. 

This is due to the fact that the synchrotron oscillation is to some extend nonlinear 
and the frequency changes with oscillation amplitude with the effect that for all 
practical purposes the beam eventually occupies all available phase space. This does 
not conflict with Liouville’s theorem, since the microscopic phase space is preserved 
albeit fragmented and spread through filamentation over the whole bucket. 
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Fig. 9.16 Phase space filamentation after a few synchrotron oscillations 


The situation is greatly altered if the rf-voltage is reduced and adjusted to just 
cover the energy spread in the beam. Not all particles will be accepted, specifically 
those in the vicinity of the unstable fixed points, but all particles that are injected 
inside the rf-bucket remain there and the phase space density is not diluted. The 
acceptance efficiency is equal to the bucket overlap on the beam in phase space. A 
more sophisticated capturing method allows the capture of almost all particles in 
a uniform longitudinal distribution by turning on the rf-voltage very slowly [12], a 
procedure which is also called adiabatic capture. 

Other matching problems occur when the injected beam is not continuous. A 
beam from a booster synchrotron or linear accelerator may be already bunched but 
may have a bunch length which is shorter than the rf-wavelength or we may want to 
convert a bunched beam with a significant momentum spread into an unbunched 
beam with small momentum spread. Whatever the desired modification of the 
distribution of the beam in phase space may be, there are procedures to allow the 
change of particular distributions while keeping the overall emittance constant. 

For example, to accept a bunched beam with a bunch length shorter than 
the rf-wavelength in the same way as a continuous beam by matching only 
the momentum acceptance would cause phase space filamentation as shown in 
Fig.9.17. In a proper matching procedure the rf-voltage would be adjusted such 
that a phase space trajectory surrounds closely the injected beam (Fig. 9.17 left). 
In mathematical terms, we would determine the bunch length @ of the injected 
beam and following (9.70) would adjust the rf-voltage such that the corresponding 
momentum acceptance § = (Ap/po)max Matches the momentum spread in the 
incoming beam. If no correct matching is done and the beam is injected like shown 
in (Fig.9.17 right), then the variation of synchrotron oscillation frequency with 
amplitude would cause filamentation and dilution of beam phase space. Effectively, 
this simulates in real space a larger effective emittance. 

Equation (9.70) represents a relation between the maximum momentum devia- 
tion and phase deviation for small amplitude phase space trajectories which allows 
us to calculate the bunch length as a function of external parameters. Methods 
have been discussed in transverse particle dynamics which allow the manipulation 
of conjugate beam parameters in phase space while keeping the beam emittance 
constant. Specifically, within the limits of constant phase space we were able to 
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Fig. 9.17 Mismatch for a bunched beam (right). Proper match for a bunched beam (left) 


exchange beam size and transverse momentum or beam divergence by appropriate 
focusing arrangements to produce,for example, a wide parallel beam or a small 
beam focus. 

Similarly, we are able to manipulate within the limits of a constant longitudinal 
beam emittance the bunch length and momentum spread. The focusing device in this 
case is the voltage in accelerating cavities. Assume, for example, a particle bunch 
with a very small momentum spread but a long bunch length as shown in Fig. 9.18 
left. To transform such a bunch into a short bunch we would suddenly increase the 
rf-voltage in a time short compared to the synchrotron oscillation period. The whole 
bunch then starts to rotate within the new bucket (Fig.9.18 middle) exchanging 
bunch length for momentum spread. After a quarter synchrotron oscillation period, 
the bunch length has reached its shortest value and starts to increase again through 
further rotation of the bunch unless the rf-voltage is suddenly increased a second 
time to stop the phase space rotation of the bunch (Fig. 9.18 right). The rf-voltage 
therefore must be increased to such a value that all particles on the bunch boundary 
follow the same phase space trajectory. 

This phase space manipulation can be conveniently expressed by repeated appli- 
cation of (9.39). The maximum momentum deviation (Ap/po)o and the maximum 
phase deviation @p for the starting situation in Fig. 9.18 (left) are related by 


Ap (2 


= 7——_ $v: (9.78) 
Po h@rev |Ne| 


0 


where {2p is the starting synchrotron oscillation frequency for the rf-voltage Vo. To 
start bunch rotation the rf-voltage is increased to V; (Fig.9.18 middle) and after 
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Fig. 9.18 Phase space rotation 


a quarter synchrotron oscillation period at the frequency 92; « ./V; the phase 
deviation Gp has transformed into the momentum deviation 


Ap 
Po 


‘ia 


= ——__ @. (9.79) 
h®rev |Ne| 


At the same time the original momentum error Ap /Polo has become a phase error 
@, given by 


2, 


Ap 
ee = Q 
h@rey |Ne| 


Po 


1. (9.80) 


Now we need to stop further phase space rotation of the whole bunch. This can 
be accomplished by increasing a second time the rf-voltage during a time short 
compared to the synchrotron oscillation period in such a way that the new bunch 
length or @ is on the same phase space trajectory as the new momentum spread 
Ap /poli (Fig. 9.18 right). The required rf-voltage is then determined by 


Ap 
Po 


Q, 


= yg 
1 h@rey |Nel 4 


(9.81) 


while solving £2) for the voltage V2. We take the ratio of (9.77) and (9.80) to get 


i122 Ap 
~1 22 _ Ap/poli (9.82) 


Po 20 Ap/polo 
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and replace the ratio of the momentum spreads by the ratio of (9.78) and (9.79). 
With Q; « ./V; and £ « @ we get finally the scaling law for the reduction of the 
bunch length 


7- (2) - (9.83) 


In other words the bunch length can be reduced by increasing the rf-voltage in a two 
step process and the bunch length reduction scales like the fourth power of the rf- 
voltage. This phase space manipulation is symmetric in the sense that a beam with 
a large momentum spread and a short bunch length can be converted into a bunch 
with a smaller momentum spread at the expense of the bunch length by reducing the 
rf-voltage in two steps. 

The bunch length manipulation described here is correct and applicable only for 
non-radiating particles. For radiating particles like electrons, the bunch manipula- 
tion is easier due to damping effects. Equation (9.39) still holds, but the momentum 
spread is independently determined by synchrotron radiation and the bunch length 
therefore scales simply proportional to the square root of the rf-voltage. 


9.4 Higher-Order Phase Focusing 


The principle of phase focusing is fundamental for beam stability in circular 
accelerators and we find the momentum compaction factor to be a controlling 
quantity. Since the specific value of the momentum compaction determines critically 
the beam stability, it is interesting to investigate the consequences to beam stability 
as the momentum compaction factor varies. Specifically, we will discuss the 
situation where the linear momentum compaction factor is reduced to very small 
values and higher-order terms become significant. This is, for example, of interest 
in proton or ion accelerators going through transition energy during acceleration, or 
as we try to increase the quadrupole focusing in electron storage rings to obtain a 
small beam emittance, or when we intentionally reduce the momentum compaction 
to reduce the bunch length. In extreme cases, the momentum compaction factor 
becomes zero at transition energy or in an isochronous storage ring where the 
revolution time is made the same for all particles independent of the momentum. 
The linear theory of phase focusing would predict beam loss in such cases due to 
lack of phase stability. To accurately describe beam stability when the momentum 
compaction factor is small or vanishes, we cannot completely ignore higher-order 
terms. Some of the higher-order effects on phase focusing will be discussed here. 
There are two main contributions to the higher-order momentum compaction factor, 
one from the dispersion function and the other from the momentum dependent path 
length. First, we derive the higher-order contributions to the dispersion function, and 
then apply the results to the principle of phase focusing to determine the perturbation 
on the beam stability. 
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9.4.1 Dispersion Function in Higher Order 


The first-order change in the reference path for off energy particles is proportional to 
the relative momentum error. The proportionality factor is a function of the position 
s and is called the dispersion function. This result is true only in linear beam 
dynamics. We will now derive chromatic effects on the reference path in higher 
order to allow a more detailed determination of the chromatic stability criteria. The 
linear differential equation for the normalized dispersion function is 


d2 Wo 
dy? 


+ °wo = vB = v°F(), (9.84) 


where ¢ is the betatron phase, wo = 1o/ VB. , B(z) the betatron function and 7(z) 


the undisturbed dispersion function. The periodic solution of (9.84) is called the 
+oo v?Fnei”” 

n d 
n=—0oO y2—72 ? 


normalized dispersion function wo(y) = >> 


F(y) = Bix =) F, ei”. (9.85) 


This linear solution includes only the lowest-order chromatic error term from 
the bending magnets and we must therefore include higher-order chromatic terms 
into the differential equation of motion. To do that we use the general differential 
equation of motion while ignoring all coupling terms 


x" + (Kk? + kx = +8 — «8? + «8 — 5m (1 — 8)x? + k'xy (9.86) 
— (k? + 2xk)(1 — 8)? + Su(1 — 8) x” 
+ (2k? + k)x 8 — (k + 2k?)xd? + O(4), 


where k = 1/p. We are only interested in the chromatic solution with vanishing 
betatron oscillation amplitudes and insert for the particle position therefore 


Xy = 05 + m8 + 25° + O(4). (9.87) 


Due to the principle of linear superposition separate differential equations exist 
for each component 7; by collecting on the right-hand side terms of equal power 
in 6. For the terms linear in 6, we find the well-known differential equation for the 
dispersion function 


ny + K(2)n0 = « = )) Fone”®, (9.88) 


n 
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where we also express the perturbations by its Fourier expansion. The terms 
quadratic in 5 form the differential equation 


mn + K(2m =— >) Fone”? (9.89) 


— zg — (kK? + 2k) + Zeng + «Non + (2k? + k)no 


in in 
=-) Fone”? +) Fine ts 
n n 


and terms cubic in 6 are determined by 
nm +K@m=+ >. Fone’ — >) Fine”? (9.90) 


—mnon — 2 (kK? + 2k) non + (2k? +k) 
+ «nym, + «(non + nom) +" non 


in in in 
=+) Fone”? — ) Fie”? +) Fo, e"? . 
n n n 


We note that the higher-order dispersion functions are composed of the negative 
lower-order solutions plus an additional perturbation. After transformation of these 
differential equations into normalized variables, w = n/ 4/6. , etc., we get with j = 
0, 1, 2 differential equations of the form 


m=j n=0co 
iiy(g) + vowi(9) = VOBPF = v5) DY CD"B Fine”, (9.91) 
m=0 n=—oco 


where we have expressed the periodic perturbation on the r.h.s. by an expanded 
Fourier series. Noting that the dispersion functions w;(g) are periodic, we try the 
ansatz 


wj(y) = >_ wine”? (9.92) 


and insertion into (9.91) allows to solve for the individual Fourier coefficients wj, 
by virtue of the orthogonality of the exponential functions e’”?. We get for the 
dispersion functions up to second order and reverting to the ordinary 7-function 


F i eine 
nol) = +B? (v) >~ ee (9.93a) 


9.4 Higher-Order Phase Focusing 289 


Fin ey 


Fon et 
; Sete ; 


m(g) = —B’ (9) Ly (9.93b) 


Fone ing ne Fon ~~ 
m(@) = +B) os rT — Bp Dar St POD 
, (9.93c) 


The solutions of the higher-order differential equations have the same integer- 
resonance behavior as the linear solution for the dispersion function. The higher- 
order corrections will become important for lattices where strong sextupoles are 
required in which cases the sextupole terms may be the major perturbations to 
be considered. Other perturbation terms depend mostly on the curvature « in the 
bending magnets and, therefore, maybe small for large rings or beam-transport lines 
with weak bending magnets. 


9.4.2 Path Length in Higher Order 


The path length together with the velocity of particles governs the time of arrival 
at the accelerating cavities from turn to turn and therefore defines the stability of a 
particle beam. Generally, only the linear dependence of the path length on particle 
momentum is considered. We find, however, higher-order chromatic contributions 
of the dispersion function to the path length as well as momentum independent 
contributions due to the finite angle of the trajectory with respect to the ideal orbit 
during betatron oscillations. 

The path length for a particular trajectory from point z) to point z in our 
curvilinear coordinate system can be derived from the integral L = On ds, where 
s is the coordinate along the particular trajectory. This integral can be expressed by 


L= p va + Kx)? + x? + y2dz, (9.94) 


where the first term of the integrand represents the contribution to the path length 
due to curvature generated by bending magnets while the second and third term 
are contributions due to finite horizontal and vertical betatron oscillations. For 
simplicity, we ignore vertical bending magnets. Where this simplification cannot 
be made, it is straight forward to extend the derivation of the path length in higher 
order to include bending and betatron oscillations in the vertical plane as well. We 
expand (9.94) up to second order and get for the path length variation AL = L — Lo 


AL= p (ux + 4u?x? + bx? + Sy) dz + O(3). (9.95) 
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The particle amplitudes are composed of betatron oscillation, orbit distortions 
and off energy orbits 


x=xp+x0+ md +67 +..., (9.96) 
y=yp+ yo, 

where (xp. yg) describe the betatron oscillations and (y0,7,...) are the linear 

and higher-order dispersion functions derived in Sect. 9.4.1. The quantities (xo, yo) 

describe the deviation of the actual orbit from the ideal orbit or orbit distortion due 

to magnetic field and alignment errors. 

Evaluating the integral (9.95), we note that the oscillatory character of (xg, yg) 
causes all terms linear in (xg,yg) to vanish while averaging over many turns. 
The orbit distortions (xo, yo) are statistical in nature since the correction in a real 
accelerator is done such that (xo) = 0 and (xj) = 0. Betatron oscillations and 
orbit distortions are completely independent and therefore cross terms like (xgx0) 
vanish. The dispersion function 79 and the higher-order term 7; are unique periodic 
solutions of the inhomogeneous equation of motion. For the betatron oscillations we 
assume a nonresonant tune which causes terms like (xgyo) to vanish as well. With 
these results the path length variation is 


AL & io (xf +99 +20 + 98 + 02x} + 0228) de O97) 
+ 8 rngde + 8 p (km + 5K ND + 31) dz. 


There are three main contributions of which two are of chromatic nature. 
The finite transverse betatron oscillations as well as orbit distortions contribute 
to a second order increase in the path length of the beam transport system 
which is of non-chromatic nature. Equation (9.97) can be simplified by using 


the explicit expressions for the particle motion xg(z) = V€xx(z) sin Wx(z) 
and Xp (2) =  vVex/B,(z) [cos W(z) — a(z) sin W(z)]. Forming the square 
xp = €x/ Bx (2) (cos? Wy — ay sin2y, + a2 sin? re) and averaging over all phases 
Wr 
2 1 2 2 in? 
xp dz = € B. (cos” Wr, + ay sin* Wy) dz, (9.98) 


2Wx ; 
— af (Ss 2 + yx. sin’ vs) dz x ef y.dz, 


where we used the simplifying expression (sin* Vx) ~~ ‘. Similarly, we get 
f yg” dz ~ sey yy dz, and f Kx, dz x Sex PK By dz. The integrals are taken 
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over the entire beam transport line of length Lo and using average values for the 
integrands, the path-length variation is 


AL 


AE = Elec) +ey(05) + 46s) 


72 


+ +(x," + L(y, ) + 3 (kx) (9.99) 
+ a.6 + ((em) + 5 (k?15) + $(n67)) 8° + O(3). 


In this expression for the path-length variation we find separate contributions 
due to betatron oscillations, orbit distortion and higher-order chromatic effects. We 
have used the emittance € as the amplitude factor for betatron oscillation and get 
therefore a path length spread within the beam due to the finite beam emittance e. 
Note specifically that for an electron beam this emittance scales by the factor n? to 
include Gaussian tails, where n, is the oscillation amplitude in units of the standard 
amplitude o. For whole beam stability a total emittance of jo, = 77€ — 10’ should 
be considered. For stable machine conditions, the contribution of the orbit distortion 
is the same for all particles and can therefore be corrected by an adjustment of 
the rf-frequency. We include these terms here, however, to allow the estimation of 
allowable tolerances for dynamic orbit changes. 


9.4.3 Higher Order Momentum Compaction Factor 


The longitudinal phase stability in a circular accelerator depends on the value of the 
momentum compaction 7,, which actually regulates the phase focusing to obtain 
stable particle motion. This parameter is not a quantity that can be chosen freely 
in the design of a circular accelerator without jeopardizing other desirable design 
goals. If, for example, a small beam emittance is desired in an electron storage 
ring, or if for some reason it is desirable to have an isochronous ring where the 
revolution time for all particles is the same, the momentum compaction should 
be made to become very small. This in itself does not cause instability unless the 
momentum compaction approaches zero and higher-order chromatic terms modify 
phase focusing to the extent that the particle motion becomes unstable. To derive 
conditions for the loss of phase stability, we evaluate the path length variation (9.99) 
with momentum in higher order 


AL 
= 8 + on 4 £4 OG), (9.100) 
() 


where & represents the momentum independent term 


E = t (ex (ys) + & (ry) + €x (KBs) (9.101) 
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and 
a = (km) + 4 (k?m5) + 5 (no) (9.102) 


is the non-linear momentum compaction factor. 

From the higher order dispersion and path length we may now derive the value 
of the higher order momentum compaction factor. First we note that we are not 
interested in oscillatory terms. Therefore (9.93b) reduces to 


2 
_e Oe a: B {o) 


m(g) = Fi, (9.103) 


Foo = (k) and 


1 1 
Fy = (—Sym = (k° + 2«k) ne + 5k Mo +k'nony + (2k? +k) my): 


Furthermore 
2 
(em) = (« ee Fig. eta), (9.104) 


where the average is to be taken over one superperiod of the accelerator. The other 
terms in (9.102) and (9.101) are straight forward. With the knowledge of the higher 
order momentum compaction factor we are now able to consider higher order phase 
motion. 


9.4.4 Higher-Order Phase Space Motion 


Following the derivation of the linear phase equation, we note that it is the variation 
of the revolution time with momentum rather than the path-length variation that 
affects the synchronicity condition. With the expanded momentum compaction n,. = 
2 — awe get the differential equation for the phase oscillation to second order 


a 
= = —wr (M5 — a5" — €) (9.105) 
and for the momentum oscillation 
06 V, 
= ©" (sin Wy —sinys). (9.106) 


Ot — Tocpo 


9.4 Higher-Order Phase Focusing 293 


S\S2/(Paase 
SERS 


oy 
eZ 


Ewa 
Za 
DRA 


A“ 


ex 


Ws Ws Cc D 
Fig. 9.19 Second-order longitudinal phase space for ¥; = 0, € = O and weak perturbation 
at / Ne = —3.0 
In linear approximation, where a; = 0 and € = 0, a single pair of fixed points 


and separatrices exist in phase space. These fixed points can be found from the 
condition that y = 0 and = 0 and they lie on the abscissa for 5 = 0. The 
stable fixed point is located at (Wer, dsr) = (Ws, 0) defining the center of the rf- 
bucket where stable phase oscillations occur. The unstable fixed point at (Wur, dur) = 
(x — W,, 0) defines the crossing point of the separatrices separating the trajectories 
of oscillations from those of librations. 

Considering also higher-order terms in the theory of phase focusing leads to a 
more complicated pattern of phase space trajectories. Setting (9.106) equal to zero 
we note that the abscissae of the fixed points are at the same location as for the linear 
case 


Wit = Ws and Wo = 1 — Ws. (9.107) 


The energy coordinates of the fixed points, however, are determined by solving 
(9.105) for W = 0 or 


nob — 06° —& =0 (9.108) 


with the solutions 


ia (1 de vI-T) (9.109) 
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Fig. 9.20 Higher-order longitudinal phase space diagrams for yy = 0, & = O and strong 
perturbation a, /7n. = —6.0 (top) and a; /n. = —13.5 (bottom) 


where 


_ 4éqy 


rT a 


(9.110) 


Due to the quadratic character of (9.108), we get now two layers of fixed points 
with associated areas of oscillation and libration. In Figs.9.19, 9.20, the phase 
diagrams are shown for increasing values of a while for now we set the momentum 
independent perturbation = 0. Numerically, the contour lines have been calculated 
from the Hamiltonian (9.114) with A/2n, = 0.005, where A is defined in (26199). 
The appearance of the second layer of stable islands and the increasing perturbation 
of the original rf-buckets is obvious. There is actually a point [Fig.9.20 (top)] 
where the separatrices of both island layers merge. We will use this merging of the 
separatrices later to define a tolerance limit for the perturbation on the momentum 
acceptance. 

The coordinates of the fixed points in the phase diagram are determined from 
(9.116), (9.117) and are for the linear fixed points in the first layer 


pointA: Wa=Vs, da = (1 oe F) 
pointB: wg=a—wW,, b3 = 4 (1- VIP). 


2a 


(9.111) 
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Fig. 9.21 Second-order longitudinal phase space for the same parameters as Fig. 9.20 (top), but 
now with 2§/n, = —0.125 


The momenta of these fixed points are at 6 = 0 for = 0 consistent with 
earlier discussions. As orbit distortions and betatron oscillations increase, however, 
we note a displacement of the equilibrium momentum as I" increases. 

The fixed points of the second layer of islands or rf-buckets are displaced both in 
phase and in momentum with respect to the linear fixed points such that the stable 
and unstable fixed points are interchanged. The locations of the second layer of fixed 
points are 


pointC: Wo=s, bc = # (14+ VI-T), 


2a} 


pointD: wp =27—vy, bp = 2 (1+VI-T). 


2a 


(9.112) 


The dependence of the coordinates for the fixed points on orbit distortions and the 
amplitude of betatron oscillations becomes evident from (9.121), (9.124). Specifi- 
cally, we note a shift in the reference momentum of the beam by &/7, as the orbit dis- 
tortion increases as demonstrated in the examples shown in Figs. 9.21, 9.22, 9.23c, 
d. Betatron oscillations, on the other hand, cause a spread of the beam momentum 
in the vicinity of the fixed points. This readjustment of the beam momentum is a 
direct consequence of the principle of phase focusing whereby the particle follows 
a path such that the synchronicity condition is met. The phase space diagram 
of Fig.9.19 is repeated in Fig.9.21 with a parameter 2&/n, = —0.125 and in 
Fig.9.22 with the further addition of a finite synchronous phase of wy, = 0.7 
rad. In addition to the shift of the reference momentum a significant reduction in 
the momentum acceptance compared to the regular rf-buckets is evident in both 
diagrams. 

As long as the perturbation is small and |ai| < |7.|, the new fixed points are 
located far away from the reference momentum and their effect on the particle 
dynamics can be ignored. The situation becomes very different whenever the 
linear momentum compaction becomes very small or even zero due to strong 
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Fig. 9.22 Higher-order longitudinal phase space diagrams for yw, = 0.7, 2&/n, = —0.125 and a 
weak perturbation a) /n_. = —3.0 
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quadrupole focusing during momentum ramping through transition or in the case 
of the deliberate design of a low qa-lattice for a quasi isochronous storage ring. In 
these cases higher order perturbations become significant and cannot be ignored. 
We cannot assume anymore that the perturbation term a is negligibly small and the 
phase dynamics may very well become dominated by perturbations. 

The perturbation a; of the momentum compaction factor depends on the 
perturbation of the dispersion function and is therefore also dependent on the 
sextupole distribution in the storage ring. Given sufficient sextupole families it is 
at least in principle possible to adjust the parameter a; to zero or a small value by a 
proper distribution of sextupoles. 


9.4.5 Stability Criteria 


Stability criteria for phase oscillations under the influence of higher order momen- 
tum compaction terms can be derived from the Hamiltonian. The nonlinear equa- 
tions of motion (9.105), (9.106) can be derived from the Hamiltonian 


* [cos ¥ —cos Ws + (W — Vs) sin Ws] + cone (&5 — $.5° + 406°). 
(9.113) 


= Te CPo 


To eliminate inconsequential factors for the calculation of phase space trajecto- 
ries, we simplify (9.113) to 


H = A[cos w —cos Ws + (W — Ws) sin Ws] + 225 —8 4+ a (9.114) 


c 
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where 


2eV + 


eee ae (9.115) 
Tocpo@rt Ne 


We may use (9.114) to calculate phase space trajectories and derive stability 
conditions for various combinations of the parameters A, the perturbation of the 
momentum compaction a, and the synchronous phase w, (Figs. 9.19, 9.20, 9.21, 
and 9.22). In Fig.9.23, the phase diagrams of Figs.9.19, 9.20, 9.21, and 9.22 
are displayed now as three-dimensional surfaces plots with the same parameters. 
Starting from the linear approximation where only regular rf-buckets appear along 
the w-axis, we let the ratio a /n_ increase and find the second set of rf-buckets 
to move in from large relative momentum errors ds toward the main rf-buckets. 
A significant modification of the phase diagrams occurs when the perturbation 
reaches such values that the separatrices of both sets of buckets merge as shown in 
Fig. 9.20 (top). A further increase of the perturbation quickly reduces the momentum 


a) b) 

second order unstable second order unstable 

fixed) points \ \ stable fixed pons stable 
Irf-buckets 


42 
—6. 


~ ( se 
is 

SJ ( 

6.28 ‘ty 


-0.05 


Be G22. 15 
eecondiorder unstable ) second order unstable stable 
: ; fixed | points 
fixed pons : P rf-buckets 


Fig. 9.23 Three dimensional rendition of Figs. 9.19(a), 9.20(b), 9.21(c) and 9.22(d) 
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acceptance of the rf-system as can be noticed by comparing Figs.9.20 (top) and 
(bottom) or Figs. 9.23 (top) and 9.23 (bottom). The effect of the momentum shift 
when & 4 0 becomes obvious in Figs. 9.21, 9.22, 9.23c, d as well as the effect of a 
finite synchronous phase in Fig. 9.23d. 

From these qualitative observations we derive a threshold of allowable perturba- 
tion a; above which the momentum acceptance of the system becomes significantly 
reduced. From Figs. 9.20 (top) we take the condition for momentum stability when 
the separatrices of both sets of buckets merge which occurs when the Hamiltonian 
for both separatrices or for the fixed points (B) and (C) are equal and 


H (x — Ws, 6s) = As, bc). (9.116) 
Equation (9.116) becomes in the form of (9.114) 


§ 


Fok + 2 
Ne 


A (—2 cos Ws + (a — 2W;) sin Ws) — 53 + 


Bie 
Nc 


bp (9.117) 
= —§2 4 ra i 


Comparing (9.115) with the results of linear theory, we note that the maximum 
unperturbed momentum acceptance is related to the parameter A by 


= I (2) INc| (9.118) 
F(q) sin Ws PO / max "Nec , 


where ; = siny, and F(q) is defined in (9.65). Equation (9.117) can be solved for 
the maximum momentum acceptance 


ary 2 a) 3 3 g 
— = 6a — 6 da — 63) + 2—— (6c — dp). (9.119) 
(5 = = a a) + ing ») inl 


Using the expression (9.109) for the coordinates of the fixed points (9.119) 
eventually becomes with (9.110) 


Ap\* : 
ae ee Ge ce (9.120) 
Po 307 


and the stability criterion that the nonlinear perturbation not reduce the momentum 
acceptance is finally expressed by 


nel (= ry""* 
ay) 
PO / desired 


a < (9.121) 
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From this criterion we note that the momentum independent perturbation J” can 
further limit the momentum acceptance until there is for J” > | no finite momentum 
acceptance left at all. 

The momentum shift and the momentum acceptance as well as stability limits 
can be calculated analytically as a function of a; and the momentum independent 
term I". As long as the perturbation is small and (9.121) is fulfilled we calculate the 
momentum acceptance for the linear rf-buckets from the value of the Hamiltonian 
(9.114). For stronger perturbations, where the separatrices of both layers of rf- 
buckets have merged and are actually exchanged (Fig. 9.20), a different value of 
the Hamiltonian must be chosen. The maximum stable synchrotron oscillation in 
this case is not anymore defined by the separatrix through fixed point B but rather 
by the separatrix through fixed point C. In the course of synchrotron oscillations a 
particle reaches maximum momentum deviations from the reference momentum at 
the phase wy = w,. We have two extreme momentum deviations, one at the fixed 
point (C), and the other half a synchrotron oscillation away. Both points have the 
same value of the Hamiltonian (9.114) and are related by 


2 1 


$3 =2 Bee 8+ 37 8. (9.122) 
Ne 


We replace dc from (9.112) and obtain a third-order equation for the maximum 
momentum acceptance 6 


E ~ ay 2.28 43 Ne 3/2 3 
22-8 — 8? —s =-—>]}1 lea) =sl). 9.123 
ae ga erase | (9.123) 


This third-order equation can be solved analytically and has the solutions 


(9.124) 


Two of the three solutions are the same and define the momentum at the crossing 
of the separatrix at the fixed point (C) while the other solution determines the 
momentum deviation half a synchrotron oscillation away from the fixed point 
(C). We plot these solutions in Fig. 9.24 together with the momentum shift of the 
reference momentum at the fixed point (A). As long as there is no momentum 
independent perturbation (J” = 0) the momentum acceptance is given by 


(9.125) 


The asymmetry of the momentum acceptance obviously reflects the asymmetry 
of the separatrix. For ~; —> 0 the momentum acceptance in (9.120) diverges, which 
is a reminder that we consider here only the case where the perturbation a, exceeds 
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the limit (9.121). In reality the momentum acceptance does not increase indefinitely 
but is limited by other criteria, for example, by the maximum rf-voltage available. 
The momentum acceptance limits of (9.124) are further reduced by a finite beam 
emittance when I” 4 0 causing a spread in the revolution time. All beam stability is 
lost as I” approaches unity and the stability criterion for stable synchrotron motion 
in the presence of betatron oscillations is defined by 


4Eay 
n2 


2, (9.126) 


where the parameter & is defined by (9.101). 

In evaluating the numerical value of € we must consider the emittances €,, as 
amplitude factors. In case of a Gaussian electron beam in a storage ring, for example, 
a long quantum lifetime can be obtained only if particles with betatron oscillation 
amplitudes up to at least seven standard values are stable. For such particles the 
emittance is € = 7*€,, where €, is the beam emittance for one standard deviation. 
Similarly, the momentum acceptance must be large enough to include a momentum 
deviation of bmax > 70, /Eo. 

In general, the stability criteria can be met especially if sextupole magnets are 
adjusted that the linear perturbation a, of the momentum compaction is made 
small. In this case, however, we must consider dynamic stability of the beam and 
storage ring to prevent qa, to vary more than the stability criteria allow. Any dynamic 
variation Aq, must meet the condition 


2 
Aa, < -=. 9.127 
< qe ( ) 
Even if the quadratic term a; is made to approach zero we still must consider the 
momentum shift due to non-chromatic terms when € # 0. From (9.111) we have 
for the momentum shift 59 of the stable fixed point A 


ca (1-vI=T), (9.128) 


7 2a} 


9.4 Higher-Order Phase Focusing 301 


where J” is small when a; — 0 and the square root can be expanded. In this limit 
the momentum shift becomes 


for a, > 0. (9.129) 


To achieve low values of the momentum compaction, it is therefore also 
necessary to reduce the particle beam emittance. Case studies of isochronous lattices 
show, however, that this might be very difficult because the need to generate both 
positive and negative values for the dispersion function generates large values for 
the slopes of the dispersion leading to rather large beam emittances. 

Adjusting the quadratic term a, to zero finally brings us back to the situation 
created when the linear momentum compaction was reduced to small values. One 
cannot ignore higher-order terms anymore. In this case we would expect that the 
quadratic and cubic perturbations of the momentum compaction will start to play 
a significant role since n, ~ 0 and a, ~ 0. The quadratic term a3 will introduce 
a spread of the momentum compaction due to the momentum spread in the beam 
while the cubic term a, introduces a similar spread to the linear term a. 


Problems 


9.1 (S). A 500 MHz rf-system is supposed to be used in a Wideroe type linac to 
accelerate protons from a 1 MeV Van de Graaf accelerator. Determine the length of 
the first three drift tubes for an accelerating voltage at the gaps of 0.5 MeV while 
assuming that the length of the tubes shall not be less than 15cm. Describe the 
operating conditions from an rf-frequency point of view. 


9.2 (S). A proton beam with a finite energy spread is injected at an energy of 
200 MeV into a storage ring in np, equidistant short bunches while the rf-system 
in the storage ring is turned off. Derive an expression for the debunching time or the 
time it takes for the bunched proton beam to spread out completely. 


9.3 (S). The momentum acceptance in a synchrotron is reduced as the synchronous 
phase is increased. Derive a relationship between the maximum acceleration rate 
and momentum acceptance. How does this relationship differ for protons and 
radiating electrons? 


9.4 (S). Derive an expression for and plot the synchrotron frequency as a function 
of oscillation amplitude within the separatrices. What is the synchrotron frequency 
at the separatrices? 


9.5 (S). Sometimes it is desirable to produce short bunches, even only temporary 
in a storage ring either to produce short X-ray pulses or for quick ejection from a 
damping ring into a linear collider. By a sudden change of the rf-voltage the bunch 
can be made to rotate in phase space. Determine analytically the shortest possible 
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bunch length as a function of the rf-voltage increase considering a finite energy 
spread. For how many turns would the short bunch remain within 50 % of its shortest 
value? 


9.6. Calculate the synchrotron oscillation frequency for a 9GeV proton booster. 
The maximum momentum is CPmax = 8.9GeV the harmonic number i = 84, the 
rf-voltage Vie = 200kYV, transition energy yy =5.4 and rf-frequency at maximum 
momentum fe = 52.8 MHz. Calculate and plot the rf and synchrotron oscillation 
frequency as a function of momentum from an injection momentum of 400 MeV to 
a maximum momentum of 8.9 GeV while the synchronous phase is yw, = 45°. What 
is the momentum acceptance at injection and at maximum energy? How long does 
the acceleration last? 


9.7. Specify a synchrotron of your choice made up of FODO cells for the 
acceleration of relativistic particles. Assume an rf-system to provide an accelerating 
voltage equal to 10~* of the maximum particle energy in the synchrotron. During 
acceleration the synchrotron oscillation tune shall remain less than vs < 0.02. 
What are the numerical values for the rf-frequency, harmonic number, rf-voltage, 
synchronous phase angle and acceleration time in your synchrotron? In case of a 
proton synchrotron determine the change in the bunch length during acceleration. 
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Chapter 10 
Periodic Focusing Systems 


The fundamental principles of charged particle beam dynamics as discussed in 
previous chapters can be applied to almost every beam transport need. Focusing and 
bending devices for charged particles are based on magnetic or electric fields which 
are specified and designed in such a way as to allow the application of fundamental 
principles of beam optics leading to predictable results. 

Beam transport systems can be categorized into two classes: The first group 
includes beam transport lines which are designed to guide charged particle beams 
from point A to point B. In the second class, we find beam transport systems or 
magnet lattices forming circular accelerators. The physics of beam optics is the 
same in both cases but in the design of actual solutions different boundary conditions 
apply. Basic linear building blocks in a beam transport line are the beam deflecting 
bending magnets, quadrupoles to focus the particle beam, and field free drift spaces 
between magnets. Transformation matrices have been derived in Chap.7 and we 
will apply these results to compose more complicated beam transport systems. The 
arrangement of magnets along the desired beam path is called the magnet lattice or 
short the lattice. 

Beam transport lines can consist of an irregular array of magnets or a repetitive 
sequence of a group of magnets. Such a repetitive magnet sequence is called a 
periodic magnet lattice, or short periodic lattice and if the magnet arrangement 
within one period is symmetric this lattice is called a symmetric magnet lattice, 
or short a symmetric lattice. By definition a circular accelerator lattice is a periodic 
lattice with the circumference being the period length. To simplify the design and 
theoretical understanding of beam dynamics it is customary, however, to segment 
the full circumference of a circular accelerator into identical sectors which are 
repeated a number of times to form the complete ring. Such sectors are called 
superperiods and define usually most salient features of the accelerator in contrast 
to much smaller periodic segments called cells, which include only a few magnets. 

In this chapter, we concentrate on the study of periodic focusing structures. For 
long beam transport lines and specifically for circular accelerators it is prudent to 
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consider focusing structures that repeat periodically. In this case, one can apply 
beam dynamics properties of one periodic lattice structure as many times as 
necessary with known characteristics. In circular particle accelerators such periodic 
focusing structures not only simplify the determination of beam optics properties 
in a single turn but we will also be able to predict the stability criteria for particles 
orbiting an indefinite number of revolutions around the ring. 

To achieve focusing in both planes, we will have to use both focusing and 
defocusing quadrupoles in a periodic sequence such that we can repeat a lattice 
period any number of times to form an arbitrary long beam line which provides the 
desired focusing in both planes. 


10.1 FODO Lattice 


The most simple periodic lattice would be a sequence of equidistant focusing 
quadrupoles of equal strength. This arrangement is unrealistic with magnetic 
quadrupole fields which do not focus in both the horizontal and vertical plane in 
the same magnet. The most simple and realistic compromise is therefore a periodic 
lattice like the symmetric quadrupole triplet which was discussed in Sect. 7.2.3. and 
is shown schematically in Fig. 10.1. 

Each half of such a lattice period is composed of a focusing (F) and a defocusing 
(D) quadrupole with a drift space (O) in between forming a FODO sequence. 
Combining such a sequence with its mirror image as shown in Fig. 10.1 results in 
a periodic lattice which is called a FODO lattice or a FODO channel. By starting 
the period in the middle of a quadrupole and continuing to the middle of the next 
quadrupole of the same sign not only a periodic lattice but also a symmetric lattice 
is defined. Such an elementary unit of focusing is called a lattice unit or in this 
case a FODO cell. The FODO lattice is the most widely used lattice especially in 
high energy accelerator systems because of its simplicity, flexibility, and its beam 
dynamical stability. 


FODO Period 
I 
| I|_ Zz 
| L\ 
| I 
| | 
I | 
Y% QF QD ¥2 QF 


Fig. 10.1 FODO-lattice (QF focusing quadrupole, QD defocusing quadrupole) 
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10.1.1 Scaling of FODO Parameters 


To determine the properties and stability criteria for a FODO period we restrict 
ourselves to thin lens approximation, where we neglect the finite length of the 
quadrupoles. The FODO period can be expressed symbolically by the sequence 
$QF-L-QD-L-3QF, where the symbol L represents a drift space of length L and 
the symbols QF and QD are focusing or defocusing quadrupoles, respectively. In 
either case we have a triplet structure for which the transformation matrix has been 
derived in Sect. 7.2.3 


2 L 
1-24 aL (1+ +) 


1 ve 
“fr 1-27 


Mropo = (10.1) 


Here ff = —fa = f, 1/f* = 2(1 — L/f)L/f? and is called a symmetric FODO 
lattice. 

From the transformation matrix (10.1) we can deduce an important property for 
the betatron function. The diagonal elements are equal as they always are in any 
symmetric lattice. Comparison of this property with elements of the transformation 
matrix expressed in terms of betatron functions (8.74) shows that the solution of the 
betatron function is periodic and symmetric since a = 0 both at the beginning and 
the end of the lattice period. We therefore have symmetry planes in the middle of 
the quadrupoles for the betatron functions in the horizontal as well as in the vertical 
plane. The betatron functions then have the general periodic and symmetric form as 
shown in Fig. 10.2. 

From (8.22) and (10.1), we can derive the analytical expression for the periodic 
and symmetric betatron function by setting Bp = 6B, a = 0 and yo = 1/6 and get 


2\2 2 
p= (1 -25) p+ 4r(1 +5) (10.2) 


\ FODO Cell 


Fig. 10.2 Periodic betatron functions in a FODO channel 
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where f > 0 and f is the value of the betatron function in the middle of the focusing 
quadrupole QF. Solving for 6, we get 


ff 
TEA 1 
Atay, aa _ ea 2. (10.3) 
2 eo 
val 
where we define the FODO parameter « by 
k= f >1 (10.4) 


and set B = B* to indicate the solution in the center of the focusing quadrupole. The 
FODO parameter « is used only here and should not be identified with our general 
use of this letter being the curvature. Had we started at the defocusing quadrupole 
we would have to replace f by —f and get analogous to (10.3) for the value of the 
betatron function in the middle of the defocusing quadrupole 


pa X=) | (10.5) 
K>—1 

These are the solutions for both the horizontal and the vertical plane. In the 
middle of the horizontally focusing quadrupole QF (f > 0) we have 8, = B* and 
fh, = B- and in the middle of the horizontally defocusing quadrupole QD (f < 0), 
we have 6, = B~ and fy = Bt. From the knowledge of the betatron functions 
at one point in the lattice, it is straightforward to calculate the value at any other 
point by proper matrix multiplications as discussed earlier. In open arbitrary beam 
transport lines the initial values of the betatron functions are not always known and 
there is no process other than measurements of the actual particle beam in phase 
space to determine the values of the betatron functions as discussed in Sect. 8.1.3. 
The betatron functions in a periodic lattice in contrast are completely determined 
by the requirement that the solution be periodic with the periodicity of the lattice. It 
is not necessary that the focusing lattice be symmetric to obtain a unique, periodic 
solution. Equation (8.22) can be used for any periodic lattice requiring only the 
equality of the betatron functions at the beginning and at the end of the periodic 
structure. Of course, not any arbitrary although periodic arrangement of quadrupoles 
will lead to a viable solution and we must therefore derive conditions for periodic 
lattices to produce stable solutions. 

The betatron phase for a FODO cell can be derived by applying (8.74) to a 
symmetric lattice. With a9 = a = 0 and fo = B this matrix is 


( cos Js8 (10.6) 


a sing cosd 
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where ¢ is the betatron phase advance through a full symmetric period. Since the 
matrix (10.6) must be equal to the matrix (10.1) the phase must be 


(10.7) 

or 

. 1 

sintS=—-. (10.8) 
K 


For the solution (10.8) to be real the parameter « must be larger than unity, a 
result which also becomes obvious from (10.3), (10.5) . This condition is equivalent 
to stating that the focal length of half a quadrupole in a FODO lattice must be longer 
than the distance to the next quadrupole. 

The solutions for periodic betatron functions depend strongly on the quadrupole 
strengths. Specifically, we observe that (10.3) has minimum characteristics for B*. 
Taking the derivative dB* /d« = 0, (10.3) becomes 


kg —Ko -1=0, (10.9) 
which can be solved for 
Ko = 5+ V7+1= 1.6180. (10.10) 
The optimum phase advance per FODO cell is therefore 
goo © 76.345°. (10.11) 


The maximum value of the betatron function reaches a minimum for a FODO 
lattice with a phase advance of about 76.3° per cell. Since beam sizes scale with 
the square root of the betatron functions, a lattice with this phase advance per cell 
requires the minimum beam aperture. 

This criteria, however, is true only for a flat beam when €, >> €, or €, > €,. For 
a round beam with uniform particle distribution in phase space €, ~ €, and we get 
for the maximum beam acceptance by minimizing the beam diameter or E? + # ~ 
f,.+ By), where E, and Ey are the beam envelopes in the horizontal and vertical plane, 
respectively (Fig. 10.3). This minimum is determined by d(6, + f,)/d« = 0, or for 


Kop = V2 (10.12) 
and the optimum betatron phase per cell is then from (10.8) 


Popt = 90°. (10.13) 
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Fig. 10.3 Maximum beam 
acceptance of a FODO lattice 
with a circular aperture of 
radius R and where 


Exy = V €xyBxy 


This solution requires the minimum radial aperture R in quadrupoles for a beam 
with equal beam emittances in both planes €, = €, = e. The betatron functions in 
the middle of the quadrupoles are then simply 


Bay = L(2+ V2), 


be = 12-3). (10.14) 


The beam envelopes are E, = Past and Ey = ~ €Bopr and the maximum beam 


emittance to fit an aperture of radius R or the acceptance of the aperture can be 
determined from 


Ey + By = R’ = €(B* + B ope: (10.15) 
= 4L and the acceptance of a FODO channel 


From (10.14) we find (B* + BP xe 
with an aperture radius R becomes 


R2 

Emax = 40 (10.16) 
Of course, this definition of the acceptance is true only for a monochromatic beam. 
In a real beam we must include the dispersion and energy spread in the beam to find 
the optimum acceptance. Also there are other particle distributions for which this 

optimisation may not be quite accurate. 
With this optimum solution we may develop general scaling laws for the betatron 
functions in a FODO lattice. The values of the betatron functions need not be known 
at all points of a periodic lattice to characterize the beam optical properties. It is 
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Fig. 10.4 Scaling of 
horizontal and vertical 
betatron functions in a FODO 
lattice 


sufficient to know these values at characteristic points like the symmetry points in a 
FODO channel, where the betatron functions reach maximum or minimum values. 
From (10.3), (10.14) the betatron functions at these symmetry points are given by 


pt = k (k+l) 
- Cae (10.17) 


Bopt (2—/2)V«2=1 


The scaling of the betatron function is independent of L and depends only on the 
ratio of the focal length to the distance between quadrupoles « = f /L. In Fig. 10.4 
the betatron functions B* and B~ are plotted as a function of the FODO parameter k. 

The distance L between quadrupoles is still a free parameter and can be adjusted 
to the needs of the particular application. We observe, however, that the maximum 
value of the betatron function varies linear with L and the maximum beam size in a 
FODO lattice scales like JT. 


10.1.2. Betatron Motion in Periodic Structures 


For the design of circular accelerators it is of fundamental importance to understand 
the long term stability of the beam over many revolutions. Specifically we need to 
know if the knowledge of beam dynamics in one periodic unit can be extrapolated 
to many periodic units. 
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Stability Criterion 


The periodic solution for one FODO cell has been derived in the last section and we 
expect that such periodic focusing cells can be repeated indefinitely. Following the 
classic paper by Courant and Snyder [1], we will derive the stability conditions for 
an indefinite number of periodic but not necessarily symmetric focusing cells. The 
structure of the cells can be arbitrary but must be periodic. If M(z + 2L|z) is the 
transformation matrix for one cell, we have for N cells 


M(z 4+ N2L|z) = [M(z+ 2L|2)]”. (10.18) 


Stable solutions are obtained if all elements of the total transformation matrix 
stay finite as N increases indefinitely. To find the conditions for this we calculate 
the eigenvalues A of the characteristic matrix equation. The eigenvalues A are a 
measure for the magnitude of the matrix elements and therefore finite values for the 
eigenvalues will be the indication that the transformation matrix stays finite as well. 
The characteristic matrix equation 


(M —ATZ)x =0, (10.19) 


where Z is the unity matrix. For nontrivial values of the eigenvectors (x # 0) the 
determinant 


Cc-A S 


IM - 27) =| Sa 7 


| =0 (10.20) 


must vanish and with CS’ — SC’ = 1 the eigenvalue equation is 
A2-(C+S8)A4+1=0. (10.21) 


The solutions are 


M2 =4(C+S) 4 V(C+ 8) -1 (10.22) 


or with the substitution $(C + S’) = cos¢ 
Ai2 =cosd +ising =e’. (10.23) 
The betatron phase ¢ must be real or the trace of the matrix MM must be 


TrM}=C+S' < 2. (10.24) 
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On the other hand, the transformation matrix for a full lattice period is 


Mm (one asing B sind Mi (10.25) 


—ysin@ cosd—asing 


which can be expressed with 7 = ( fae ) by 


M=Tcos¢+ Jsing. (10.26) 


This matrix has the form of Euler’s formula for a complex exponential. Since the 
determinant of M is unity we get yB — a? = 1 or J* = —Z. Similar to Moivre’s 
formula, for N equal periods 


MN = (Lcos¢ + J sing)” = Icos (Nd) + J sin (No) (10.27) 
and the trace for N periods is bounded if cos @ < 1 or if (10.24) holds or if 
Tr (M™) = 2cos(N@) < 2. (10.28) 


This result is called the stability criterion for periodic beam transport lattices. We 
note that the trace of the transformation matrix M does not depend on the reference 
point z. To show this we consider two different reference points z; and z2, where 
Z1 < 22, for which the following identities hold 


M(zo + 2L| 21) = M(z2| 21) M(q + 2L| 21) = M (zo + 2L| 2) M(z| 21) 
(10.29) 


and solving for M(z) + 2L|z2) we get 
M (2 + 2L| 22) = M(z2| 21) M(z, + 2L| z1) M71 (z9| Zz). (10.30) 
This is a similarity transformation and therefore, both transformation matrices 


M(z2+2L| z2) and M(z;-+2L| z;) have the same trace and eigenvalues independent 
of the choice of the location z. 


10.1.3, General FODO Lattice 


So far we have considered FODO lattices, where both quadrupoles have equal 
strength, fi = —f2 = f. Since we made no use of this in the derivation of the 
stability criterion for betatron functions we expect that stability can also be obtained 
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for unequal quadrupoles strengths. In this case the transformation matrix of half a 
FODO cell is 


1 0 LL 1 0 1-4 L 
my (4 (0) (49)- (EE). onan 
° B ji FF 1B 


where 1/f* = +1/fi+1/fp—-L/(fi fz). Multiplication with the reverse matrix gives 
for the full transformation matrix of the FODO cell 


1-24 2z(i-£# 
M = ( ; f ( ‘)) . (10.32) 
-A1-£ 1-24 
f* fi f* 
The stability criterion 
4L 
Tr{M} = b a Pe <2 (10.33) 
is equivalent to 
L 
O< Fe <1. (10.34) 


To determine the region of stability in the (u, v)-plane, where u = L/f, and 
v = L/f, we get from (10.34) the condition 


O<u+tv—u <1, (10.35) 


where u and v can be positive or negative. Solving the second inequality for either u 
or v we find the conditions |u| < 1 and |v| < 1. With this, the first inequality can be 
satisfied only if u and v have different signs. The boundaries of the stability region 
are therefore given by the four equations 


(10.36) 


defining the stability region shown in Fig. 10.5 which is also called the necktie 
diagram because of its shape. Due to the full symmetry in |u| and |v| the shaded 
area in Fig. 10.5 is the stability region for both the horizontal and vertical plane. 
For convenience, we used the thin lens approximation to calculate the necktie 
diagram. Nothing fundamentally will, however, change when we use the transfor- 
mation matrices for real quadrupoles of finite length except for a small variation of 
the stability boundaries depending on the degree of deviation from the thin lens 
approximation. With the general transformation matrix for a full FODO period 
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Fig. 10.5 Necktie diagram \ 
lv | 
1 
0.5 
stable solutions 
|u| 
0 — 
0 0.5 1 
Ma | the setotie seiutea tortie b function is B2 = —<, and 
=log the periodic solution for the betatron function is B~ = I-q@ an 
the stability condition 
TrM = |C+S'| <2. (10.37) 


The stability diagram has still the shape of a necktie although the boundaries are 
slightly curved (Fig. 10.5). 

A general transformation matrix for half a FODO cell can be obtained in matrix 
formalism with y = /ké by multiplying the matrices 


cosh { sinh 
Mal,” Wo Yr ion) 
2 7 sinh Wo cosh p 01 
Oe 
cos — sin 
ee ee). (10.38) 
—7, sin cos Wy 


where now L is not the half cell length but just the drift space between two adjacent 
quadrupoles of finite length and the indices refer to the first and the second half 
quadrupole, respectively. From this we get the full period transformation matrix by 
multiplication with the reverse matrix 


Obviously the mathematics becomes elaborate although straight forward and it 
is prudent to use computers to find the desired results. 
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Table 10.1 FODO cell parameters 
Example 

Energy, E (GeV) 

Half cell length, L (m) 
Quadrupole length, £4 (m) 


Bending magnet length, ¢, (m) 3:55 2.486 2:5 99.24 
Phase advance per cell, y 101.4 108.0 135.0 90.0 
Quadrupole strength*, k (m~*) tee tee tee tee 
Lattice type” (FODO) sf cf sf sf 


“These parameters will be determined in Problem 6.1 
bsf separated function, cf combined function lattice 


(m) betatron functions 


By 


z(m) 


Fig. 10.6 FODO lattice for one octant of a synchrotron [2, 3] 


As reference examples to study and discuss a variety of accelerator physics issues 
in this text, we consider different FODO lattices (Table 10.1) which are of some 
but definitely not exhaustive practical interest. Other periodic lattices are of great 
interest as well specifically for synchrotron radiation sources but are less accessible 
to analytical discussions than a FODO lattice. All examples except #2 are separated 
function lattices. 

Example #1 is that for a 10 GeV electron synchrotron at DESY [2, 3] representing 
a moderately strong focusing lattice with a large stability range as is commonly used 
if no extreme beam parameters are required as is the case for synchrotrons used to 
inject into storage rings. Figure 10.6 shows the betatron functions for this lattice. We 
note small deviations from a regular FODO lattice which is often required to make 
space for other components. Such deviations from a regular lattice cause only small 
perturbations in the otherwise periodic betatron functions. As example #2 we use the 
lattice for the long curved beam transport lines leading the 50 GeV beam from the 
linac to the collision area at the Stanford Linear Collider [4]. This lattice exhibits the 
greatest deviation from a thin lens FODO channel as shown in Fig. 10.7. Example 
#3 resembles a theoretical lattice for an extremely small beam emittance used to 
study fundamental limits of beam stability and control of aberrations [7]. Lattices 
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betatron functions 

20 4 B | 
| 

[m] | 
I 

10 4 
| 

I 

| | 

0 
0 1 2 3 4 — s[m] Sy 

| 

| 

| 

% BF BD 4 BE | 


Fig. 10.7. FODO cell for a linear collider transport line [5, 6] (example #2 in Table 10.1) 


for future very high energy hadron colliders in the TeV range use rather long FODO 
cells leading to large values of the betatron and dispersion functions and related high 
demands on magnet field and alignment tolerances. Arc lattice parameters for the 
20 TeV Superconducting Super Collider, SSC are compiled as example #4. 


10.2. Beam Dynamics in Periodic Closed Lattices 


In the previous section, we discussed the beam dynamics in a FODO lattice and 
we will use such periodic lattices to construct a closed path for circular accelerators 
like synchrotrons and storage rings. The term “circular” is used in this context rather 
loosely since such accelerators are generally composed of both circular and straight 
sections giving the ring the appearance of a circle, a polygon or racetrack. Common 
to all these rings is the fact that the reference path must be a closed path so that 
the total circumference of the ring constitutes a periodic lattice that repeats turn for 
turn. 


10.2.1 Hill’s Equation 


The motion of particles or more massive bodies in periodic external fields has been 
studied extensively by astronomers in the last century specially in connection with 
the three body problem. In particle beam dynamics we find the equation of motion 
in periodic lattices to be similar to those studied by the astronomer Hill. We will 
discuss in this chapter the equation of motion, called Hill’s equation its solutions 
and properties. 
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Particle beam dynamics in periodic systems is determined by the equation of 
motion 


u’ + K(z)u=0, (10.39) 
where K(z) is periodic with the period Lp 

K(z) =K(z+L)). (10.40) 
The length of a period L, may be the circumference of the circular accelerator 
lattice or the length of a superperiod repeating itself several times around the 
circumference. The differential equation (10.39) with the periodic coefficient 


(10.40) has all the characteristics of a Hill’s differential equation [8]. The solutions 
of Hill’s equation and their properties have been formulated in Floquet’s theorem 


¢ two independent solutions exist of the form 


u(z) = w(z) e#2/20 , 


u2(z) = w* (z) eth (10.41) 


* w*(z) is the complex conjugate solution to w(z). For all practical cases of beam 
dynamics we have only real solutions and w*(z) = w(z); 
¢ the function w(z) is unique and periodic in z with period Lp 
w(z + Lp) = w(z); (10.42) 
e is acharacteristic coefficient defined by 
cos = $Tr [M (z + Lp Iz)] : (10.43) 
¢ the trace of the transformation matrix M is independent of z 
Tr[M(z + Lplz]) FO; (10.44) 
e the determinant of the transformation matrix is equal to unity 
det M = 1; (10.45) 
e the solutions remain finite for 
+Tr[M(z + Lplz)| < 1. (10.46) 


The amplitude function w(z) and the characteristic coefficient jz can be correlated 
to quantities we have derived earlier using different methods. The transformation 
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of a trajectory u through one lattice period of length L, must be equivalent to the 
multiplication by the transformation matrix (10.25) for that period which gives 


u(z + Ly) = (cosw + @ siny) u(z) + B siny u’'(z), (10.47) 


where u stands for any of the two solutions (10.41) and w is the betatron phase 
advance for the period. From (10.41), (10.42) we get on the other hand 


u(z + Ly) = u(z)e*" = u(z) (cose +i sin). (10.48) 
Comparing the coefficients for the sine and cosine terms we get 
cos W = cos or vw=p (10.49) 
and 
a u(z) + Bul(z) = £iu(z). (10.50) 


The first equality can be derived also from (10.25) and (10.43). Equation (10.50) 
can be further simplified by a logarithmic differentiation 


ul’ u’ p’ a! 
_ = —— — : (10.51) 
ul Uu Bp +i-a 
On the other hand, we can construct from (10.39), (10.50) the expression 
u" ul = KB ti-a@ (10.52) 
uo uw) ti-a Bp ; 
and equating the r.h.s. of both expressions (10.51) and (10.52), we find 
(l—o* — Kp? +0'B —a fp’) +iQa+ fp’) =0, (10.53) 


where all functions in brackets are real as long as we have stability. Both brackets 
must be equal to zero separately with the solutions 


p' =-2a, (10.54) 
and 
a =KB-y. (10.55) 
Equation (10.54) can be used in (10.50) for 


ws tia i 1B’ 
= =+—+4 2 10.56 
: SB BYOB ee 
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and after integration 
log — = =+i [F ele poe 5 (10.57) 


where up = u(zo) and Bo = B(zo) for z = zo. Solving for u(z) we get the well 
known solution 


u(z) =a VB(2 et” , (10.58) 
where a = uo/ VBo and 
W(Z—%) = ‘ a é (10.59) 
With w(Lp) = pw and 
joe = we (10.60) 


we find the previous definitions of the betatron functions to be consistent with the 
coefficients of Floquet’s solutions in a periodic lattice. In the next section we will 
apply the matrix formalism to determine the solutions of the betatron functions in 
periodic lattices. 


10.2.2. Periodic Betatron Functions 


Having determined the existence of stable solutions for particle trajectories in 
periodic lattices we will now derive periodic and unique betatron functions. For 
this we take the transformation matrix of a full lattice period 


Me+inlz=(6 5) (10.61) 


and construct the transformation matrix for betatron functions. 


B Cc 2cy = Bo Bo 
a | =| —-CC’ CS'+C’S-SS’ | | a |= Meg | a |. (10.62) 
y ce 900 3 Yo Yo 


Because of the quadratic nature of the matrix elements, we find the same result 
in case of a 180° phase advance for the lattice segment. Any such lattice segment 
with a phase advance of an integer multiple of 180° is neutral to the transformation 
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of lattice functions. This feature can be used to create irregular insertions in a lattice 
that do not disturb the lattice functions outside the insertions. 

To obtain from (10.62) a general periodic solution for the betatron functions we 
simply solve the eigenvector equation 


(Mg —Z)B = 0. (10.63) 
The solution can be obtained from the component equations of (10.63) 


(CC? —1)B-2S8Ca+S’y =0, 
CC’ B — (S.C + CS'—1)a + SS'y =0, (10.64) 
C? B—2S'C'a + (S?-1)y =0. 


A particular simple solution is obtained if the periodic lattice includes a 
symmetry point. In this case, we define this symmetry point as the start of the 
periodic lattice with a = 0, and get the simple solutions 


Ns 1 
2 _ = 
B = 7G’ a=0, y=r. (10.65) 


The transformation matrix for a superperiod or full circumference of a ring becomes 
then simply from (8.74) 


Mz ( COS |L (10.66) 


_ Sin [LCOS [L 


where ju is the phase advance for the full lattice period. The solutions are stable as 
long as the trace of the transformation matrix meets the stability criterion (10.37) or 
as long as uw # nz, where n is an integer. 

Different from an open transport line, well determined and unique starting values 
for the periodic betatron functions exist in a closed lattice due to the periodicity 
requirement allowing us to determine the betatron function anywhere else in the 
lattice. Although (10.65) allows both a positive and a negative solution for the 
betatron function, we choose only the positive solution for the definition of the 
betatron function. 

Stable periodic solutions for asymmetric but periodic lattices, where a # 0, 
can be obtained in a straightforward way from (10.64) as long as the determinant 
IM, —Z| #0. 

The betatron phase for a full turn around a circular accelerator of circumference 
C is from (10.59) 


ztLco d 
(Lc) = i a ; (10.67) 
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If we divide this equation by 27 we get a quantity v which is equal to the number 
of betatron oscillations executed by particles traveling once around the ring. This 
number is called the tune or operating point of the circular accelerator. Since there 
are different betatron functions in the horizontal plane and in the vertical plane, we 
also get separate tunes in a circular accelerator for both planes 


1 de 
xy — a : 10.68 
ie Bo ee 


This definition is equivalent to having chosen the integration constant in (8.57) 
equal to 1/27 instead of unity. Yet another normalization can be obtained by 
choosing 1/v for the integration constant in ( 8.57), in which case the phase 
defined as 


_wW@ _ f% dé 
p(z) = . =| TBO (10.69) 


varies between 0 and 27 along the circumference of a ring lattice. This normaliza- 
tion will become convenient when we try to decompose periodic field errors in the 
lattice into Fourier components to study their effects on beam stability. 

Equation (10.68) can be used to get an approximate expression for the relation- 
ship between the betatron function and the tune. If B is the average value of the 
betatron function around the ring then (Lc) = 22v © Lc/B © 20R/f or 


p= (10.70) 


os 


This equation is amazingly accurate for most rings and is therefore a useful tool for 
a quick estimate of the average betatron function or for the tunes often referred to 
as the smooth approximation. 

In a circular accelerator three tunes are defined for the three degrees of freedom, 
the horizontal, vertical and longitudinal motion. In Fig. 10.8 the measured frequency 


Cee ee ee a ee ee mers eee Ee hg SA i = Aa LL 


Tenegeet 
ees 


v 2v, v v-v 


Fig. 10.8 Frequency spectrum from a circulating particle beam, v, synchrotron tune, v,, vy 
betatron tunes, v, =£ vy satellites 
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spectrum is shown for a particle beam in a circular accelerator. The electric signal 
from an isolated electrode in the vacuum chamber is recorded and connected to a 
frequency analyzer. The signal amplitude depends on the distance of the passing 
beam to the electrode and therefore includes the information of beam oscillations as 
a modulation of the revolution frequency. 

Synchrotron oscillations can also be detected with electrodes and the signal from 
synchrotron oscillations appears on a spectrum analyzer as sidebands to harmonics 
of the revolution frequency. Analogous to the transverse motion, a longitudinal tune 
vs is defined as the number of oscillations per revolution or as the synchrotron tune. 

We note a number of frequencies in the observed spectrum of the storage ring 
SPEAR as shown in Fig. 10.8. At the low frequency end two frequencies indicate 
the longitudinal tune v, and its first harmonic at 2v,. The two large signals are the 
horizontal and vertical tunes of the accelerator. Since the energy oscillation affects 
the focusing of the particles, we also observe two weak satellite frequencies on one 
of the transverse tunes at a distance of -+-v,. The actual frequencies observed are not 
directly equal to v wo, where wo is the revolution frequency, but are only equal to the 
non-integral part of the tune Av wo, where Av is the distance to the integer nearest 
to v. 


10.2.3 Periodic Dispersion Function 


The dispersion function can be periodic if the lattice is periodic. In this section we 
will determine the periodic solution of the dispersion function first for the simple 
lattice building block of a FODO channel and then for general but periodic lattice 
segments. 


Scaling of the Dispersion in a FODO Lattice 


Properties of a FODO lattice have been discussed in detail for a monochromatic 
particle beam only and no chromatic effects have been taken into account. To 
complete this discussion we now include chromatic effects which cause, in linear 
approximation, a dispersion proportional to the energy spread in the beam and 
are caused by bending magnets. We have used the transformation matrix for a 
symmetric quadrupole triplet as the basic FODO cell. The bending magnet edge 
focusing was ignored and so were chromatic effects. In the following we still ignore 
the quadratic edge focusing effects of the bending magnets, but we cannot ignore 
any longer linear effects of energy errors. For simplicity we assume again thin lenses 
for the quadrupoles and get for the chromatic transformation matrix through half a 
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FODO cell, +QF -B- +QD with (8.101) and assuming small deflection angles 


100\ f1LyP 1 00 
Miropo =|[ 1/f lo} ]ol = —1/f10 
: 0 01 00 1 0 01 
or after multiplication 
L 1 72 
My pono =| -4 1+4% # (1+ 4) (10.71) 
0 0 1 


The absolute value of the focal length f is the same for both quadrupoles but 
since we start at the symmetry point in the middle of a quadrupole this focal length 
is based only on half a quadrupole. We have also assumed that the deflection angle 
of the bending magnet is small, 9 < 1, in analogy to thin lens approximation for 
quadrupoles. Lastly, we assumed that the bending magnets occupy the whole drift 
space between adjacent quadrupoles. This is not quite realistic but allows us an 
analytical and reasonable accurate approach. 

In Sect. 8.4 dispersive elements of transformation matrices have been derived. In 
periodic lattices, however, we look for a particular solution which is periodic with 
the periodicity of the focusing lattice and label the solution by n(z) or the 7-function 
in distinction from the ordinary, generally non-periodic dispersion function D(z). 
The typical form of the periodic dispersion function in a FODO lattice is shown in 
Fig. 10.9. 

In addition to being periodic, this 7-function must be symmetric with respect to 
the symmetry points in the middle of the FODO quadrupoles, where the derivative 
of the n-function vanishes. The transformation through one half FODO cell is 


1 a” 
Oe M1 Fopo | 0 j (10.72) 
1 
Fig. 10.9 Dispersion 27 
function in FODO cells y-function 


(example {{1 in Table 10.1) (m) J 
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where we have set 6 = 1| in accordance with the definition of dispersion functions 
and deflection in the horizontal plane. 

In the particular arrangement of quadrupoles chosen in (10.71) the focusing 
quadrupole is the first element and the dispersion function reaches a maximum 
value n* there. In the center of the defocusing quadrupole the dispersion function is 
reduced to a minimum value 7”. The opposite sequence of quadrupoles would lead 


to similar results. From (10.72) we get with n/t = n/— = 0 the two equations 
= L + i? 
n= (1 = t) UMS eee 
uae a (10.73) 
= PR nt + (1 + 4) 


Solving (10.73) for the periodic dispersion function in the middle of the FODO 
quadrupoles, where 7’ = 0, we get in the focusing or defocusing quadrupole 
respectively 


tal (1+ 4)=£nQe+ 


7 i) 

2 (10.74) 
ae fe L\ _ PB 
1 = “bo (1-4) ag * R= 1) 


where k = f /L. 

As mentioned before, in this approximation the bending magnet is as long as the 
length of half the FODO cell since the quadrupoles are assumed to be thin lenses 
and no drift spaces have been included between the quadrupoles and the bending 
magnet. The bending radius po, therefore, is equal to the average bending radius in 
the FODO lattice. From the known values of the dispersion function at the beginning 
of the FODO lattice we can calculate this function anywhere else in the periodic cell. 
Similar to the discussion in Sect. 10.1, we chose an optimum reference lattice, where 


Ko = V2, (10.75) 
and 


mn = (44+ V2), 

_ 2p (10.76) 

No = 35 (4-2). 

In Fig. 10.10 the values of the dispersion functions, normalized to those for the 

optimum FODO lattice in the middle of the FODO quadrupoles, are plotted versus 
the FODO cell parameter x. 

From Fig. 10.10 we note a diminishing dispersion function in a FODO cell as 
the betatron phase per cell or the focusing is increased (f — 0). This result will 
be important later for the design of storage rings for specific applications requiring 
either large or small beam emittances. The procedure to determine the dispersion 
functions in a FODO cell is straightforward and can easily be generalized to real 
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Fig. 10.10 Scaling of the 10.0 — 
dispersion function in a t/q 
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FODO lattices with finite quadrupole length and shorter bending magnets although 
it may be desirable to perform the matrix multiplications on a computer. For 
exploratory designs of accelerators structures, however, the thin lens approximation 
is a powerful and fairly accurate design tool. 


General Solution for the Periodic Dispersion 


In the previous section the dispersion function for a periodic and symmetric FODO 
lattice was derived. Many periodic lattice structures, however, are neither symmetric 
nor are they pure FODO structures and therefore we need to derive the periodic 
dispersion function in a more general form. To do this, we include in the equation 
of motion also the linear energy error term from, for example, (5.46) 


ul" + K(z)u = ko(z)6. (10.77) 


For particles having the ideal energy (6 = 0) the right hand side vanishes and the 
solutions are composed of betatron oscillations and the trivial solution 


uo(z) = 0. (10.78) 


This trivial solution of (10.77) is clearly periodic and represents what is called 
in beam transport systems the ideal path and in circular accelerators the equilibrium 
orbit or closed orbit about which particles perform betatron oscillations. The 
expression for the ideal equilibrium orbit is this simple since we decided to use a 
curvilinear coordinate system which follows the design orbit (10.78) as determined 
by the placement of bending magnets and quadrupoles. 
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For off momentum particles (6 4 0) the ideal path or closed orbit is displaced. 
Ignoring for a moment the z-dependence of K and ko, this systematic displacement 
of the orbit is of the order of 


Aga "3 (10.79) 
K 


as suggested by (10.77) . In a real circular accelerator we expect a similar although 
z-dependent displacement of the equilibrium orbit for off momentum particles. Only 
one equilibrium orbit exists for each particle energy in a given closed lattice. If there 
were two solutions uw, and wu of (10.77) we could write for the difference 


(uy — uz)" + K(z) (uy, — uw) = 0, (10.80) 


which is the differential equation for betatron oscillations. Different solutions for 
the same energy, therefore, differ only by energy independent betatron oscillations 
which are already included in the general solution as the homogeneous part of 
the differential equation (10.77). Therefore, in a particular circular lattice only one 
unique equilibrium orbit or closed orbit exists for each energy. 

Chromatic transformation matrices have been derived in Sect. 8.4. If we apply 
these 3 x 3-matrices to a circular lattice and calculate the total transformation matrix 
around the whole ring, we will be able to determine a self-consistent solution for 
equilibrium orbits. Before we calculate the periodic equilibrium orbits, we note that 
the solutions of (10.77) are proportional to the momentum deviation 5. We therefore 
define the generalized periodic dispersion function as the equilibrium orbit for 6 = 1 
which we call the y-function. The transformation matrix for a periodic lattice of 
length L, is 


C(c+ Lp) S(z+Lp) D(z + Lp) 
M(z+ Lp |z) = | C’ (z+ Lp) 8’ (z+ Lp) D! (z+ Ly) (10.81) 
0 0 1 


and we get for the 7-function with n(z + Ly) = n(z), n'(z + Lp) = n'(z) 


n(z) = C(z+ Lp) n(z) + SZ+ Lp) + DZ+ Ly), 


10.82 
f@ = CG+ 1p) 1@ 4+ StL) 1@O+D e+ Ly). re 


These two equations can be solved for n(z) and n/(z), the periodic dispersion 
function at the point z. The equilibrium orbit for any off momentum particle can 
be derived from this solution by multiplying with 6 


us(z) = n(z) 6. (10.83) 
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In a more formal way the periodic solution for the dispersion function can be 
derived from (10.82) while we drop the arguments for increased clarity 


(C—1)n+ Sx’ + D=0, 
10.84 
C’n + (S’—1)n' + D' = 0, ( ) 


which, in vector notation is 
(M,—-Z)n = 0, (10.85) 


where M, is defined by (10.81) and 7 = (7, 7’, 1). The periodic dispersion function 
is therefore the eigenvector of the eigenvalue equation (10.85) . 

A particularly simple result is obtained again if the point z is chosen at a 
symmetry point, where Heo = 0. In this case the dispersion function at the 
symmetry point is 


D 
Nsym = T-c and ae =O: (10.86) 
Once the values of the y-functions are known at one point it is straightforward to 
obtain the values at any other point in the periodic lattice by matrix multiplication. 
We may also try to derive an analytical solution for the periodic dispersion from 
the differential equation 


n +Kn=k. (10.87) 


The solution is again the composition of the solutions for the homogeneous 
and the inhomogeneous differential equation. First, we transform (10.87) into 
normalized coordinates w, = n/ VB and dg = dz/(vf). In these coordinates 
(10.87) becomes 


dw, 


ip? + vw, = v2 Bx = v?F). (10.88) 


An analytical solution to (10.88) has been derived in Sect.5.5.4 and we have 
accordingly 


W7(P~) = Won COS VY + — sin vp 

+v ff F(t) sinv(g — 1) dr, 
*n(g) = —Wop Sin ve + aa COS VP 

+v fj F(t) cos v(y —t) dr, 


(10.89) 


where we have set w = ie w(g). To select a periodic solution, we set 


WwW, (270) = wy (0) = Won and W, (27) = Won - 
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Inserting these boundary conditions into (10.89) to determine (won, Won) and use 


the results in the first equation of (10.89) to get the general periodic solution for the 
normalized dispersion function after some manipulations 


gt+2n 
w(~) = | F(t) cos[v(g —t + m)| dt. (10.90) 


2sin wv 


Now we return to the original variables (7, z), and get from (10.90) the equation 
for the periodic dispersion or 7-function 


VB) [~ VB) 
z p(s) 


This solution shows clearly that the periodic dispersion function at any point 
z depends on all bending magnets in the ring. We also observe a fundamental 
resonance phenomenon which occurs should the tune of the ring approach an integer 
in which case finite equilibrium orbits for off momentum particles do not exist 
anymore. To get stable equilibrium orbits, the tune of the ring must not be chosen to 
be an integer or in accelerator terminology an integer resonance must be avoided 


H(z) = 


=a cos v[p(z) — y(S) + mJ] dg. (10.91) 
sin ZV 


vén, (10.92) 


where 7 is an integer. 

This is consistent with the solution (10.86) demanding that |c (z+ Ly)| be less 
than unity. Since C is the matrix element for the total ring we have C = cos2zv 
which obviously is equal to +1 only for integer values of the tune v. While (10.89) 
is not particularly convenient to calculate the dispersion function, it clearly exhibits 
the resonance character and will be very useful later in some other context, for 
example, if we want to determine the effect of a single bending magnet. 

Another way to solve the differential equation (10.88) will be considered to 
introduce a powerful mathematical method useful in periodic systems. We note that 
the perturbation term F(z) = */?(z) xk (z) is a periodic function with the period 
Ly or 27% using normalized coordinates. The perturbation term can therefore be 
expanded into a Fourier series 


Pee=> Fe”, (10.93) 


where 


1 : 
F, = — p Bo? ei"? dp (10.94) 
20 
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or if we go back to regular variables 


eee: VBE) — ing) 
(=o0 a (10.95) 


Similarly, we may expand the periodic 7-function into a Fourier series 
WO = > Wine (10.96) 
Using both (10.93), (10.96) in (10.88) we get 
(=n? + v2) S0 Wei"? = v? ye (10.97) 


which can be solved for the Fourier coefficients W,,, of the periodic dispersion 
function 


ae! ge 
Win = on (10.98) 
The periodic solution of the differential equation (10.88) is finally 
+o 12 in 
v°F,e”? 
w,() = Be ae. (10.99) 


It is obvious again, that the tune must not be an integer to avoid a resonance. This 
solution is intrinsically periodic since @ is periodic and the relation to (10.90) can be 
established by replacing F’, by its definition (10.94). Using the property F_, = F, 
we get for a symmetric lattice and with formula GR[1.445.6]! 


e a Y Px (6) J/BOe mde 


7 ae 


Wn) (10.100) 


n=—oo 


Lyn cosm(t = 9) 
2 § 0 VBR] s+ SE | 


n=1 


fr (© VBO costulp —¢ + xpat 


2 sin vit 


which is the same as (10.90) since dt = v6 dé. For an asymmetric lattice the proof is 
similar albeit somewhat more elaborate. Solution (10.100) expresses the dispersion 


'We will abbreviate in this way formulas from the Table of Integrals, Series and Products, LS. 
Gradshteyn/I.M. Ryzhik, 4th edition. 
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function as the combination of a constant and a sum of oscillatory terms. Evaluating 
the non-oscillatory part of the integral, we find the average value of the dispersion 
or 7-function, 


(q) (10.101) 


This result by itself is of limited usefulness but can be used to obtain an estimate 
for the momentum compaction factor a, defined analogous to (8.125) by 


1p Men() ove 


A good approximation for the momentum compaction factor is therefore a, ~ 
(B)/(ov) and with (10.70) integrated only over the arcs of the ring 


1 
Ae >. (10.103) 
v 
Thus we find the interesting result that the transition energy ); is approximately 
equal to the horizontal tune of a circular accelerator 


Vp Vy. (10.104) 


As a cautionary note for circular accelerators with long straight sections, only the 
tune of the arc sections should be used here since straight sections do not contribute 
to the momentum compaction factor but can add significantly to the tune. 


10.2.4 Periodic Lattices in Circular Accelerators 


Circular accelerators and long beam transport lines can be constructed from 
fundamental building blocks like FODO cells or other magnet sequences which 
are then repeated many times. Any cell or lattice unit for which a periodic solution 
of the lattice functions can be found may be used as a basic building block for a 
periodic lattice. Such units need not be symmetric but the solution for a symmetric 
lattice segment is always periodic. 

FODO cells as elementary building blocks for larger beam transport lattices 
may lack some design features necessary to meet the objectives of the whole 
facility. In a circular accelerator we need for example some component free spaces 
along the orbit to allow the installation of experimental detectors or other machine 
components like accelerating sections, injection magnets or synchrotron radiation 
producing insertion devices. A lattice made up of standard FODO cells with bending 
magnets would not provide such spaces. 
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The lattice of a circular accelerator therefore exhibits generally more complexity 
than that of a simple FODO cell. Often, a circular accelerator is made up of a 
number of superperiods which may be further subdivided into segments with 
special features like dispersion suppression section, achromatic sections, insertions, 
matching sections or simple focusing and bending units like FODO cells. To 
illustrate basic lattice design concepts, we will discuss specific lattice solutions to 
achieve a variety of objectives. 


Synchrotron Lattice 


For a synchrotron whose sole function is to accelerate particles the problem of 
free space can be solved quite easily. Most existing synchrotrons are based on a 
FODO lattice recognizing its simplicity, beam dynamical stability and efficient use 
of space. To provide magnet free spaces, we merely eliminate some of the bending 
magnets. As a consequence the whole ring lattice is composed of curved as well 
as straight FODO cells. The elimination of bending magnets must, however, be 
done thoughtfully since the dispersion function depends critically on the distribution 
of the bending magnets. Random elimination of bending magnets may lead to an 
uncontrollable perturbation of the dispersion function. 

Often it is desirable to have the dispersion function vanish or at least be small in 
magnet free straight sections to simplify injection and avoid possible instabilities if 
rf-cavities are placed where the dispersion function is finite. The general approach 
to this design goal is, for example, to use regular FODO cells for the arcs followed 
by a dispersion matching section, where the dispersion function is brought to zero or 
at least to a small value leading finally to a number of bending magnet free straight 
FODO cells. As an example such a lattice is shown in Fig. 10.11 for a 3.5 GeV 
synchrotron [9]. 

Figure 10.11 shows one quadrant of the whole ring and we clearly recognize 
three different lattice segments including seven arc FODO half cells, two half 


Fig. 10.11 Typical FODO lattice for a separated function synchrotron 
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cells to match the dispersion function and one half cell for installation of other 
machine components. Such a quadrant is mirror reflected at one or the other end 
to form one of two ring lattice superperiods. In this example the ring consists of 
two superperiods although another ring could be composed by a different number 
of superperiods. A specific property of the lattice shown in Fig. 10.11 is, as far as 
focusing is concerned, that the whole ring is made up of equal FODO cells with 
only two quadrupole families QF and QD. The betatron functions are periodic and 
are not significantly affected by the presence or omission of bending magnets which 
are assumed to have negligible edge focusing. By eliminating bending magnets in 
an otherwise unperturbed FODO lattice, we obtain magnet free spaces equal to the 
length of the bending magnets which are used for the installation of accelerating 
components, injection magnets, and beam monitoring equipment. 


Phase Space Matching 


Periodic lattices like FODO channels exhibit unique solutions for the betatron and 
dispersion functions. In realistic accelerator designs, however, we will not be able 
to restrict the lattice to periodic cells only. We will find the need for a variety of 
lattice modifications which necessarily require locally other than periodic solutions. 
Within a lattice of a circular accelerator, for example, we encountered the need 
to provide some magnet free spaces, where the dispersion function vanishes. In 
colliding beam facilities it is desirable to provide for a very low value of the betatron 
function at the beam collision point to maximize the luminosity. These and other 
lattice requirements necessitate a deviation from the periodic cell structure. 

Beam transport lines are in most cases not based on periodic focusing. If 
such transport lines carry beam to be injected into a circular accelerator or must 
carry beam from such an accelerator to some other point, we must consider 
proper matching conditions at locations, where lattices of different machines or 
beam transport systems meet [10, 11]. Joining arbitrary lattices may result in an 
inadequate over lap of the phase ellipse for the incoming beam with the acceptance 
of the downstream lattice as shown in Fig. 10.12a. 


a) b) c) 


incoming beam acceptance 


Fig. 10.12 Matching conditions in phase space: mismatch (a), perfect match (b), efficient 
match (c) 
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For a perfect match of two lattices, all lattice functions must be the same at the 
joining point as shown in Fig. 10.12b 


(Bx, Ox, By, dy, 1, W)i = (Bx, Oz, By, Oy, 7, n')2« (10.105) 


In this case, the phase ellipse at the end of lattice ; is similar to the acceptance 
ellipse at the entrance of lattice 2 (see Fig. 10.12). To avoid dilution of particles in 
phase space perfect matching is desired in proton and ion beam transport systems 
and accelerators. For electrons this is less critical because electron beams regain 
the appropriate phase ellipse through synchrotron radiation and damping. The main 
goal of matching an electron beam is to assure that the emittance of the incoming 
beam is fully accepted by the downstream lattice as shown in Fig. 10.12b, c. Perfect 
matching of all lattice functions and acceptances with beam emittance, however, 
provides the most economic solution since no unused acceptance exist. Matching 
of the dispersion function (7, 7’) in addition also assures that phase ellipses for off 
momentum particles match as well. 

Matching in circular accelerators is much more restrictive than that between 
independent lattices. In circular accelerators a variety of lattice segments for 
different functions must be tied together to form a periodic magnet structure. 
To preserve the periodic lattice functions, we must match them exactly between 
different lattice segments. Failure of perfect matching between lattice segments can 
lead to lattice functions which are vastly different from design goals or do not exist 
at all. 

In general there are six lattice functions to be matched requiring six variables or 
quadrupoles in the focusing structure of the upstream lattice to produce a perfect 
match. Matching quadrupoles must not be too close together in order to provide 
some independent matching power for individual quadrupoles. As an example, the 
betatron functions can be modified most effectively if a quadrupole is used at a 
location, where the betatron function is large and not separated from the matching 
point by multiples of z in betatron phase. Most independent matching conditions 
for both the horizontal and vertical betatron functions are created if matching 
quadrupoles are located where one betatron function is much larger than the other 
allowing almost independent control of matching condition. 

It is impossible to perform such general matching tasks by analytic methods 
and a number of numerical codes are available to solve such problems. Frequently 
used matching codes are TRANSPORT [12], or MAD [13]. Such programs are 
an indispensable tool for lattice design and allow the fitting of any number of 
lattice functions to desired values including boundary conditions to be met along 
the matching section. 


Dispersion Matching 


A very simple, although not perfect, method to reduce the dispersion function in 
magnet free straight sections is to eliminate one or more bending magnets close 


ww 
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to but not at the end of the arc and preferably following a focusing quadrupole, 
QF. In this arrangement of magnets the dispersion function reaches a smaller 
value compared to those in regular FODO cells with a slope that becomes mostly 
compensated by the dispersion generated in the last bending magnet. The match 
is not perfect but the dispersion function is significantly reduced, where this is 
desirable, and magnet free sections can be created in the lattice. This method 
requires no change in the quadrupole or bending magnet strength and is therefore 
also operationally very simple as demonstrated in the example of a synchrotron 
lattice shown in Fig. 10.11. We note the less than perfect matching of the dispersion 
function which causes a beating of an originally periodic dispersion function. In 
the magnet free straight sections, however, the dispersion function is considerably 
reduced compared to the values in the regular FODO cells. 

More sophisticated matching methods must be employed, where a perfect match 
of the dispersion function is required. Matching of the dispersion to zero requires 
the adjustment of two parameters, 7 = 0 and 7’ = 0, at the beginning of 
the straight section. This can be achieved by controlling some of the upstream 
quadrupoles. Compared to a simple two parameter FODO lattice (Fig. 10.11) this 
variation requires a more complicated control system and additional power supplies 
to specially control the matching quadrupoles. This dispersion matching process 
disturbs the betatron functions which must be separately controlled and matched by 
other quadrupoles in dispersion free sections. Such a matching method is utilized in 
a number of storage rings with a special example shown in Fig. 10.13 [14]. 

Here, we note the perfect matching of the dispersion function as well as the 
associated perturbation of the betatron function requiring additional matching. 
Quadrupoles QFM and QDM are adjusted such that 7 = 0 and 7’ = 0 in the 
straight section. In principle this could be done even without eliminating a bending 
magnet, but the strength of the dispersion matching quadrupoles would significantly 
deviate from that of the regular FODO quadrupoles and cause a large distortion of 
the betatron function in the straight section. To preserve a symmetric lattice, the 


Fig. 10.13 Lattice for a 1.2 GeV low emittance damping ring 
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betatron function must be matched with the quadrupoles Q1 and Q2 to get a, = 0 
and a, = 0 at the symmetry points of the lattice. 


Dispersion Suppressor 


A rather elegant method of dispersion matching has been developed by Keil [15]. 
Noting that dispersion matching requires two parameters he chooses to vary the last 
bending magnets at the end of the arcs rather than quadrupoles. The great advantage 
of this method is to leave the betatron functions and the tunes undisturbed at least 
as long as we may ignore the end field focusing of the bending magnets which 
is justified in large high energy accelerators. This dispersion suppressor consists 
of four FODO half cells following directly the regular FODO cells at a focusing 
quadrupole QF as shown in Fig. 10.14. The strength of the bending magnets are 
altered into two types with a total bending angle of all four magnets to be equal to 
two regular bending magnets. 

The matching conditions can be derived analytically from the transformation 
matrix for the full dispersion suppressor as a function of the individual magnet 
parameters. An algebraic manipulation program has been used to derive a result 
that is surprisingly simple. If 6 is the bending angle for regular FODO cell bending 
magnets and yw the betatron phase for a regular FODO half cell, the bending angles 
6; and 6 are determined by [15] 


.=90 (1 - azz) (10.106) 
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Fig. 10.14 Dispersion suppressor lattice 
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and 
1 
6,= 0 ( ) ; (10.107) 


where 
6=6,4+ 63. (10.108) 


This elegant method requires several FODO cells to match the dispersion function 
and is therefore most appropriately used in large systems. Where a compact lattice 
is important, matching by quadrupoles as discussed earlier might be more space 
efficient. 


Magnet Free Insertions 


An important part of practical lattice design is to provide magnet free spaces 
which are needed for the installation of other essential accelerator components 
or experimental facilities. Methods to provide limited magnet free spaces by 
eliminating bending magnets in FODO lattices have been discussed earlier. Often, 
however, much larger magnet free spaces are required and procedures to provide 
such sections need to be formulated. 

The most simple and straight forward approach is to use a set of quadrupoles 
and focus the lattice functions 6,, 8, and n into a magnet free section such that 
the derivatives @,,a and n’ vanish in the center of this section. This method 
is commonly applied to interaction areas in colliding beam facilities to provide 
optimum beam conditions for maximum luminosity at the collision point. A typical 
example is shown in Fig. 10.15. 


0 10 20 30 40 50 2(m) 


Fig. 10.15 Lattice of the SPEAR storage ring 
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Fig. 10.16 Lattice of the 10 B 
ADONE storage ring (m) —— y 


Another scheme to provide magnet free spaces is exercised in the SPEAR lattice 
(Fig. 10.15) where the FODO structure remains unaltered except that the FODO 
cells have been separated in the middle of the QF quadrupoles. A separation in 
the middle of the QD quadrupoles would have worked as well. Since the middle 
of FODO quadrupoles are symmetry points a modest separation can be made with 
minimal perturbation to the betatron functions and no perturbation to the dispersion 
function since n’ = 0 in the middle of FODO quadrupoles. 

A more general design approach to provide magnet free spaces in a periodic 
lattice is exercised in the storage ring shown in Fig. 10.16 [16] or the storage ring as 
shown in Fig. 10.15 [17]. In the ADONE lattice the quadrupoles of a FODO lattice 
are moved together to form doublets and alternate free spaces are filled with bending 
magnets or left free for the installations of other components. 


Collins Insertion 


A simple magnet free insertion for dispersion free segments of the lattice has 
been proposed by Collins [18]. The proposed insertion consists of a focusing and 
a defocusing quadrupole of equal strength with a long drift space in between as 
shown in Fig. 10.17. In thin lens approximation, the transformation matrix for the 
insertion is 


oe (NOI )GD: 00m 


This insertion matrix must be equated with the transformation matrix for this 
same insertion expressed in terms of lattice functions at the insertion point with the 
regular lattice 


a (eee Bsinw . 


— He sin y cosy —asiny nan) 
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B—function 


D 
Collins insertion 


Fig. 10.17 Collins insertion 


Both matrices provide three independent equations to be solved for the drift lengths 
d and D and for the focal length f of the quadrupoles. After multiplications of all 
matrices we equate matrix elements and get 


(10.111) 


These relations are valid for both planes only if a, = —a,. Generally, this is not 
the case for arbitrary lattices but for a weak focusing FODO lattice this condition 
is met well. We note that this design provides an insertion of length D which is 
proportional to the value of the betatron functions at the insertion point and requires 
that a # 0. 

Of course any arbitrary insertion with a unity transformation matrix Z in both 
planes is a valid solution as well. Such solutions can in principle always be enforced 
by matching with a sufficient number of quadrupoles. If the dispersion function and 
its derivative is zero such an insertion may also have a transformation matrix of 
—T. This property of insertions is widely used in computer designs of insertions 
when fitting routines are available to numerically adjust quadrupole strength such 
that desired lattice features are met including the matching of the lattice functions 
to the insertion point. A special version of such a solution is the low beta insertion 
for colliding beam facilities. 


Low Beta Insertions 


In colliding beam facilities long magnet free straight sections are required to allow 
the installation of high energy particle detectors. In the center of these sections, 
where two counter rotating particle beams collide, the betatron functions must 
reach very small values forming a narrow beam waist. This requirement allows to 
minimize the destructive beam-beam effect when two beams collide and thereby 
maximize the luminosity of the colliding beam facility [19]. 
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Fig. 10.18 Lattice functions of a colliding beam storage ring [21]. Shown is half the circumference 
with the collision point, low beta and vanishing dispersion in the center 


An example for the incorporation of such a low beta insertion is shown in 
Fig. 10.18 representing one of many variations of a low beta insertion in colliding 
beam facilities [20]. The special challenge in this matching problem is to provide 
a very small value for the betatron functions at the collision point. To balance the 
asymmetry of the focusing in the closest quadrupoles the betatron functions in both 
planes are generally not made equally small but the vertical betatron function is 
chosen smaller than the horizontal to maximize the luminosity. The length of the 
magnet free straight section is determined by the maximum value for the betatron 
function that can be accepted in the first vertically focusing quadrupole. The limit 
may be determined by just the physical aperture available or technically possible in 
these insertion quadrupoles or by the chromaticity and ability to correct and control 
chromatic and geometric aberrations. 

The maximum value of the betatron function at the entrance to the first 
quadrupole, the minimum value at the collision point, and the magnet free section 
are correlated by the equation for the betatron function in a drift space. Assuming 
symmetry about the collision point, the betatron functions develop from there like 


B@ =p +=, (10.112) 


where f* is the value of the betatron function at the symmetry point, z the distance 
from the collision point and 2Zin, the full length of the insertion between the 
innermost quadrupoles. 
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The distance L tended to be quite large to allow the installation of large particle 
detectors for high energy physics experiment. As a consequence, the betatron 
function became very large in the first quadrupoles causing severe perturbations 
and limitations in particle dynamics. This, of course, created a limit in the 
achievable luminosity. In new colliding beam facilities, like B-factories, the low- 
beta creating quadrupoles are incorporated deeply into the detectors, thus reducing 
L and the maximum value for the betatron functions. This compromise resulted in 
significantly higher luminosity of colliding beams. 


10.3. FODO Lattice and Acceleration* 


So far we have ignored the effect of acceleration in beam dynamics. In specific 
cases, however, acceleration effects must be considered specifically if the particle 
energy changes significantly along the beam line. In linear accelerators such a need 
occurs at low energies when we try to accelerate a large emittance beam through 
the small apertures of the accelerating sections. For example, when a positron beam 
is to be created the positrons emerging from a target within a wide solid angle are 
focused into the small aperture of a linear accelerator. After some initial acceleration 
in the presence of a solenoid field along the accelerating structure it is desirable to 
switch over to more economic quadrupole focusing. Even at higher energies when 
the beam diameter is much smaller than the aperture strong focusing is still desired 
to minimize beam break up instabilities. 


10.3.1 Lattice Structure 


A common mode of focusing uses a FODO lattice in conjunction with the linac 
structure. We may, however, not apply the formalism developed for FODO lattices 
without modifications because the particle energy changes significantly along the 
lattice. A thin lens theory has been derived by Helm [22] based on a regular FODO 
channel in the particle reference system. Due to Lorentz contraction the constant 
quadrupole separations L* in the particle system become increasing distances in 
the laboratory system as the beam energy increases. To show this quantitatively, 
we consider a FODO channel installed along a linear accelerator and starting at the 
energy Yo with a constant cell half length L= YoL*. The tick-marks along the scale 
in Fig. 10.19 indicate the locations of the quadrupoles and the distances between 
magnets in the laboratory system are designated by L),Ly.... 


Fig. 10.19 FODO channel and acceleration 
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With the acceleration a in units of the rest energy per unit length and yo the 
particle energy at the center of the first quadrupole, the condition to have a FODO 
channel in the particle system is 


Mod aL; 
Lt =. —_ Lar (+5 =). (10.113) 
0 1+ yo o YO 


The quantity 2L* is the length of a FODO cell in the particle system and L, is the 
distance between the first and second quadrupole in the laboratory system. Solving 
for L, we get 


ner 
Ly =L F (10.114) 
K 


where 


a 
K= —L"*. (10.115) 
Yo 


At the same time the beam energy has increased from yo to 
vi =Yot+ aly . (10.116) 


Equation (10.113) can be applied to any of the downstream distances between 
quadrupoles. The n" distance L,, for example, is determined by an integration from 
Zn—1 tO Z, or equivalently from 0 to L,, 


In d n Ln 
is =| ai, (1+ 2 a) . (10.117) 
0 1+ = a Yn-1 


While solving as In, we express the energy yn—1 by addition of the energy 
gains y,-1 = >; Ay; = =o ' L; and taking the distances L; from expressions 
(10.114) and (10.117) we get fork « 1 


* esd (n-1)k 
L, = L"——e : (10.118) 
K 

In thin lens approximation, the distances between successive quadrupoles 
increase exponentially in the laboratory system like (10.118) to resemble the 
focusing properties of a regular FODO channel with a cell length 2Z* in the particle 
system under the influence of an accelerating field. 

Such FODO channels are used to focus large emittance particle beams in linear 
accelerators as is the case for positron beams in positron linacs. For strong focusing 
as is needed for low energies where the beam emittance is large, the thin lens 
approximation, however, is not accurate enough and a more exact formulation of 
the transformation matrices must be applied [23], which we will derive here in some 
detail. 
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10.3.2 Transverse Beam Dynamics and Acceleration 


Transverse focusing can be significantly different along a linear accelerator due 
to the rapid changing particle energy compared to a fixed energy transport line 
and the proper beam dynamics must be formulated in the presence of longitudinal 
acceleration. To derive the correct equations of motion we consider the particle 
dynamics in the presence of the complete Lorentz force including electrical fields 


p=cE+e[rxB]. (10.119) 


To solve this differential equation we consider a straight beam transport line with 
quadrupoles aligned along the z-coordinate as we would have in a linear accelerator. 
The accelerating fields are assumed to be uniform with a finite component only 
along the z-coordinate. At the location r = (x, y, z), the fields can be expressed by 
E = (0,0, a/e) and B = (gx, gy, 0), where the acceleration a is defined by 


a =elE|. (10.120) 


To evaluate (10.119), we express the time derivative of the momentum, p = 
ymr by 


p=ynr+ymr, (10.121) 


From cp = E/B we find that » = aB/mc? and (10.121) becomes for the x- 
component 


1 
CPy = aoBmk + —EX. (10.122) 
c 


In this subsection, we make ample use of quantities a, 6, y being acceleration 
and relativistic parameters which should not be confused with the lattice functions, 
which we will not need here. Bowing to convention, we refrain from introducing 
new labels. 

The variation of the momentum with time can be expressed also with the Lorentz 
equation (10.119) and with the specified fields, we get 


Px = —ceB gx. (10.123) 


We replace the time derivatives in (10.122) by derivatives with respect to the 
independent variable z 


x= Bex’, (10.124) 


- a 
X= PP c?x”+ x, 
yom 
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and after insertion into (10.122) and equating with (10.123) the equation of motion 
becomes 


dx a dx ceg 


e o BIE dz 4 BE x=0, (10.125) 
where we used the relation 6? + 1/y? = 1. With ,= senee and defining the 
quantity 
dp/dz a 
oe = = (10.126) 
0 
we get for the equation of motion in the horizontal plane, u = x 
d? d k 
2 I Dp, (10.127) 


+ as u 
dz l+nozdz 14+ 0z 


introducing the quadrupole strength kp = as Equation (10.127) is valid also for 
the vertical plane u = y if we only change the sign of the quadrupole strength ko. 
Equation (10.127) is a Bessel’s differential equation, which becomes obvious by 
defining a new independent variable 


2 
f= io + oz) (10.128) 
0 


transforming (10.127) into 

fs kay =O. (10.129) 
which is the equation of motion in the presence of both transverse and longitudinal 
fields. 


Analytical Solutions 


The solutions of the differential equation (10.129) are Bessel’s functions of the first 
and second kind in zero order 


u(z) = Cy Io(E) + C2 Yoé). (10.130) 


In terms of initial conditions (uo, uj) for z = 0 we can express the solutions in 
matrix formulation 


=i ¥; Yip 
ee _ vk Win. see ' vk (“”) (10.131) 
u'(z) M0 \ Tien Jitmes \ 0 Ze) \¥ 
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Here we defined Z,; = Z, (2 kd + n02)) and Zin = Zio (22 vk) where Z, stands 
for either of the Bessel’s functions J; or Y; andi = 0, 1. 


Transformation Matrices 


The transformation matrix for a drift space can be obtained from (10.131) by letting 
k — 0, but it is much easier to just integrate (10.127) directly with k = 0. We get 
from (10.127) w =-7 rae and after logarithmic integration u’ = Gut const. 
After still another integration 


/ 
“= tty + 7, ost + noz) (10.132) 
0 


or for a drift space of length L 


( u(L) ) ad > 0 5 log . + noL) & ’) (10.133) 
u'(L) 0 Gar Wy) 


For most practical purposes we may assume that ave > 1 and may, therefore, 
use asymptotic expressions for the Bessel’s functions. In this approximation the 
transformation matrix of a focusing quadrupole of length 2 is 


Ke = oO COS as Tk sin AE (10.134) 
ksin AE a cos AE 


z (3 + 3) sin A&é wi oe cos AE 


+ 
ae ee V/kcos AE — = (i+ 3) sin AE 
where 
' : (10.135) 
1+ nol 
and with A& = & — &, 
Fe 
f& =—wVJk and (10.136) 
No 
2 
& = — Vk(1+ of). (10.137) 
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Similarly we get for a defocusing quadrupole 


o cosh Aé m7; sinh Aé 
Ma = (10.138) 
ksinh AE o paginas 


Lf 


¢ (2 + 4) sinh Ag Ege AE cosh Ag 
03 . 
a AE = Vk cosh AE —= e(z + 3) sinh Ag 


These transformation matrices can be further simplified for low accelerating fields 


noting that mo < 1. In this case & — & ~ Jké = and with 
1/3 1 1/3 1 
A= + ) ~ ( + ) (10.139) 
8 ic & BLE &o 
we get for a focusing quadrupole the approximate transformation matrix 
i . 
cos — sin 
M; = (¢ =) vee (10.140) 
0 o —VJksinw cosy 


Asiny 0 
+ ( 0 Hall 


and similar for a defocusing quadrupole 


o 0 cosh y 7a sinh y 
= 0 10.141 
Ma (° ) (e¢ cosh w a 


A sinh y 0 
7 ( 0 —-A a) , 


Finally, the transformation matrix for a drift space of length L in an accelerating 
system can be derived from either (10.140) or (10.141) by letting k — 0 for 


1 —+ logo* 
Mo = ( m OBO (10.142) 


0 o* 


where ot = 1/(1 + noL) in agreement with (10.122). In the limit of vanishing 
accelerating fields yo —> O and we obtain back the well-known transformation 
matrices for a drift space. Similarly, we may test (10.140) and (10.141) for 
consistency with regular transformation matrices. 

In Eqs. (10.140)—(10.142) we have the transformation matrices for all elements 
to form a FODO channel in the presence of acceleration. We may now apply all 
formalisms used to derive periodic betatron, dispersion functions or beam envelopes 
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as derived in Sect. 10.1 for regular FODO cells. Considering one half cell we note 
that the quadrupole strength k of the first half quadrupole is determined by the last 
half quadrupole of the previous FODO half cell. We have therefore two variables 
left, the half cell drift length L and the strength k; of the second half quadrupole of 
the FODO half cell, to fit the lattice functions to a symmetric solution by requiring 
that vw, = 0 anda, = 0. 


Adiabatic Damping 


Transformation matrices derived in this section are not phase space conserving 
because their determinant is no more equal to unity. The determinant for a drift 
space with acceleration is, for example, 


1 


det Mo = = 
1+ oz 


(10.143) 


which is different from unity if there is a finite acceleration. The two-dimensional 
(x, x’)-phase space, for example, is not invariant anymore. For example, the area of 
a rhombus in phase space, defined by the two vectors x9 = (x, 0) and Ce = (0, Hq) 
is reduced according to (10.143) to 


|x0, 25 (10.144) 


and the beam emittance, defined by x and x’, is therefore not preserved in the 
presence of accelerating fields. This phenomenon is known as adiabatic damping 
under which the beam emittance varies like 


1 
gm e. (10.145) 


1+ noz Dp 


where np Az = AE/Ep is the relative energy gain along the length Az of the 
accelerator. From this we see immediately that the normalized phase space area 
cp€ is conserved in full agreement with Liouville’s theorem. In beam transport 
systems where the particle energy is changing it is therefore more convenient and 
dynamically correct to use the truly invariant normalized beam emittance defined by 


€, = BYE. (10.146) 


This normalized emittance remains constant even when the particle energy is 
changing due to external electric fields. In the presence of dissipating processes like 
synchrotron radiation, scattering or damping, however, even the normalized beam 
emittance changes because Liouville’s theorem of the conservation of phase space 
is not valid anymore. 
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From (10.144) we obtain formally the constancy of the normalized beam 
emittance by multiplying with the momenta pp and p = po(1 + noz) for 


Ix, (1 + no z)pox'| = |xo, poxe'| (10.147) 
or with the transverse momenta po x’ = po, and (1 + 10z) pox’ = p, 
|x, p,| = |Xo. Pox| = const. (10.148) 


This can be generalized to a six-dimensional phase space, remembering that in this 


3 1 
case det(Mo) = (9) since the matrix has the oon | no 


1 — logo* 0 0 00 
0 of 0 0 00 
m=? ~ qq logot 0 0 | (10.149) 
0 0 0 «o 00 
0 oO oO 0 1A 
0 0 0 0 Oot 


where A is an rf related quantity irrelevant for our present arguments. For the 
six-dimensional phase space with coordinates x, p,, y, p,, T, AE, where p,, p, 
are the transverse momenta, t the longitudinal position of the particle with 
respect to a reference particle and AE the energy deviation we get finally with 
x 000 00 
0 por 9 0 0 0 


, Yo: Dov» To, AEo| = 
IX0, Pox Yo Poy, To ol 0 0 0 poy 0 O 
0000 0 
000 0 0ot 
Ix, Pxo Ys Pys Ts AE| = Xo, Pox» Yoo Poy Tos AE)| ee on 


These results do not change if we had included focusing in the transformation 
matrix. From (10.140), (10.141), we see immediately that the determinants for both 
matrices are 


det(M,) ~ det(M,) 04 (10.151) 


ignoring small terms proportional to A. 
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Problems 


Use thin lens approximation unless otherwise noted. 


10.1 (S). Produce a conceptual design for a separated function proton synchrotron 
to be used to accelerate protons from a kinetic energy of 10—150GeV/c. The 
circular vacuum chamber aperture has a radius of R = 20mm and is supposed 
to accommodate a beam with a uniform beam emittance of € = 5mm mrad in both 
planes and a uniform momentum spread of og/E = +0.1%. The peak magnetic 
bending field is B = 1.8 T at 150 GeV/c. 


10.2 (S). Specify a FODO cell to be used as the basic lattice unit for a 50 GeV 
synchrotron or storage ring. The quadrupole aperture for the beam shall have a 
radius of R = 3 cm. Adjust parameters such that a Gaussian beam with an emittance 
of € = 5mmmrad in the horizontal plane, of € = 0.5 mm mrad in the vertical plane 
and an energy spread of AE/Eo = 0.01 would fit within the quadrupole aperture. 
Ignore wall thickness of the vacuum chamber. 


(a) Considering the magnetic field limitations of conventional magnets, adjust 
bending radius, focal length and if necessary cell length to stay within realistic 
limits for conventional magnets. 

(b) What is the dipole field and the pole tip field of the quadrupoles? Adjust the 
total number of cells such that there is an even number of FODO cells and the 
tunes are far away from an integer or half integer resonance? 


10.3 (S). Consider a ring composed of an even number 2n, of FODO cells. To 
provide two component free spaces, we cut the ring at a symmetry line through the 
middle of two quadrupoles on opposite sides of the ring and insert a drift space 
of length 2€ which is assumed to be much shorter than the value of the betatron 
function at this symmetry point £ < Bo. Derive the transformation matrix for this 
ring and compare with that of the unperturbed ring. What is the tune change of 
the accelerator. The betatron functions will be modified. Derive the new value of 
the horizontal betatron function at the symmetry point in units of the unperturbed 
betatron function. Is there a difference to whether the free section is inserted in the 
middle of a focusing or defocusing quadrupole? How does the -function change? 


10.4 (S). Sometimes two FODO channels of different parameters must be matched. 
Show that a lattice section can be designed with a phase advance of Ay, = Ay = 
zt/2, which will provide the desired matching of the betatron functions from the 
symmetry point of one FODO cell to the symmetry point of the other cells. Such a 
matching section is also called a quarter wavelength transformer and is applicable 
to any matching of symmetry points. Does this transformer also work for curved 
FODO channels, where the dispersion is finite? 


10.5 (S). The quadrupole lattice of the synchrotron in Fig. 10.11 forms a pure 
FODO lattice. Yet the horizontal betatron function shows some beating perturbation 
while the vertical betatron function is periodic. What is the source of perturbation 
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for the horizontal betatron function? An even stronger perturbation is apparent for 
the dispersion function. Explain why the dispersion function is perturbed. 


10.6. For one example determine the real quadrupole length required to produce 
the quoted betatron phase advances per FODO cell in Table 10.1. Compare with 
thin lens quadrupole strengths. 


10.7. Calculate the values of the betatron functions in the center of the quadrupoles 
for {1 and 12 FODO cells in Table 10.1 and compare with the actual thick lens 
betatron functions in Figs. 10.6 and 10.7. Discuss the difference. 


10.8. The original lattice of Problem 10.4 is to be expanded to include dispersion 
free cells. Incorporate into the lattice two symmetric dispersion suppressors based 
on the FODO lattice of the ring following the scheme shown in Fig. 10.14. Adjust 
the bending magnet strength to retain a total bending angle of 2z in the ring. 
Incorporate the two dispersion suppressors symmetrically into the ring and make 
a schematic sketch of the lattice. 


10.9. In the dispersion free region of Problem 10.8 introduce a symmetric Collins 
insertion to provide a long magnet free section of the ring. Determine the parameters 
of the insertion magnets and drift spaces. Use thin lens approximation to calculate a 
few values of the betatron functions in the Collins insertions and plot betatron and 
dispersion functions through the Collins insertion. 


10.10. For the complete ring lattice of Problem 10.9 make a parameter list including 
such parameters as circumference, revolution time, number of cells, tunes (use 
simple numerical integration to calculate the phase advance in the Collins insertion), 
max. beam sizes, magnet types, length, strengths, etc. 


10.11. The fact that a Collins straight section can be inserted into any transport line 
without creating perturbations outside the insertion makes these insertions also a 
periodic lattice. A series of Collins straight sections can be considered as a periodic 
lattice composed of quadrupole doublets and long drift spaces in between. Construct 
a circular accelerator by inserting bending magnets into the drift spaces d and 
adjusting the drift spaces to D = 5m. What is the phase advance per period? 
Calculate the periodic 7-function and make a sketch with lattice and lattice functions 
for one period. 


10.12. Consider a regular FODO lattice as shown in Fig. 10.11, where some 
bending magnets are eliminated to provide magnet free spaces and to reduce the 
n-function in the straight section. How does the minimum value of the 7-function 
scale with the phase per FODO cell. Show if conditions exist to match the 7-function 
perfectly in the straight section of this lattice? 


10.13. How many protons would produce a circulating beam of | A in the ring 
of Problem 10.1? Calculate the total power stored in that beam at 150 GeV/c. By 
how many degrees could one liter of water be heated up by this energy? The proton 
beam emittance be €,,, = 5mm mrad at the injection energy of 10 GeV/c. Calculate 
the average beam width at 150 GeV/c along the lattice and assume this beam to hit 
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because of a sudden miss-steering a straight piece of vacuum chamber at an angle 
of 10 mrad. If all available beam energy is absorbed in a | mm thick steel vacuum 
chamber by how much will the strip of steel heat up? Will it melt? (specific heat 


CFe 


= 0.11 cal/g/°C, melting temperature Tp. = 1528 °C. 
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Part IV 
Beam Parameters 


Chapter 11 
Particle Beam Parameters 


Particle beams are characterized by a set of quantifying parameters being either 
constants of motion or functions varying from point to point along a beam transport 
line. The parameters may be a single particle property like the betatron function 
which is the same for all particles within a beam or quantities that are defined only 
for a collection of particles like beam sizes or beam intensity. We will define and 
derive expressions for such beam parameters and use them to characterize particle 
beams and develop methods for manipulation of such parameters. 


11.1 Definition of Beam Parameters 


Particle beams and individual particles are characterized by a number of parameters 
which we use in beam dynamics. We will define such parameters first before we 
discuss the determination of their numerical value. 


11.1.1 Beam Energy 


Often we refer to the energy of a particle beam although we actually describe 
only the nominal energy of a single particle within this beam. Similarly, we speak 
of the beam momentum, beam kinetic energy or the velocity of the beam, when 
we mean to say that the beam is composed of particles with nominal values of 
these quantities. We found in earlier chapters that the most convenient quantity 
to characterize the “energy” of a particle is the momentum for transverse beam 
dynamics and the kinetic energy for acceleration. To unify the nomenclature it has 
become common to use the term energy for both quantities noting that the quantity 
of pure momentum should be multiplied with the velocity of light (cp) to become 


Ww 
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dimensionally correct. Thus, the particle momentum is expressed in the dimension 
of an energy without being numerically identical either to the total energy or the 
kinetic energy but approaching both for highly relativistic energies. 


11.1.2 Time Structure 


A true collective beam parameter is the time structure of the particle stream. 
We make the distinction between a continuous beam being a continuous flow of 
particles and a bunched beam. Whenever particles are accelerated by means of 
rf-fields a bunched beam is generated, while continuous beams can in general 
be sustained only by dc accelerating fields or when no acceleration is required as 
may be true for a proton beam in a storage ring. A pulsed beam consists of a finite 
number of bunches or a continuous stream of particles for a finite length of time. 
For example, a beam pulse from a linear accelerator is made up of a finite string of 
micro bunches generated by rf-accelerating fields. 


11.1.3) Beam Current 


The beam intensity or beam current is expressed in terms of an electrical current 
using the common definition of the ratio of the electrical charge passing by a current 
monitor per unit time. For bunched beams the time span during which the charge is 
measured can be either shorter than the duration of the bunch or the beam pulse or 
may be long compared to both. Depending on which time scale we use, we define the 
bunch current or peak current, the pulse current or the average current respectively. 

In Fig. 11.1 the general time structure of bunched beams is shown. The smallest 
unit is the microbunch, which is separated from the next microbunch by the 
wavelength of the accelerating rf-field or a multiple thereof. The microbunch current 
or peak current 7 is defined as the total microbunch charge q divided by the 
microbunch duration t,,, 


ij=—. (11.1) 


The micro pulse duration must be specially defined to take a nonuniform charge 
distribution of the particular accelerator into account. A series of microbunches 
form a beam pulse which is generally determined by the duration of the rf-pulse. 
In a conventional S-band electron linear accelerator the rf-pulse duration is of 
the order of a few micro seconds while a superconducting linac can produce a 
continuous stream of microbunches thus eliminating the pulse structure of the beam. 
An electrostatic accelerator may produce pulsed beams if the accelerating voltage is 
applied only for short time intervals. The pulse current J, is defined as the average 
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Fig. 11.1 Definitions for time structure and pulse currents. (a) Peak current, / = q/Ty, where t,, is 


the microbunch duration and q the charge per microbunch. (b) Pulse current J, = i T/T = 9/T ys 
where T;, is the micro-bunch period. (c) Average current (7) = I)TpVrep, with T, the pulse duration 
and Vyep the pulse repetition rate. (d) Continuous beam current 


current during the duration of the pulse. If the duration of the micro bunch is t,, and 
the time between successive microbunches T,, the pulse current is 


T 


fa (11.2) 


ES 
Ty 

The average beam current, finally, is the beam current averaged over a complete 
cycle of the particular accelerator. 


; (11.3) 


where n,, is the number of microbunches per pulse and g the charge in a microbunch. 
In a beam transport line, this is the total charge passing by per unit time, where the 
unit time is as long as the distance between beam pulses. In a circular accelerator it 
is, for example, the total circulating charge divided by the revolution time. For the 
experimenter using particles from a cycling synchrotron accelerator the average 
current is the total charge delivered to the experiment during a time long compared 
to the cycling time divided by that time. 
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The “beam on—beam off” time is measured by the duty factor defined as the 
fraction of actual beam time to total time at the experimental station. Depending on 
the application, it is desirable to have a high duty factor where the particles come 
more uniformly distributed in time compared to a low duty factor where the same 
number of particles come in short bursts. 


11.1.4 Beam Dimensions 


Of great importance for the design of particle accelerators is the knowledge of beam 
size parameters like transverse dimensions, bunch length and energy spread as well 
as the particle intensity distribution in six-dimensional phase space. In this respect, 
electron beams may behave different from beams of heavier particles like protons 
which is a consequence of synchrotron radiation and effects of quantized emission 
of photons on the dynamic parameters of the electrons. Where such radiation effects 
are negligible beams of any kind of particles evolve the same way along a beam line. 
Specifically, we have seen that in such cases the beam emittances are a constant 
of motion and the beam sizes are therefore modulated only by the variation of 
the betatron and dispersion functions as determined by the focusing structure. The 
particle distribution stays constant while rotating in phase space. This is true for the 
transverse as well as for the longitudinal and energy parameters. 

A linear variation of beam emittance with energy is introduced when particles 
are accelerated or decelerated. We call this variation adiabatic damping, where the 
beam emittance scales inversely proportional with the particle momentum and the 
transverse beam size, divergence, bunch length and energy spread scale inversely to 
the square root of the particle momentum. This adiabatic damping actually is not a 
true damping process where the area in phase space is reduced. It rather reflects the 
particular definition of beam emittance with respect to the canonical dimensions 
of phase space. In transverse beam dynamics we are concerned with geometric 
parameters and a phase space element would be expressed by the product Au Au’. 
Liouville’s theorem, however, requires the use of canonical variables, momentum 
and position, and the same phase space element is Au Ap,, where Ap, = pou! 
and u is any of the three degrees of freedom. Acceleration increases the particle 
momentum po and as a consequence the geometric emittance Au Au’ must be 
reduced to keep the product Au Ap, constant. This reduction of the geometric 
emittance by acceleration is called adiabatic damping and occurs in all three degrees 
of freedom. 

More consistent with Liouville’s theorem of constant phase space density is the 
normalized emittance defined by 


€n = BYE, (11.4) 
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where y is the particle energy in units of the rest energy and B = v/c. This 
normalized emittance obviously has the appropriate definition to stay constant under 
the theorem of Liouville. 

It is often difficult and not practical to define a beam emittance for the whole 
beam. Whenever the beam is fuzzy at the edges it may not make sense to include all 
particles into the definition of the beam emittance and provide expensive aperture 
for the fuzzy part of the beam. Relativistic electron beams in circular accelerators 
are particularly fuzzy due to the quantized emission of synchrotron radiation and 
as a consequence the particle distribution transforms into a Gaussian distribution. 
Later, we will discuss the evolution of the beam emittance due to statistical effects 
in great detail and derive the particle distribution from the Fokker-Planck equation. 
The electron beam emittance is defined for that part of the beam which is contained 
within one standard unit of the Gaussian distribution. this is true also for any other 
parameter which assumes a Gaussian distribution like beam size, divergence, energy 
spread, phase etc. 

The beam emittance for particle beams is primarily defined by the characteristic 
source parameters and source energy. Given perfect matching between different 
accelerators and beam lines during subsequent acceleration, this source emittance 
is reduced inversely proportional to the particle momentum by adiabatic damping 
and stays constant in terms of normalized emittance. This describes accurately the 
ideal situation for proton and ion beams, for nonrelativistic electrons and electrons 
in linear accelerators as long as statistical effects are absent. A variation of the 
emittance occurs in the presence of statistical effects in the form of collisions with 
other particles or emission of synchrotron radiation and we will concentrate here 
in more detail on the evolution of beam emittances in highly relativistic electron 
beams. 

Statistical processes cause a spreading of particles in phase space or a continuous 
increase of beam emittance. In cases where this diffusion is due to the particle 
density, the emittance increase may decrease significantly because the scattering 
occurrence drops to lower and lower values as the particle density decreases. Such 
a case appears in intra-beam scattering [1-3], where particles within the same 
bunch collide and exchange energy. Specifically when particles exchange longi- 
tudinal momentum into transverse momentum and gain back the lost longitudinal 
momentum from the accelerating cavities. The beam “heats” up transversely which 
becomes evident in the increased beam emittance and beam sizes. 

Statistical perturbations due to synchrotron radiation, however, lead to truly 
equilibrium states where the continuous excitation due to quantized emission 
of photons is compensated by damping. Discussing first the effect of damping 
will prepare us to combine the results with statistical perturbations leading to an 
equilibrium state of the beam dimensions. 
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Emission of synchrotron radiation causes the appearance of a reaction force on the 
emitting particle which must be taken into account to accurately describe particle 
dynamics. In doing so, we note from the theory of synchrotron radiation that the 
energy lost into synchrotron radiation is lost through the emission of many photons 
and we may assume that the energy loss is continuous. Specifically, we assume that 
single photon emissions occur fast compared to the oscillation period of the particle 
such that we may treat the effect of the recoil force as an impulse. 

In general, we must consider the motion of a particle in all three degrees of 
freedom or in six-dimensional phase space. The appearance of damping stems from 
the emission of synchrotron radiation in general, but the physics leading to damping 
in the longitudinal degree of freedom is different from that in the transverse degrees 
of freedom. The rate of energy loss into synchrotron radiation depends on the 
particle energy itself being high at high energies and low at low energies. As a 
consequence, a particle with a higher than ideal energy will loose more energy to 
synchrotron radiation than the ideal particle and a particle with lower energy will 
loose less energy. The combined result is that the energy difference between such 
three particles has been reduced, an effect that shows up as damping of the beam 
energy spread. With the damping of the energy spread, we observe also a damping 
of its conjugate variable, the longitudinal phase or bunch length. 

In the transverse plane we note that the emission of a photon leads to a loss of 
longitudinal as well as transverse momentum since the particle performs betatron 
oscillations. The total lost momentum is, however, replaced in the cavity only in the 
longitudinal direction. Consequently, the combined effect of emission of a photon 
and the replacement of the lost energy in accelerating cavities leads to a net loss of 
transverse momentum or transverse damping. 

Although damping mechanisms are different for transverse and longitudinal 
degrees of freedom, the total amount of damping is limited and determined by 
the amount of synchrotron radiation. This correlation of damping decrements in all 
degrees of freedom was derived first by Robinson [4] for general accelerating fields 
as long as they are not so strong that they would appreciably affect the particle orbit. 


11.2.1 Robinson Criterion 


Following Robinson’s idea we will derive what is now known as Robinson’s 
damping criterion by observing the change of a six dimensional vector in phase 
space due to synchrotron radiation and acceleration. The components of this vector 
are the four transverse coordinates (x,x’, y, y’), the energy deviation AE, and the 
longitudinal phase deviation from the synchronous phase g = wW — wy. Consistent 
with smooth approximation a continuous distribution of synchrotron radiation along 
the orbit is assumed as well as continuous acceleration to compensate energy losses. 
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During the short time df the six-dimensional vector 
u = (x,x', yy, g, SE) (11.5) 
will change by an amount proportional to dt. We may expand the transformations 


into a Taylor series keeping only linear terms and express the change of the phase 
space vector in form of a matrix transformation 


Au =u, —Up = dtMuo. (11.6) 
From the eigenvalue equation for this transformation matrix, 
Mu; = Aju; , 


where u; are the eigenvectors, A; the eigenvalues being the roots of the characteristic 
equation det(M — AZ) = 0 and Z the unity matrix. From (11.6) we get 


uy = (1+ Mdr)uo = (1 + Ajdt)uo © ue”. (11.7) 
Since the eigenvectors must be real the eigenvalues come in conjugate complex pairs 
Aj = a; + iB;, 
where i = 1,2,3 and 
j=6 i=3 
y= 2) ow, (11.8) 
j=l i=! 


The quantities @; cause exponential damping or excitation of the eigenvectors 
depending on whether they are negative or positive, while the 6; contribute only a 
frequency shift of the oscillations. 

Utilizing the transformation matrix MM, we derive expressions for the eigenvalues 
by evaluating the expression a det(t M—AT)|,=0 in two different ways. With M = 
AZ we get 


d do Pins 
az det (rai - A) To = ae | 1% —A)|=0 = —A di: (11.9) 
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On the other hand, we may execute the differentiation on the determinant directly 
and get 


d 
az det M — 2Dhrao = (11.10) 
ee 


mM] m2 M43 
TM, TM22 — x TM23 
TM31 TM32 TM33 — Aes 


t=0 


TM — Xr TM 12 TM43 
m2) m2 M3 


TM31 TM32 TM33 — Aes 


j=6 
_ 45 5 _ 45 - 
= —?m,-+-—A?Meg = —A ) Mj « 


j=l 


Comparing (11.9) and (11.10) we note with (11.8) the relation 


j=6 j=6 i=3 
Soap = omy = 230 a; (11.11) 
jel jel i=l 
between eigenvalues, matrix elements, and damping decrements. To further identify 
the damping we must determine the transformation. The elements 7,, 33, and m55 
are all zero because the particle positions (x, y, g) are not changed by the emission 
of a photon or by acceleration during the time dr. 


my = 0 m33 = 0 ms55 = 0. (11.12) 


The slopes, however, will change. Since synchrotron radiation is emitted in the 
forward direction we have no direct change of the particle trajectory due to the 
emission process. We ignore at this point the effects of a finite radiation opening 
angle @ = +1/y and show in connection with the derivation of the vertical beam 
emittance that this effect is negligible while determining damping. Acceleration will 
change the particle direction because the longitudinal momentum is increased while 
the transverse momentum stays constant (see Fig. 11.2). 

As shown in Fig. 11.2, a particle with a total momentum pg and a transverse 
momentum po; due to betatron oscillation emits a photon of energy ¢,. This process 
leads to a loss of momentum of —Ap = ¢,/B, where B = v/c and a loss of 
transverse momentum. Acceleration will again compensate for this energy loss. 
During acceleration the momentum is increased by Ap; = +(Pu/cP) dt, where 
Py is the rf-power to the beam. The transverse momentum during this acceleration 
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Fig. 11.2 Reduction of the 
transverse momentum of 
trajectories by acceleration. 
For simplicity we assume 
here that the energy loss —Ap 
due to the emission of a 
photon is immediately 
compensated by accelerating 
fields in a rf-cavity (Ap,r) 


is not changed and we have therefore (pp — Ap) uy = [po — Ap + (Pre/cB) de] u', 
where u, and u/, are the slopes of the particle trajectory before and after acceleration, 
respectively. With wu’ = u/Bc and cpp = BE we have to first order in Ap and P,- dt 


E Pydt\ . 
= — ie (1- : ) io. (11.13) 
Eo + Pre dt Eo 


From (11.7) we get with (11.13) using average values for the synchrotron 
radiation power around the ring and with u = x or y 


P P 
m22 = _ and m4 = _ 5 (11.14) 


where we note that in the absence of acceleration the rf-power is equal to the 
nominal synchrotron radiation power (P,) = Uo/Tp. The energy variation of the 
particle is the combination of energy loss —P,, dt and gain P,- dt. With 


oP aP 
P,(E) = P, (Eo) + ae AE) and Py(W) = Pu(Ws) + —“| 9, 
E | ow We 
where y = W — YW we get 
AE, = AE —(P,(E)) dt + Pr(W) dt 
= AE a(Py) Arai OE) ope (11.15) 
= 0 JE Ow 7 Q : 
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because P,(Eo) = Pr(Ws). Equation (11.15) exhibits two more elements of the 
transformation matrix 


(11.16) 


and ne = - — 


Ws 


We have now all elements necessary to determine the damping decrements. 
From (11.12), (11.14), (11.16) we get the sum of the damping decrements 


i=3 j=6 
(P,) O(P 

Doar = $ Yom =e — 5 

jel 


i=1 


(11.17) 


0 


which depends only on the synchrotron radiation power and the particle energy. 
This result was first derived by Robinson [4] and is known as Robinson’s damping 
criterion. 

We may separate the damping decrements. For a plane circular accelerator 
without vertical bending magnets and coupling, the vertical damping decrement 
Q@y = Q can be extracted. Since the vertical motion is not coupled to either the 
horizontal or the synchrotron oscillations, we get from (11.14) and (11.17) 


dy = —3— > 


(11.18) 


tl] 
& 


The damping decrement for synchrotron oscillations has been derived in (9.27) 
and is 


(11.19) 


The horizontal damping decrement finally can be derived from Robinson’s 
damping criterion (11.17) and the two known decrements (11.18), (11.19) to be 


(11.20) 


We may further evaluate the total and partial differential of the synchrotron 
radiation power P,, with energy E. The synchrotron radiation power is proportional 
to the square of the particle energy E and magnetic field B at the source of radiation 
and the partial differential is therefore 


OP, 


a (11.21) 


0 
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The total differential of the synchrotron radiation power depends not only on the 
particle energy directly but also on the variation of the magnetic field with energy as 
seen by the particle. A change in the particle energy causes a shift in the particle orbit 
where the 7-function is nonzero and this shift may move the particle to a location 
with different field strength. To include all energy dependent contributions, we 
inspect the definition of the average synchrotron radiation power (P,) = a f Py dt 


and noting that for highly relativistic particles cdt = dz = (1 + 145) dz the 
average radiation power becomes 


(P= PP eg (11.22) 
7 ~~ Tp y p Eo e . 


Differentiating (11.22) with respect to the energy 


d(P. 1 dP 
AT f Y) 4 p,—_]| az, (11.23) 
dE 0 cTo dE 0 pEo 
where 
dP P P, dB d& P 
UG) ee ge a X_45ty 422 pkn. 
dE|, Ey Bo dxdE © Ey) Ey 


Collecting all components, the synchrotron oscillation damping decrement (11.19) 
is finally 


ie) 


y) 


(2+ 9), (11.24) 


where we used (P,) « f «dz and Py « K* with k = 1/p 


_ Pken (1 + 2p7k) dz 


v (11.25) 
fxdz 
Similarly, we get from (11.20) for the horizontal damping decrement 
1(P 
pa YG), (11.26) 


2 Eo 
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In summary the damping decrements for betatron and synchrotron oscillations can 
be expressed by 


1 (Py) 1 (Py) 
_ _1(Py) __1 (Py) 
y= =5o, ay a BU (11.27) 
pol ail, 
: 2E 2. fe 


where the factors J; are the damping partition numbers, 


J, =24+09, 
J, =1-0, (11.28) 
Jy = 1, 


Robinson’s damping criterion can be expressed by 


Seat (11.29) 


In more practical quantities, the damping decrements can be obtained with (24.35) 
from 


1 3/1 
Oy = —3zleCY 2 oe (11.30) 


Damping occurs in circular electron accelerators in all degrees of freedom. 
In transverse planes particles oscillate in the potential created by quadrupole 
focusing and any finite amplitude is damped by synchrotron radiation damping. 
Similarly, longitudinal synchrotron oscillations are contained by a potential well 
created by the rf-fields and the momentum compaction and finite deviations of 
particles in energy and phase are damped by synchrotron radiation damping. 
We note that the synchrotron oscillation damping is twice as strong as transverse 
damping. 

All oscillation amplitudes a, in six dimensional phase space are damped (a < 0) 
or anti-damped (@ > 0) like 


dy = aye (11.31) 


and the damping or rise times are 


™=—. (11.32) 
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In a particular choice of lattice, damping rates can be shifted between different 
degrees of freedom and special care must be exercised when combined function 
magnets or strong sector magnets are introduced into a ring lattice. 

Both the synchrotron and betatron oscillation damping can be modified by a 
particular choice of lattice. From (11.25) we note the contribution «+7 which is 
caused by sector magnets. Particles with higher energies follow a longer path in 
a sector magnet and therefore radiate more. Consequently synchrotron damping is 
increased with . This term vanishes for rectangular magnets and must be modified 
appropriately for wedge magnets. For a rectangular magnet 


2k nk dz 
Drect = g tes! (11.33) 
px dz 
and for wedge magnets 
[K2Gon0 + f 2(Knk) dz+ K7Gene |, 
2 meh (11.34) 


fi xdz 


where we add all contributions from all magnets in the ring. The edge angles at the 
entrance 69 and exit @. are defined to be positive going from a rectangular magnet 
toward a sector magnet. 

The second term in the nominator of (11.25) becomes significant for combined 
function magnets and vanishes for separated function magnets. Specifically. a 
strong focusing gradient (k > 0) combined with beam deflection can contribute 
significantly to ?. For # = 1 all damping in the horizontal plane is lost and 
anti-damping or excitation of betatron oscillations appears for } > 1. This 
occurs, for example, in older combined function synchrotrons. At low energies, 
however, the beam in such lattices is still stable due to strong adiabatic damping 
and only at higher energies when synchrotron radiation reduces acceleration will 
horizontal anti-damping take over and dictate an upper limit to the feasibility of 
such accelerators. Conversely, vertical focusing (k < 0) can be implemented into 
bending magnets such that the horizontal damping is actually increased since ? < 0. 
However, there is a limit for the stability of synchrotron oscillations for 7 = 2. 


11.3 Particle Distribution in Longitudinal Phase Space 


The particle distribution in phase space is rarely uniform. To determine the required 
aperture in a particle transport system avoiding excessive losses we must, however, 
know the particle distribution. Proton and ion beams involve particle distributions 
which due to Liouville’s theorem do not change along a beam transport system, 
except for the variation of the betatron and dispersion function. The particle 
distribution can therefore be determined by measurements of beam transmission 
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through a slit for varying openings. If this is done at two points about 90° apart in 
betatron phase space, angular as well as spatial distribution can be determined. 

This procedure can be applied also to electrons in a transport system. The 
distribution changes, however, significantly when electrons are injected into a 
circular accelerator. We will discuss the physics behind this violation of Liouville’s 
theorem and determine the resulting electron distribution in phase space. 

Relativistic electron and positron beams passing through bending magnets emit 
synchrotron radiation, a process that leads to quantum excitation and damping. 
As a result the original beam emittance at the source is completely replaced by 
an equilibrium emittance that is unrelated to the original source characteristics. 
Postponing a rigorous treatment of statistical effects to Chap. 12 we concentrate 
here on a more visual discussion of the reaction of synchrotron radiation on particle 
and beam parameters. 


11.3.1 Energy Spread 


Statistical emission of photons causes primarily a change of particle energy leading 
to an energy spread within the beam. To evaluate the effect of quantized emission of 
photons on the beam energy spread, we observe particles undergoing synchrotron 
oscillations so that a particle with an energy deviation Ao at time fo will have an 
energy error at time ¢ of 


A(t) = Ao e220) (11.35) 


Emission of a photon with energy ¢ at time ¢; causes a perturbation and the 
particle continues to undergo synchrotron oscillations but with a new amplitude 


Ay = Agel?O-) — ¢ ein) (11.36) 


The change in oscillation amplitude due to the emission of one photon of energy 
é can be derived from (11.36) by multiplying with its imaginary conjugate for 


Aj = Ao + &? — 2eAo cos [Q (ti — to)] . (11.37) 


Because the times at which photon emission occurs is random we have for the 
average increase in oscillation amplitude due to the emission of a photon of energy ¢€ 


(AA?) = (At — AQ) = &” (11.38) 


The rate of change in amplitude per unit time due to this statistical or quantum 
excitation while averaging around the ring is 


dA2 


—— 11. 
a (11.39) 


Zz 


= [ e’n(e) de = (Non (e*}) 


Wz 
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where 71 (€) is the number of photons of energy ¢ emitted per unit time and energy 
bin de. This can be equated to the total photon flux Non multiplied by the average 
square of the photon energy and again taking the average along the orbit. 

Damping causes a reduction in the synchrotron oscillation amplitude and with 
A = Aoe* and the synchrotron oscillation damping time t, = 1/ |a,| (11.27) 


dA2 
(F 


_ 2 tay, (11.40) 
diz Tz 


Both quantum excitation and damping lead to an equilibrium state 


(Non (€7)). — = (AP) = 0, (11.41) 


or solving for (A) 
(A?) = 31, (Non (€")). (11.42) 


Due to the central limit theorem of statistics the energy distribution caused by 
the statistical emission of photons assumes a Gaussian distribution with the standard 
root mean square energy spread 07 = 5 (A’). The photon spectrum will be derived 
in Part 22.6 and the integral in (11.39) can be evaluated to give [5] 


55 


Non (e7) = ry ieee 


(11.43) 


Replacing the synchrotron radiation power P, by its expression in (24.34) and 
the critical photon energy €. = ha, by (24.49) we get 


. 55 4 
Non (e2) = ec ac (me? "| (11.44) 
P ( ) 320 /3 Y ( ) Y 
with C, = = “; = 8.8460 x 10° m/GeV*and the equilibrium energy 


spreadbecomes finally with (11.27) and (24.34) 


eo ot,  (e), 
= = (Nae) —c, 11.45 
F2 roa p (e )). a7, (x2). ( ) 
where 
99 Fe _ 5 g4x 10-8 (11.46) 
= =o. m . 
1 32/3 me? 


for electrons and positrons. The equilibrium energy spread in an electron storage 
ring depends only on the beam energy and the bending radius. 
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11.3.2. Bunch Length 


The conjugate coordinate to the energy deviation is the phase and a spread of particle 
energy appears also as a spread in phase or as a longitudinal particle distribution and 
an equilibrium bunch length. The bunch length is 


clne| Ge 


— 11.47 
2 Ey ( ) 


op = 


and replacing the synchrotron oscillation frequency by its expression (9.35) we get 
finally for the equilibrium bunch length in a circular electron accelerator 


2 cE E 
pe | (11.48) 
rey heV COs Ws Eo 


The equilibrium electron bunch length can be varied by varying the rf-voltage 


and scales like og « 1/ Nal which is a much stronger dependence than the scaling 
obtained for non-radiating particles in Sect. 9.3.5. A very small bunch length can be 
obtained by adjusting the momentum compaction to a small value including zero. 
As the momentum compaction approaches zero, but second order terms must be 
considered which has been discussed in detail in Sect.9.4.4. An electron storage 
ring where the momentum compaction is adjusted to be zero or close to zero is 
called an isochronous ring [6] or a quasi isochronous ring [7]. Such rings do not yet 
exist at this time but are intensely studied and problems are being solved in view of 
great benefits for research in high energy physics, synchrotron radiation sources and 
free electron lasers to produce short particle or light pulses. 


11.4 Transverse Beam Emittance 


The sudden change of particles energy due to the quantized emission of photons also 
causes a change in the characteristics of transverse particle motion. Neither position 
nor the direction of the particle trajectory is changed during the forward emission 
of photons. From beam dynamics, however, we know that different reference 
trajectories exist for particles with different energies. Two particles with energies 
cp; and cp2 follow two different reference trajectories separated at the position z 
along the beam transport line by a distance 


Ax (2) = n(—) PLP? (11.49) 


CPo 


where 7(z) is the dispersion function and cpo the reference energy. Although 
particles in general do not exactly follow these reference trajectories, they do 
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perform betatron oscillations about these trajectories. The sudden change of the 
particle energy causes a sudden change in the reference path and thereby a sudden 
change in the betatron oscillation amplitude. 


11.4.1 Equilibrium Beam Emittance 


Postponing again a rigorous discussion of the evolution of phase space due to 
statistical perturbations to the next chapter, we follow here a more intuitive path 
to determine the equilibrium transverse beam emittance. Similar to the discussion 
leading to the equilibrium energy spread we will observe perturbations to the 
transverse motion caused by photon emission. In the case of longitudinal quantum 
excitation it was sufficient to consider the effect of photon emission on the particle 
energy alone since the particle phase is not changed by this process. 

As a particle emits a photon it will not change its actual position and direction. 
However, the position of a particle with respect to the ideal reference orbit is the 
combination of its betatron oscillation amplitude and a chromatic contribution due 
to a finite energy deviation and dispersion. Variation of the particle position u = 
ug + n (AE/Ep), and direction wu! = uz + 1 (AE/Ep) due ¢ is described by 


du=O=dug+np or dug =—N5 
(11.50) 
bu’ = 0 = du, +7'— or buy = —7' 5. 

We note the sudden changes in the betatron amplitudes and slopes because the 
sudden energy loss leads to a simultaneous change in the reference orbit. This 
perturbation will modify the phase ellipse the particles move on. The variation of 
the phase ellipse yu? + 2auu' + Bu’” = a? is expressed by 


yd(ug) + 2a6(upuls) + B5(u'y”) = 8(a’) 


and inserting the relations (11.50) we get terms of the form 5(u5) = (ugo + dug)? — 
Ugo etc. Emission of photons can occur at any betatron phase and we therefore 
average over all phases. As a consequence, all terms depending linearly on the 
betatron amplitude and its derivatives or variations thereof vanish. The average 
variation of the phase ellipse or oscillation amplitude a due to the emission of 
photons with energy ¢ becomes then 


(6a’) = © He) (11.51) 
E , , 
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where 
H(z) = Bn’ +2ann! + yr’. (11.52) 


We average again over all photon energies, multiply by the total number of 
photons emitted per unit time and integrate over the whole ring to get the variation 
of the oscillation amplitude per turn 


A(a’) = =a f Ne?) HO) dz. (11.53) 
0 


The rate of change of the oscillation amplitude is then with z = ct 


— (None?) H(2)), . (11.54) 


where the index z indicates averaging around the ring. This quantum excitation of 
the oscillation amplitude is compensated by damping for which we have similar 
to (11.40) 


= Ina"). (11.55) 


Equilibrium is reached when quantum excitation and damping are of equal 
strength which occurs for 


(11.56) 


Here we have used the definition of the standard width of a Gaussian particle 
distribution 


o, = (w(@)) = (5a°Bu) (11.57) 


with the betatron function 6, and u = x or y. With (11.27), (11.44) and (24.34) we 
get finally 


ey = 


(11.58) 


which we define as the equilibrium beam emittance of a relativistic electron in a 
circular accelerator. 
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11.4.2. Emittance Increase in a Beam Transport Line 


In (11.53) we decided to integrate the quantum excitation over a complete turn 
of a circular accelerator. This should not be taken as a restriction but rather as 
an example. If we integrate along an open beam transport line we would get the 
increase of the beam emittance along this beam line. This becomes important for 
very high energy linear colliders where beams are transported along the linear 
accelerator and some beam transport system in the final focus section just ahead of 
the collision point. Any dipole field along the beam path contributes to an increase 
of the beam emittance, whether it be real dipole magnets, dipole field errors, path 
displacements in a quadrupole, or small correction magnets for beam steering. Since 
there is no damping, the emittance growth is therefore in both planes from (11.53) 
and (11.57) 


1 3 2 
Ae, = sera | Nol \H,(z) dz. (11.59) 


The function 1 is now evaluated with the dispersion functions D,,(z) instead of 
the periodic 7-function with contributions from any dipole field. Since such fields 
can occur in both planes there is an emittance increase in both planes as well. 
With (11.44) the increase in beam emittance is finally 


55C,he(mc*)” 
Ae, = — * f Pryde, (11.60) 
bas 


where the integration is taken along the beam line. The perturbation of the beam 
emittance in a beam transport line increases with the fifth power of the particle 
energy. At very high energies we expect therefore a significant effect of dipole errors 
on the beam emittance even if the basic beam transport line is straight. 

So far, we have not yet distinguished between the horizontal and vertical 
plane since the evolution of the phase space does not depend on the particular 
degree of freedom. The equilibrium beam emittance, however, depends on machine 
parameters and circular accelerators are not constructed symmetrically. Specifically, 
accelerators are mostly constructed in a plane and therefore there is no deflection in 
the plane normal to the ring plane. Assuming bending only occurs in the horizontal 
plane, we may use (11.58) directly as the result for the horizontal beam emittance 
u = x only. 


11.4.3 Vertical Beam Emittance 


In the vertical plane, the bending radius p, — oo and the vertical beam emittance 
reduces to zero by virtue of damping. Whenever we have ideal conditions like this it 
is prudent to consider effects that we may have neglected leading to less than ideal 
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results. In this case, we have neglected the fact that synchrotron radiation photons 
are emitted not strictly in the forward direction but rather into a small angle +1/y. 
Photons emitted at a slight angle exert a recoil on the particle normal to the direction 
of the trajectory. A photon emitted at an angle 6 with respect to the direction of the 
trajectory and an azimuth ¢ causes a variation of the vertical slope by 


€ 
by’ = -0 —, 
y Cone, 


while the position is not changed dy = 0. This leads to a finite beam emittance 
which can be derived analogous to the general derivation above 


; 7 a (Non {£707 cos? ) By)... hen 
We set 
(e707 cos’ p) ~ (7) (07) (cos? $) = (isG 


and get finally for the fundamental lower limit of the vertical beam emittance 
oy By 


Very roughly e,/e, = 1/ y? < 1 and it is therefore justified to neglect this 
term in the calculation of the horizontal beam emittance. This fundamental lower 
limit of the equilibrium beam emittance is of the order of 107! m, assuming the 
betatron function and the bending radius to be of similar magnitude, and therefore 
indeed very small compared to actual achieved beam emittances in real accelerators. 
In reality, we observe a larger beam emittance in the vertical plane due to coupling 
or due to vertical steering errors which create a small vertical dispersion and, 
consequently, a small yet finite vertical beam emittance. As a practical rule the 
vertical beam emittance is of the order of one percent or less of the horizontal beam 
emittance due to field and alignment tolerances of the accelerator magnets. For very 
small horizontal beam emittances, however, this percentage may increase because 
the vertical beam emittance due to vertical dipole errors becomes more significant. 

Sometimes it is necessary to include vertical bending magnets in an otherwise 
horizontal ring. In this case the vertical dispersion function is finite and so is H,(z). 
The vertical emittance is determined by evaluating (11.58) while using the vertical 
dispersion function. Note, however, that all bending magnets must be included in 
the calculation of equilibrium beam emittances because for quantum excitation it 
is immaterial whether the energy loss was caused in a horizontally or vertically 
bending magnet. The same is true for the damping term in the denominator. 
Differences in the horizontal and vertical beam emittance come from the different 
betatron and 7-functions at the location of the radiation source. 


w 
XQ 
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11.4 Transverse Beam Emittance 
11.4.4 Beam Sizes 


Beam parameters like width, height, length, divergence, and energy spread are 
not all fixed independent quantities, but rather depend on emittances and lattice 
and rf-parameters. These multiple dependencies allow the adjustment of beam 
parameters, within limits, to be optimum for the intended application. In this section 
we will discuss such dependencies. 

A particle beam at any point of a beam transport line may be represented by a 
few phase ellipses for different particle momenta as shown in Fig. 11.3. The phase 
ellipses for different momenta are shifted proportional to the dispersion function at 
that point and its derivative. Generally, the form and orientation of the ellipses are 
slightly different too due to chromatic aberrations in the focusing properties of the 
beam line. For the definition of beam parameters we need therefore the knowledge 
of the lattice functions including chromatic aberrations and the beam emittance and 
momentum spread. 

The particle beam width or beam height is determined by the emittance, betatron 
function, dispersion function and energy spread. The betatron and dispersion 
functions vary along a beam transport line and depend on the distribution of the 
beam focusing elements. The beam sizes are therefore also functions of the location 
along the beam line. From the focusing lattice these functions can be derived and 
the beam sizes be calculated. 

The beam size of a particle beam is generally not well defined since the 
boundaries of a beam tends to be fuzzy. We may be interested in the beam size 
that defines all of a particle beam. In this case we look for that phase ellipse that 


Fig. 11.3. Distribution of 

beam ellipses for a beam with x 
finite emittance and 

momentum spread 

(schematic). The variation in Ap/p > 0 
the shape of the phase ellipses 
for different energies reflect 
the effect of chromatic 
aberrations 7 


Ap/p = 0 


Ap/p < 0 
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encloses all particles and obtain the beam size in the form of the beam envelope. 
The beam half-width or half-height of this beam envelope is defined by 


ug(Z) = V €uBu(Z) (11.63) 


with u = (x, y). If there is also a finite momentum spread within the beam particles 
the overall beam size or beam envelope is increased by the dispersion 


Ac 
Uy(Z) = Nu(z) —_ (11.64) 
CPo 
and the total beam size is 
——— Acp 
Urot(2) = Up(Z) + Uy(z) = VeuBulZ) + Mu(Z) an (11.65) 


This definition of the beam size assumes a uniform particle distribution within the 
beam and is used mostly to determine the acceptance of a beam transport system. 
The acceptance of a beam transport system is defined as the maximum emittance 
a beam may have and still pass through the vacuum chambers of a beam line. 
In Fig. 11.3 this would be the area of that ellipse that encloses the whole beam 
including off momentum particles. In practice, however, we would choose a larger 
acceptance to allow for errors in the beam path. 

Since the lattice functions vary along a beam line the required aperture to let a 
beam with the maximum allowable emittance pass is not the same everywhere along 
the system. To characterize the aperture variation consistent with the acceptance, a 
beam stay clear area, BSC, is defined as the required material free aperture of the 
beam line. 

The beam parameters for a Gaussian particle distributions are defined as the 
standard values of the Gaussian distribution 0,, oy, Oy, Oy’, 03,0¢ ,where most des- 
ignations have been defined and used in previous chapters and where 03 = 0. /cpo 
and oy the bunch length. Quoting beam sizes for any particle type in units of o 
can be misleading specifically in connection with beam intensities. For example, a 
beam with a horizontal and vertical size of one sigma has a cross section of 20,20, 
and includes only 46.59 % of the beam. This is accepted for electron beams with 
Gaussian distribution but for proton beams intensities are often given for s/60’s to 
cover most of the beam. In Table 11.1 the fraction of the total beam intensity is 
compiled for a few generally used units of beam size measurement and for beam 
size, cross section, and volume. The beam size for Gaussian beams is thereby 


Ou.tot = VeuBul2) + n?(z)os : (11.66) 


Four parameters are required to determine the beam size in each plane although 
in most cases the vertical dispersion vanishes. 
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Table 11.1 Fraction of total beam intensity 


Three-dimension (%) 
io =(6826~«*d AS 31381 
279A S=* OD 96.9 
oo (9856 SSS«*T 95.74 


11.4.5 Beam Divergence 


The angular distribution of particles within a beam depends on the rotation of the 
phase ellipse and we define analogous to the beam size an angular beam envelope by 


Ow tot = Vearu(2) + 1! (z) ce . (11.67) 


Again, there is a contribution from the betatron motion, from a finite momentum 
spread and from associated chromatic aberration. The horizontal and vertical beam 
divergencies are also determined by four parameters in each plane. 


11.5 Variation of the Damping Distribution 


Robinson’s criterion provides an expression for the overall damping in 
six-dimensional phase space without specifying the distribution of damping in 
the three degrees of freedom. In accelerators we make an effort to decouple the 
particle motion in the three degrees of freedom as much as possible and as a result 
we try to optimize the beam parameters in each plane separately from the other 
planes for our application. Part of this optimization is the adjustment of damping 
and as a consequence of beam emittances to desired values. Robinson’s criterion 
allows us to modify the damping in one plane at the expense of damping in another 
plane. This shifting of damping is done by varying damping partition numbers. 

From the definition of the % parameter is clear that damping partition numbers 
can be modified depending on whether the accelerator lattice is a combined function 
or a separated function lattice. Furthermore, we may adjust virtually any distribution 
between partition numbers by choosing a combination of gradient and separated 
function magnets. 


11.5.1 Damping Partition and Rf-Frequency 


Actually such “gradients” can be introduced even in a separated function lattice. 
If the rf-frequency is varied the beam will follow a path that meets the synchronicity 
condition. Increasing the rf-frequency, for example, leads to a shorter wavelength 


376 11 Particle Beam Parameters 


and therefore the total path length in the ring need to be shorter. As a consequence 
of the principle of phase stability the beam energy is reduced and the beam follows 
a lower energy equilibrium orbit with the same harmonic number as the reference 
orbit for the reference energy. Decreasing the rf-frequency leads just to the opposite 
effect. The off momentum orbits pass systematically off center through quadrupoles 
which therefore function like combined function gradient magnets. 

To quantify this effect we use only the second term in the expression (11.25) 
for ?. The first term, coming from sector magnets, will stay unaffected. Displace- 
ment of the orbit in the quadrupoles will cause a bending with a bending radius 


1 
—=ko6x. (11.68) 
Pq 


An rf-frequency shift causes a momentum change of 


Ap _ 1 Aftt 


; (11.69) 
Po Ao St 
which in turn causes a shift in the equilibrium orbit of 
A Aft 
a a (11.70) 
Po Ae Sit 
and the bending radius of the shifted orbit in quadrupoles is 
1 A A 
ae | ee ee any ec (11.71) 
Pq Po Ae St 
Inserted into the second term of (11.25), we get 
1 f2k’n? dz A 
Ha Ore ae (11.72) 


Ao f wade St , 


where / is the bending radius of the ring bending magnets All quantities in (11.72) 
are fixed properties of the lattice and changing the rf-frequency leads just to 
the expected effect. Specifically, we note that all quadrupoles contribute additive 
irrespective of their polarity. We may apply this to a simple isomagnetic FODO 
lattice where all bending magnets and quadrupoles have the same absolute strength 
respectively with ¢ dz/ Lo = 27/9. Integration of the nominator in (11.72) leads to 


f 2K Pde = 2K Chea a5 Min )4q2 Me 2 


where /, is half the quadrupole length in a FODO lattice, max and min the values 
of the y-function in the focusing QF and defocusing QD quadrupoles, respectively, 
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and n, the number of FODO cells in the ring. With all this the variation of the 7 
parameter 


20 Max + Marin Aft 
Tela f? fe” 


Ad = —n, (11.73) 


Here we have used the focal length f~! = kJ,. Replacing in (11.73) the 
functions by the expressions (10.74) derived for a FODO lattice, we recall the 
relation f = « Land get finally [8] 


1 
Ad = — P(e? 4. 1) At (11.74) 
c fq 


pa te 


where p is the average bending radius in the FODO cell. The variation of the 7 
parameter in a FODO lattice is the more sensitive to rf-frequency variations the 
longer the cell compared to the quadrupole length and the weaker the focusing. For 
other lattices the expressions may not be as simple as for the FODO lattice but can 
always be computed numerically by integrations and evaluation of (11.72). 

By varying the rf-frequency and thereby the horizontal and longitudinal damping 
partition number we have found a way to either increase or decrease the horizontal 
beam emittance. The adjustments, however, are limited. To decrease the horizontal 
beam emittance we would increase the horizontal partition number and at the same 
time the longitudinal partition number would be reduced. The limit is reached when 
the longitudinal motion becomes unstable or in practical cases when the partition 
number drops below about half a unit. Other more practical limits may occur before 
stability limits are reached if, for example, the momentum change becomes too large 
to fit the beam into the vacuum chamber aperture. 


11.6 Variation of the Equilibrium Beam Emittance 


In circular electron accelerators the beam emittance is determined by the emission of 
synchrotron radiation and the resulting emittance is not always equal to the desired 
value. In such situations methods to alter the equilibrium emittance are desired and 
we will discuss in the next sections such methods which may be used to either 
increase or decrease the beam emittance. 


11.6.1 Beam Emittance and Wiggler Magnets 


The beam emittance in an electron storage ring can be greatly modified by the use of 
wiggler magnets both to increase [9] or to decrease the beam emittance. A decrease 
in beam emittance has been noted by Tazzari [10] while studying the effect of a 
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number of wiggler magnets in a low emittance storage ring design. Manipulation of 
the beam emittance in electron storage rings has become of great interest to obtain 
extremely small beam emittances and we will therefore derive systematic scaling 
laws for the effect of wiggler magnets on the beam emittance as well as on the beam 
energy spread [10, 11]. 

The particle beam emittance in a storage ring is the result of two competing 
effects, the quantum excitation caused by the quantized emission of photons and 
the damping effect. Both effects lead to an equilibrium beam emittance observed in 
electron storage rings. 

Independent of the value of the equilibrium beam emittance in a particular 
storage ring, it can be further reduced by increasing the damping without also 
increasing the quantum excitation. More damping can be established by causing 
additional synchrotron radiation through the installation of deflecting dipole mag- 
nets like strong wigglers magnets. In order to avoid quantum excitation of the 
beam emittance, however, the placement of wiggler magnets has to be chosen 
carefully. As discussed earlier, an increase of the beam emittance through quantum 
excitation is caused only when synchrotron radiation is emitted at a place in the 
storage ring where the dispersion function is finite. Emittance reducing wiggler 
magnets therefore must be placed in areas around the storage ring where the 
dispersion vanishes to minimize quantum excitation. To calculate the modified 
equilibrium beam emittance, we start from (11.54) and get with (11.44) and (11.57) 
an expression for the quantum excitation of the emittance which can be expanded to 
include wiggler magnets 


S| 2 Cay? (KH) (11.75) 
dt |g 3 aie nol : 

The quantity H is evaluated for the plane for which the emittance is to be 
determined, EF is the particle energy, and p the bending radius of the regular ring 
magnets. The average () is to be taken for the whole ring and the index 9 indicates 
that the average (eT), be taken only for the ring proper without wiggler magnets. 

Since the contributions of different magnets, specifically, of regular storage ring 
magnets and wiggler magnets are independent of each other, we may use the results 
of the basic ring lattice and add to the regular quantum excitation and damping the 
appropriate additions due to the wiggler magnets, 


de 
dt 


=f Cay? [(e?H), + (PH), | - (11.76) 


q.w 3 


Both, ring magnets and wiggler magnets produce synchrotron radiation and 
contribute to damping of the transverse particle oscillations. Again, we may consider 
both contributions separately and adding the averages we get the combined rate of 
emittance damping from (11.55) and (11.27) 


de 


] = —Jrecew ter’ [ely + (eel (11.77 
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where €,, is the beam emittance with wiggler magnets and J,, the damping partition 
number with u = x,y. The equilibrium beam emittance is reached when the 
quantum excitation rate and the damping rates are of equal magnitude. We add 
therefore (11.76) and (11.77) and solve for the emittance 


by =G, yee w (11.78) 
Je (kg + (K*)y 
With €9 being the unperturbed beam emittance the relative emittance change due 

to the presence of wiggler magnets is 


és 1+ («?H) 


/ 
= 11.79 
€0 1+ (x?) / ( ) 
Making use of the definition of average parameter values we get with the circum- 
ference of the storage ring C = 27R 


(PH). = oH |Ke5| Hae, (PH), = oh || Hdz, (11.80) 
(7) = ep xode, and (x2), = Lfx2dc. : 


Evaluation of these integrals for the particular storage ring and wiggler magnet 
employed gives from (11.79) the relative change in the equilibrium beam emittance. 
We note that the quantum excitation term scales like the cube while the damping 
scales only quadratically with the wiggler curvature. This feature leads to the effect 
that the beam emittance is always reduced for small wiggler fields and increases 
only when the third power terms become significant. 

Concurrent with a change in the beam emittance a change in the momentum 
spread due to the wiggler radiation can be derived similarly, 


(11.81) 


Closer inspection of (11.79) and (11.81) reveals basic rules and conditions for 
the manipulations of beam emittance and energy spread. If the ring dispersion 
function is finite in the wiggler section strong quantum excitation may occur 
depending on the magnitude of the wiggler magnet bending radius p,,. This situation 
is desired if the beam emittance must be increased [9]. If wiggler magnets are 
placed into a storage ring lattice were the ring dispersion function vanishes, only 
the small dispersion function due to the wiggler magnets must be considered for 
the calculation of (Hy) and therefore only little quantum excitation occurs. In this 
case the beam emittance can be reduced since the wiggler radiation contributes more 
strongly to damping and we call such magnets damping wigglers [10, 11]. Whenever 
wiggler magnets are used which are stronger than the ordinary ring magnets py < po 
the momentum spread in the beam is increased. This is true for virtually all cases of 
interest. 
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Conceptual methods to reduce the beam emittance in a storage ring have been 
derived which are based on increased synchrotron radiation damping while avoiding 
quantum excitation effects. Optimum lattice parameters necessary to achieve this 
will be derived in the next section. 


11.6.2 Damping Wigglers 


The general effects of wiggler magnet radiation on the beam emittance has been 
described and we found that the beam emittance can be reduced if the wiggler is 
placed where 7 = 0 to eliminate quantum excitation. This assumption, however, 
is not quite correct. Even though we have chosen a place, where the storage ring 
dispersion function vanishes, the quantum excitation factor Hy is not exactly zero 
once the wiggler magnets are turned on because they create their own dispersion 
function. To calculate this dispersion function, we assume a sinusoidal wiggler 
field [11] 


B(z) = By coskpz, (11.82) 


where ky = 27/A, and A, the wiggler period length (Fig. 11.4). The differential 
equation for the dispersion function is then 


N 


H = K = ky tosk,z, (11.83) 
which can be solved by 


n(z) = (1 —coskpz) , 
; " : r*) (11.84) 
n (Zz) = ie sin kpz, 


where we have assumed that the wiggler magnet is placed in a dispersion free 
location no = = 0. With this solution, the first two Eqs. (11.80) can be 
evaluated. To simplify the formalism we ignore the z-dependence of the lattice 
functions within the wiggler magnet setting a, = 0 and 6, = const. Evaluating the 
integrals (11.80), we note that the absolute value of the bending radius must be used 
along the integration path because the synchrotron radiation does not depend on the 


sign of the deflection. With this in mind, we evaluate the integrals ie ’ |x? | n° dz 


Fig. 11.4 Dispersion n-function 
function in one period of a 
wiggler magnet Ss 
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and fr pf? |x| nf! * dz. For each half period of the wiggler magnet the contribution to 
the integral is 


Ap/2 Zine 4p 4 «> By 
A 3H dz = w wo Ys 11.85 
I fae 5B. 15 ~ 15 oe 


where the approximation A, < 6, was used. For the whole wiggler magnet with Ny 
periods the total quantum excitation integral is with the deflection angle per wiggler 
half pole Oy, = Kw/kp 


8 Px 
[ al teae~ my Ba O;,. (11.86) 


Similarly, the damping integral for the total wiggler magnet is 
/ K? dz = 1 Ny KwOw . (11.87) 


Inserting expressions (11.80), (11.86), (11.87) into (11.79), we get for the emittance 
ratio 


i+ egw di 5 Ae: 
= 11.88 
Ex0 1+ TN, 2 Oy ( ) 


where (Ho) is the average value of H in the ring bending magnets excluding the 
wiggler magnets. We note from (11.88) that the beam emittance indeed can be 
reduced by wiggler magnets if @,, is kept small. For easier numerical calculation 
we replace (Ho) by the unperturbed beam emittance which is in the limit py — oo 


J P0ExO 
Ho) = ——— 11.89 
(Ho) Cay? ( ) 
and get instead of (11.88) 
4Cq , 
Exw 1 ag Taf, aan ee 
= a : (11.90) 
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The beam emittance is reduced by wiggler magnets whenever the condition 


8 C,. Bb 
oe 2@2 <1 (11.91) 
15x Jy €o Pw 
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is fulfilled. For large numbers of wiggler poles Ny, —> oo the beam emittance reaches 
asymptotically a lower limit given by 


8 C, Be 
Exw > Bele (11.92) 
15x Jy Pw 


In this limit the ultimate beam emittance is independent of the unperturbed beam 
emittance. This derivation did not include any perturbation of the original lattice 
functions due to focusing effects by the wiggler poles. Such perturbations are either 
small or must be compensated such that our assumptions still are valid. 

For many wiggler poles the increase in momentum spread also reaches an 
asymptotic limit which is given from (11.81) 


By, 
B66 ag PE ee (11.93) 
one Pw Bo 


where Bo is the magnetic field strength in the ring magnets. Beam stability and 
acceptance problems may occur if the beam momentum spread is allowed to 
increase too much and therefore inclusion of damping wigglers must be planned 
with some caution. 


11.7 Robinson Wiggler* 


The horizontal betatron motion in a combined function synchrotron FODO lattice 
is not damped because 3 > 1. Beam stability in a synchrotron therefore exists 
only during acceleration when the anti-damping is over compensated by adiabatic 
damping, and the maximum energy achievable in a combined function synchrotron 
is determined when the quantum excitation becomes too large to be compensated 
by adiabatic damping. In an attempt, at the Cambridge Electron Accelerator CEA, 
to convert the synchrotron into a storage ring the problem of horizontal beam 
instability was solved by the proposal [12] to insert a damping wiggler consisting 
of a series of poles with alternating fields and gradients designed such that the 
horizontal partition number becomes positive and —2 < # < 1. 

Such magnets can be used generally to vary the damping partition numbers 
without having to vary the rf-frequency and thereby moving the beam away from 
the center of the beam line. 


11.7.1 Damping Partition and Synchrotron Oscillation 


The damping partition number and damping depend on the relative momentum 
spread of the whole beam. During synchrotron oscillations, significant momentum 
deviations can occur, specifically, in the tails of a Gaussian distribution. Such 
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momentum deviations, although only temporary, can lead to reduced damping or 
outright anti-damping [8]. To quantify this effect, we write (11.72) in the form 


Ad 


_ f2k*1? dz Ap =C¢ Ap 
= = Co 


; (11.94) 
f Kdz Po Po 


The momentum deviation is not a constant but rather oscillates with the 
synchrotron oscillation frequency, 


Ap _ Ap 
Po PO |max 


sin Qt = dmax sin Qt, (11.95) 


where {2 is the synchrotron oscillation frequency. The damping partition number 
oscillates as well (11.94) and the damping decrement is therefore 


1 1 
= (1 Coban SD): (11.96) 
T Tx 


If the perturbation is too large we have anti-damping during part of the 
synchrotron oscillation period. As a consequence the beam is “breathing” in its 
horizontal and longitudinal dimensions while undergoing synchrotron oscillations. 
To quantify this, we calculate similar to (11.56) the total rate of change of the 
betatron oscillation amplitude a’, as defined by the phase space ellipse yu? + 
2auu' + Bul 2 =, composed of quantum excitation and modified damping 


d(a2) 7 (Non €y) #) _ 2(a’) (11.97) 
dt Eo - 


The amplitude a* has the dimension of an emittance but we are interested 
here in the maximum amplitude which can be expressed in terms of a betatron 
amplitude by a? = u?,./B,. Replacing the varying damping time by 17! 
te (1 — dbmaxCp sin 21) (11.97) becomes 


a ae 
| a) = — bmaxCo sin Qt dt, 
Umax To 


which can be readily integrated to give 


2 bmaxCi 
(ur.9,) = (Wraxo) exp | anc ° (1 —cos 20] (11.98) 
T0 


A particle with a betatron amplitude uwmax will, during the course of a syn- 
chrotron oscillation period, reach amplitudes as large as uwmax. The effect is the 
largest for particles with large energy oscillations. On the other hand, the effect 
on the core of the beam is generally very small since da is small. 
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11.7.2. Can We Eliminate the Beam Energy Spread? 


To conclude the discussions on beam manipulation we try to conceive a way to 
eliminate the energy spread in a particle beam. From beam dynamics we know 
that the beam particles can be sorted according to their energy by introducing a 
dispersion function. The distance of a particle from the reference axis is proportional 
to its energy and given by 


xs = D6, (11.99) 


where D is the value of the dispersion at the location under consideration and 
6 = AE/Ep the energy error. For simplicity we make no difference between energy 
and momentum during this discussion. We consider now a cavity excited at a higher 
mode such that the accelerating field is zero along the axis, but varies linearly with 
the distance from the axis. If now the accelerating field, or after integration through 
the cavity, the accelerating voltage off axis is 


eVie(Xs) = SE. (11.100) 


we have just compensated the energy spread in the beam. The particle beam 
has become monochromatic, at least to the accuracy assumed here. In reality the 
dispersion of the beam is not perfect due to the finite beam emittance. 

We will discuss cavity modes and find that the desired mode exists indeed and 
the lowest order of such modes is the TM);9-mode. So far we seem to have made 
no mistake and yet, Liouville’s theorem seems to be violated because this scheme 
does not change the bunch length and the longitudinal emittance has been indeed 
reduced by application of macroscopic fields. 

The problem is that we are by now used to consider transverse and longitudinal 
phase space separate. While this separation is desirable to manage the mathematics 
of beam dynamics, we must not forget, that ultimately beam dynamics occurs in 
six-dimensional phase space. Since Liouville’s theorem must be true, its apparent 
violation warns us to observe changes in other phase space dimensions. In the case 
of beam monochromatization we notice that the transverse beam emittance has been 
increased. The transverse variation of the longitudinal electric field causes by virtue 
of Maxwell’s equations the appearance of transverse magnetic fields which deflect 
the particles transversely thus increasing the transverse phase space at the expense 
of the longitudinal phase space. 

This is a general feature of electromagnetic fields which is known as the 
Panofsky-Wenzel theorem [13] stating that transverse acceleration occurs whenever 
there is a transverse variation of the longitudinal accelerating field. We will discuss 
this in more detail in Sect. 22.1.4. So, indeed we may monochromatize a particle 
beam with the use of a TMj19-mode, but only at the expense of an increase in the 
transverse beam emittance. 
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11.8 Beam Life Time 


Particles travelling along a beam transport line or orbiting in a circular accelerator 
can be lost due to a variety of causes. We ignore the trivial cases of beam loss due 
to technical malfunctioning of beam line components or losses caused by either 
complete physical obstruction of the beam line or a mismatch of vacuum chamber 
aperture and beam dimensions. For a well designed beam transport line or circular 
accelerator we distinguish two main classes for particle loss which are losses due to 
scattering and losses due to instabilities. While particle losses due to scattering with 
other particles is a single particle effect leading to a gradual loss of beam intensity, 
instabilities can lead to catastrophic loss of part or all of the beam. In this chapter we 
will concentrate on single particle losses due to interactions with residual gas atoms. 

The effect of particle scattering on the beam parameters is different in a beam 
transport line compared to a circular accelerator especially compared to storage 
rings. Since a beam passes through transport lines only once, we are not concerned 
about beam life time but rather with the effect of particle scattering on the transverse 
beam size. For storage rings, in contrast, we consider both the effect of scattering 
on the beam emittance as well as the overall effect on the beam lifetime. Since 
long lifetimes of the order of many hours are desired in storage rings even small 
effects can accumulate to reduce beam performance significantly. In proton rings 
continuous scattering with residual gas atoms or with other protons of the same 
beam can change the beam parameters considerably for lack of damping. Even for 
electron beams, where we expect the effects of scattering to vanish within a few 
damping times, we may observe an increase in beam emittance. This is specifically 
true due to intra beam scattering for dense low emittance beams at low energies 
when damping is weak. 

Collisions of particles with components of residual gas atoms, losses due to 
a finite acceptance limited by the physical or dynamic aperture, collisions with 
other particles of the same beam, or with synchrotron radiation photons can lead 
to absorption of the scattered particles or cause large deflections leading to instable 
trajectories and eventual particle loss. The continuous loss of single particles leads 
to a finite beam lifetime and may in severe cases require significant hardware 
modifications or a different mode of operation to restore a reasonable beam lifetime. 

Each of these loss mechanisms has a particular parameter characterizing and 
determining the severity of the losses. Scattering effects with residual gas atoms 
are clearly dominated by the vacuum pressure while scattering effects with other 
particles in the same beam depend on the particle density. Some absorption of 
particles at the vacuum chamber walls will always occur due to the Gaussian 
distribution of particles in space. Even for non-radiating proton beams which are 
initially confined to a small cross section, we observe the development of a halo of 
particles outside the beam proper due to intra beam scattering. The expansion of this 
halo is obviously limited by the vacuum chamber aperture. In circular accelerators 
this aperture limitation may not only be effected by solid vacuum chambers but also 
by “soft walls” due to stability limits imposed by the dynamic aperture. 
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Longitudinal phase or energy oscillations are limited either by the available 
rf-parameters determining the momentum acceptance or by the transverse accep- 
tance at locations, where the dispersion function is nonzero whichever is more 
restrictive. A momentum deviation or spread translates at such locations into a 
widening of the beam and particle loss occurs if the momentum error is too large 
to fit within the stable aperture. Transverse oscillation amplitudes are limited by the 
transverse acceptance as limited by the vacuum chamber wall or by aberrations due 
to nonlinear fields. 


11.8.1 Beam Lifetime and Vacuum 


Particle beams are generally confined within evacuated chambers to avoid excessive 
scattering on residual gas atoms. Considering multiple Coulomb scattering alone 
the rms radial scattering angle of particles with momentum p and velocity f passing 
through a scattering material of thickness L can be described by [14, 15] 


Dicey: 1 i 
ge Z al Ss (11.101) 
Bcp Ly 


where Z is the charge multiplicity of the particle and L, the radiation length of the 
scattering material. We may integrate (11.101) and get the beam radius r of a pencil 
beam after passing through a scatterer of thickness L 


Arch 1h 
pe (11.102) 
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The beam emittance generated by scattering effects is then in both the horizontal 
and vertical plane just the product of the projections of the distance r of the particles 
from the reference path and the radial scattering angles 7 onto the respective plane. 
From (11.101), (11.102) the beam emittance growth due to Coulomb scattering in a 
scatterer of length L is then 


€x,y(rad m) = (11.103) 
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For atmospheric air the radiation length is L, = 300.5 m and a pencil electron 
beam with a momentum of say cp = 1,000MeV passing through 20m of 
atmospheric air would grow through scattering to a beam diameter of 6.9cm or 
to a beam emittance of about 177 mrad mm in each plane. This is much too big 
an increase in beam size to be practical in a 20m beam transport line let alone in 
a circular accelerator or storage ring, where particles are expected to circulate at 
nearly the speed of light for many turns like in a synchrotron or for many hours in a 
storage ring. 
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To avoid beam blow up due to scattering we obviously need to provide an 
evacuated environment to the beam with a residual gas pressure which must be 
the lower the longer the beam is supposed to survive scattering effects. This does 
not mean that beam transport in atmospheric pressure must be avoided at all cost. 
Sometimes it is very useful to let a beam pass though air to provide free access 
for special beam monitoring devices specifically at the end of a beam transport line 
before the beam is injected into a circular accelerator. Obviously, this can be done 
only if the scattering effects through very thin metallic windows and the short length 
of atmospheric air will not spoil the beam emittance too much. 


Elastic Scattering 


As particles travel along an evacuated pipe they occasionally collide with atoms 
of the residual gas. These collisions can be either on nuclei or electrons of the 
residual gas atoms. The physical nature of the collision depends on the mass of 
the colliding partners. Particles heavier than electrons suffer mostly an energy loss 
in collisions with the atomic shell electrons while they lose little or no energy during 
collisions with massive nuclei but are merely deflected from their path by elastic 
scattering. The lighter electrons in contrast suffer both deflection as well as energy 
losses during collisions. 

In this section we concentrate on the elastic scattering process, where the energy 
of the fast particle is not changed. For the purpose of calculating particle beam 
lifetimes due to elastic or Coulomb scattering we ignore screening effects by shell 
electrons and mathematical divergence problems at very small scattering angles. 
The scattering process therefore is described by the classical Rutherford scattering 
with the differential cross section per atom 


do 1 (*) 1 anion 
dQ  4xe9 \2Bcp) sin* (6/2) ’ ; 


where z is the charge multiplicity of the incident particle eZ the charge of the heavy 
scattering nucleus, 6 the scattering angle with respect to the incident path, (2 the 
solid angle with dQ = sin 6 dé dg, and ¢ the polar angle. 

To determine the particle beam lifetime or the particle loss rate we will calculate 
the rate of events for scattering angles larger than a maximum value of 6 which is 
limited by the acceptance of the beam transport line. Any particle being deflected 
by an angle larger than this maximum scattering angle will be lost. We integrate the 
scattering cross section over all angles greater than 6 up to the maximum scattering 
angle z. With n scattering centers or atoms per unit volume and N beam particles 
the loss rate is 


dN mq 
-—= 2repnn | <a Sin 8 dd. (11.105) 


388 11 Particle Beam Parameters 


Under normal conditions at 0°C and a gas pressure of 760mm mercury the 
number of scattering centers in a homogeneous gas is equal to twice Avogadro’s 
number A and becomes for an arbitrary gas pressure P 


P(Torr) 


=34 
? 760 


= 2 x 2.68675 x | (11.106) 
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The factor 2 comes from the fact that homogeneous gases are composed of 
two atomic molecules, where each atom acts as a separate scattering center. This 
assumption would not be true for single atomic noble gases which we do not 
consider here, but will be included in a later generalization. The integral on the 
r.h.s. of (11.105) becomes with (11.104) 


[= — (11.107) 
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Dividing (11.105) by N we find an exponential decay of beam intensity with time 
N=Noe"", (11.108) 


where the decay time constant or beam lifetime is 


(11.109) 


—— soy (= ) An 
760 \2Bcp/ tan?(6/2) 

The maximum acceptable scattering angle 6 is limited by the acceptance €, of 
the beam transport line. A particle being scattered by an angle @ at a location where 
the betatron function has the value fg reaches a maximum betatron oscillation 
amplitude of a = ./ Ba Be 0 elsewhere along the beam transport line where the 
betatron function is B,. The minimum value of A?/, along the ring lattice, where 
A is the vacuum chamber aperture or the limit of the dynamic aperture whichever is 
smaller, is equal to the ring acceptance 


AZ 
Ba min 
For simplicity we ignore here the variation of the betatron function and take 


an average value (6) at the location of the scattering event and get finally for the 
maximum allowable scattering angle 


EA (11.110) 
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This angle is generally rather small and we may set tan( 6 /2)% (6 /2). Utilizing 
these definitions and approximations we obtain for the lifetime of a beam made up 
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of singly charged particles z = 1 due to elastic Coulomb scattering expressed in 
more practical units 


(cp) (GeV’) €a (mm mrad) 
(B (m)) P (nTorr) 


Ts (hours) = 10.25 (11.112) 


where we have assumed that the residual gas composition is equivalent to nitrogen 
gas N2 with Z? ~ 49. The Coulomb scattering lifetime is proportional to the ring 
acceptance or proportional to the square of the aperture A where A?/B is a minimum. 

The particle loss due to Coulomb scattering is most severe at low energies and 
increases with the acceptance of the beam transport line. Furthermore, the beam 
lifetime depends on the focusing in the transport line through the average value of 
the betatron function. If instead of averaging the betatron function we integrate the 
contributions to the beam lifetime along the transport line we find that the effect of 
the scattering event depends on the betatron function at the location of the collision 
and the probability that such a collision occurs at this location depends on the gas 
pressure there. Therefore, it is prudent to not only minimize the magnitude of the 
betatron functions alone but rather minimize the product BP along the transport 
line. Specifically, where large values of the betatron function cannot be avoided, 
extra pumping capacity should be provided to reach locally a low vacuum pressure 
for long Coulomb scattering lifetime. 

We have made several simplifications and approximations by assuming a homo- 
geneous gas and assuming that the maximum scattering angle be the same in all 
directions. In practical situations, however, the acceptance need not be the same in 
the vertical and horizontal plane. First we will derive the beam lifetime for non- 
isotropic aperture limits. We assume that the apertures in the horizontal and vertical 
plane allow maximum scattering angles of 6, and 6y. Particles are then lost if the 
scattering angle @ into a polar angle y exceeds the limits 
6, 


6, 
and @> — (11.113) 


6> - ; 
COS @ sin @ 


The horizontal aperture will be relevant for all particles scattered into a polar 
angle between zero and arctan(O, / 6.) while particles scattered into a polar angle 
of arctan(6, / 6.) and z/2 will be absorbed by the vertical aperture whenever the 
scattering angle exceeds this limit. We calculate the losses in only one quadrant of 
the polar variable and multiply the result by 4 since the scattering and absorption 
process is symmetric about the polar axis. The integral (11.107) becomes in this case 


™ sin 6 dO dy arctan(6,/4x) i sin 6 dé 
/ —-~ = af ae | re (11.114) 
6 sin® (6/2) 0 4,/cosy sin® (6/2) 


af P s sin 0 d6é 
. GF |. Sa oan 
arctan(6,/6,) 6,/ sing sin‘ (0/2) 
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The solutions of the integrals are similar to that in (11.107) and we get 


= [x + (R° + 1) sin (2 arctan R) (11.115) 


[ sinddédpy 8 
g sin4(0/2) 2 


+2 (R— 1)arctanR] , 


where R = 6, / 6. 

Using (11.115) instead of (11.107) in (11.109) gives a more accurate expression 
for the beam lifetime due to Coulomb scattering. We note that for R = 1 we do 
not get exactly the lifetime (11.109) but find a lifetime that is larger by a factor 
of 1 + 2/2. This is because we used a rectangular aperture in (11.115) compared 
to a circular aperture in (11.107). The beam lifetime (11.112) becomes now for a 
rectangular acceptance 


_ 2x (cp)?(GeV7) €,(mm mrad) 
Tes(hours) = 10.25 FR) (Bam) P(nTom : (11.116) 


The function F(R) 
F(R) = [a + (R? + 1) sin(2 arctan R) + 2(R? — 1) arctanR (11.117) 


is shown in Fig. 11.5. For some special cases the factor 2 2/ F(R) assumes the values 


shape of aperture round square rectangular 
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Tacitly we have assumed that the vertical acceptance is smaller than the 
horizontal acceptance which in most cases is true. In cases, where A, > 6, we 
may use the same equations with x and y exchanged. 

Particles performing large amplitude betatron oscillations form a Coulomb 
scattering halo around the beam proper. In case of an electron storage ring the 
particle intensity in this halo reaches an equilibrium between the constant supply 
of scattered electrons and synchrotron radiation damping. 

The deviation of the particle density distribution from a Gaussian distribution 
due to scattering can be observed and measured. In Fig. 11.6 beam lifetime 
measurements are shown for an electron beam in a storage ring as a function of 
a variable ring acceptance as established by a movable scraper. The abscissa is 
the actual position of the scraper during the beam lifetime measurement, while 
the variable for the ordinate is the aperture for which a pure Gaussian particle 
distribution would give the same beam lifetime. 


pure Gaussian 


distribution tail due to Coulomb 
Ng ° scattering 
rae c x 
yee OR . ‘ 
x + ee ee ae ae 105 
? 4 - beam lifetime = 
T(sec) 
(+ 4 
4 Ke 10 
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Yee(mm) | 
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| | | | 
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Fig. 11.6 Measurement of beam lifetime in an electron storage ring with a movable scraper. The 
curve on the left shows the Coulomp scattering halo for amplitudes larger than 60 indicating a 
strong deviation from Gaussian particle distribution. the curve on the right shows the beam life 
time as a function of scraper position 
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If the particle distribution had been purely Gaussian the measured points would 
lie along a straight line. In reality, however, we observe an overpopulation of 
particles in the tails of the distribution for amplitudes larger than about 60 forcing 
the scraper to be located farther away from the beam center to get a beam lifetime 
equal to that of a pure Gaussian distribution. This overpopulation or halo at large 
amplitudes is due to elastic Coulomb scattering on the residual gas atoms. 

Since the acceptance of the storage ring is proportional to the square of the 
aperture at the scraper, we expect the beam lifetime due to Coulomb scattering to 
vary proportional to the square of the scraper position. This is shown in Fig. 11.7 
for good vacuum and poor vacuum conditions. In the case of poor vacuum we find 
a saturation of the beam lifetime at large scraper openings which indicates that the 
scraper is no longer the limiting aperture in the ring. This measurement therefore 
allows an accurate determination of the physical ring acceptance or the dynamic 
aperture whichever is smaller. 

So far we have assumed the residual gas to consist of homogeneous two atom 
molecules. This is not an accurate description of the real composition of the residual 
gas although on average the residual gas composition is equivalent to a nitrogen 
gas. Where the effects of a more complex gas composition becomes important, 
we apply (11.109) to each different molecule and atom of the residual gas and we 
replace the relevant factor P Z* by a summation over all gas components. If P; is 
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Fig. 11.7 Beam lifetime in an electron storage ring as a function of the acceptance. The transition 
of the curve on the right from a linear dependence of beam lifetime on the acceptance to a constant 
life time occurs when the acceptance due to the scraper position is equal to the ring acceptance 
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the partial pressure of the molecules i and Z; the atomic number of the atom j in the 
molecule i we replace in (11.109) 


PE > oP Z (11.118) 
ij 
and sum over all atoms i in the molecule j. 


Inelastic Scattering 


Charged particles passing through matter become deflected by strong electrical 
fields from the atomic nuclei. This deflection constitutes an acceleration and 
the charged particles lose energy through emission of radiation which is called 
bremsstrahlung. If this energy loss is too large such that the particle energy error 
becomes larger than the storage ring energy acceptance the particle gets lost. We 
are therefore interested to calculate the probability for such large energy losses to 
estimate the beam lifetime. 

The probability to suffer a relative energy loss 6 =dE/Ep due to such an inelastic 
scattering process has been derived by Bethe and Heitler [16, 17]. For extreme 
relativistic particles and full screening this probability per unit thickness of matter 
is [17] 


_ dé 2-25+6 2 1832 
dP = 26n= (1- ac) O1 Ll 
ns ( 0|( i=! 5) nats seers 


where n is the number of atoms per unit volume and the factor ¢ is with the fine 
structure constant a = 1/137 


= P72. (11.120) 


We integrate this probability over all energy losses larger than the energy acceptance 
of the storage ring 6 > 6,¢¢ and get after some manipulation and setting dace K 1 


1 
= 2 
2G n Si -8) I(- cca =) 2in + 3 (11.121) 
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The radiation length L, is defined as the distance over which the particle energy 
has dropped to 1/e due to inelastic scattering. For highly relativistic particles this 
length is given by [17] 


Z =@n(41 ee ae 11.122 
—_ nN nat 5 < ( ‘ ) 


Combining (11.121) and (11.122), we find the simple solution that the probability 
for a particle to suffer a relative energy loss of more than dacc per radiation length is 


4 
Prad = — 3 In Succ - (11,123) 


To calculate the beam lifetime or beam decay rate due to bremsstrahlung we note 
that the probability for a particle loss per unit time is equal to the beam decay rate or 
equal to the inverse of the beam lifetime. The bremsstrahlung lifetime is therefore 


1 dN 4 
th = ce eee ae (11.124) 
No dt Ty. B15 


The radiation length for gases are usually expressed for a standard temperature 
of 20°C and a pressure of 760 Torr. Under vacuum conditions the radiation length 
of the residual gas is therefore increased by the factor 760/Pton. We recognize 
again the complex composition of the residual gas and define an effective radiation 
length by 


ay 24. (11.125) 


where L,; is the radiation length for gas molecules of type i. The beam lifetime due 
to bremsstrahlung for a composite residual gas is from (11.124), (11.125) 


Ths = Ole In dace 5 (11.126) 


where P; is the residual partial gas pressure for gas molecules of type 7. Although 
the residual gas of ultra high vacuum systems rarely includes a significant amount 
of nitrogen gas, the average value for (Z*) of the residual gas components is 
approximately 50 or equivalent to nitrogen gas. For all practical purposes we may 
therefore assume the residual gas to be nitrogen with a radiation length under normal 
conditions of L;n, = 290 m and scaling to the actual vacuum pressure Pyac we get 
for the beam lifetime 


1 
T._ (hours~') = 0.00653 Pyac(nTorr) In : (11.127) 


acc 
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Basically the bremsstrahlung lifetime depends only on the vacuum pressure and 
the energy acceptance and the product of beam lifetime and vacuum pressure is a 
function of the energy acceptance bac. = Ay/y, 


153.14 


Tps(hour) P(nTorr) = in(yfAy) ; 


(11.128) 


In tabular form we get: 


dace = Ay/y 0.005 0.010 0.015 0.020 0.025 
t(hr) P(nTorr) 28.90 33.25 36.46 39.15 41.51 


There are many more forms of interaction possible between energetic particles 
and residual gas atoms. Chemical, atomic, and nuclear reactions leading to the 
formation of new molecules like ozone, ionization of atoms or radioactive products 
contribute further to energy loss of the beam particles and eventual loss from the 
beam. These effects, however, are very small compared to Coulomb scattering or 
bremsstrahlung losses and may therefore be neglected in the estimation of beam 
lifetime. 


11.8.2. Ultra High Vacuum System 


Accelerated particles interact strongly with residual gas atoms and molecules by 
elastic and inelastic collisions. To minimize particle loss due to such collisions 
we provide an evacuated beam pipe along the desired beam path. For open beam 
transport systems high vacuum of 10~°-10~7 Torr is sufficient. This is even 
sufficient for pulsed circular accelerators like synchrotrons, where the particles 
remain only for a short time. In storage rings, however, particles are expected to 
circulate for hours and therefore ultra high vacuum conditions must be created. 


Thermal Gas Desorption 


To reach very low gas pressures in the region of 10~!°-10~!! Torr in the regime 
of ultra high vacuum (UHV) we must consider the continuous desorption of gas 
molecules from the walls due to thermal desorption. Gas molecules adsorbed on the 
chamber surface are in thermal equilibrium with the environment and the thermal 
energy of the molecules assumes a statistically determined Boltzmann distribution. 
This distribution includes a finite probability for molecules to gain a large enough 
amount of energy to overcome the adsorption energy and be released from the wall. 
The total gas flow from the wall due to this thermal gas desorption depends 
mostly on the preparation of the material. While for carefully cleaned surfaces the 
thermal desorption coefficient may be of the order of 10~!7-10~ Torr It/sec/em? a 
bakeout to 140-300°C can reduce this coefficient by another order of magnitude. 
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Synchrotron Radiation Induced Desorption 


In high-energy electron or positron accelerators a significant amount of energy is 
emitted in form of synchrotron radiation. This radiation is absorbed by vacuum 
chamber walls and causes not only a heating effect of the chamber walls but also the 
desorption of gas molecules adsorbed on the surface. 

The physical process of photon induced gas desorption evolves in two steps [18]. 
First a photon hitting the chamber walls causes a secondary electron emission with 
the probability 7.(¢), where ¢ is the photon energy. Secondly, the emission as well 
as the subsequent absorption of that photoelectron can desorb neutral atoms from 
the chamber surface with the probability ng. To calculate the total desorption in a 
storage ring, we start from the differential synchrotron radiation photon flux (24.56) 
which we integrate over the ring circumference and write now in the form 


dN(e) 82a I, Aw 
dt 9 “eo 


S(g) (11.129) 


where € = hw is the photon energy, J, the beam current, EF the beam energy and 
S(¢) a mathematical function defined by (24.57). 

The photoelectron current Nz results from the folding of (11.129) with the 
photoelectron emission coefficient 7.(@) for the material used to construct the 
vacuum chamber and the integration over all photon energies, 


lee) 
= 8a et [ Es (=) ani, (11.130) 
0 


9 e mc? We 


The photoelectron emission coefficient depends on the choice of the material 
for the vacuum chamber. Figure 11.8 displays the photoelectron coefficient for 
aluminum as a function of photon energy [19]. 
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We note there are virtually no photoelectrons for photon energies of less 
than 10eV. At 1,460eV the K-edge of aluminum causes a sharp increase of the 
coefficient followed by a monotonous decrease for higher photon energies. 

The photoelectron coefficient depends not only on the material of the photon 
absorber but also on the incident angle. The probability to release an electron from 
the surface is increased for shallow incidence of the photon. The enhancement factor 
F(@) represents the increase in the photoelectron-emission coefficient 7.(€) due to 
anon normal incidence of a photon on the surface, where @ is the angle between the 
photon trajectory and the plane to the absorbing surface. For angles close to normal 
incidence (© = 90°) the enhancement factor scales like the inverse of the sine of 
the angle 


F(O)= (11.131) 


sin ~ 

For small angles, however, the enhancement factor falls off from the inverse sine 
dependence as has been determined by measurements [20] and reaches a maximum 
value of about seven for small angles. The gas production is determined by the 
desorption rate Q, defined as the total number of neutral atoms released along the 
circumference from the chamber surface, 


22.4 x 760 eas 
6x10s ° "4 


where Q is expressed in Torr It/sec and nq is the desorption coefficient. The factor 2 
is due to the fact that a photo electron can desorb an atom while leaving or arriving 
at a surface. With (11.132) we get the average vacuum pressure (P) from 


Q=2 


(P) = 3° (11.133) 
where S is the total installed pumping speed in the storage ring. For a reasonably 
accurate estimate of the photon flux we may use the small argument approxima- 
tion (24.60) for photon energies ¢ < &,. Photons of higher energies generally do 
not contribute significantly to the desorption since there are only few. To obtain the 
photon flux we therefore need to integrate only from 10 eV to € ~ e, the differential 
photon flux (24.60) folded with the photoelectron-emission coefficient ne(e). 

The desorption coefficient nq is largely determined by the treatment of the 
vacuum chamber like baking, beam cleaning, argon discharge cleaning, etc. For 
example in the aluminum chamber of the storage ring SPEAR [21] the desorption 
coefficient at 1.5 GeV was initially about ny ~ 5 x 1077 then 5 x 1074 after 1 
month of operation, 10~+ after 2 months of operation and reached about 3 x 10~° 
after about 1 year of operation. These numbers are not to be viewed too generally, 
since the cleaning process depends strongly on the particular preparation of the 
surfaces. However, following well established cleaning procedures and handling of 
ultra high vacuum components these numbers can be of general guidance consistent 
with observations on other storage rings. 
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Laboratory measurements [19] show the following relationship between photo- 
electron current Jphe = e Ne, desorption coefficient 74 and total integrated beam time 
of a vacuum system 


Tye (A) 
t(hr)-63 7 


Na = 7x 10> (11.134) 


New vacuum chambers release much gas when the first synchrotron radiation strikes 
the surface, but cleans quickly as the radiation cleaning continues. 


Problems 


11.1 (S). The Rf-frequency of a storage ring is 500 MHz. Every bucket is filled 
with particles. What is the time difference between successive bunches? 


11.2 (S). Calculate the synchrotron damping time for a 3 GeV storage ring with a 
bending radius of p = 10m and pure rectangular dipole magnets. Assume 100 % 
bending magnet fill factor. What is the synchrotron damping time in this ring? How 
long does it take to radiate away all its energy? 


11.3. Consider a circular electron storage ring of your choice and specify beam 
energy, current, ring circumference and average vacuum chamber dimensions. 
Calculate the total thermal gas desorption and the total required pumping capacity 
in the ring. Now add synchrotron radiation and estimate the increase of pumping 
speed needed after say 100 Ah of beam operation. Plot the average gas pressure as 
a function of integrated beam time. 


11.4. An electron beam circulating in a 1.5 GeV storage ring emits synchrotron 
radiation. The rms emission angle of photons is 1/y about the forward direction 
of the particle trajectory. Determine the photon phase space distribution at the 
source point and at a distance of 10m away while ignoring the finite particle beam 
emittance. Now assume a Gaussian particle distribution with a horizontal beam 
emittance of €, = 1.5 x 10~’ radm. Fold both the photon and particle distributions 
and determine the photon phase space distribution 10m away from the source 
point if the electron beam size is o, = 1.225mm, the electron beam divergence 
0, = 0.1225 mrad and the source point is a symmetry point of the storage ring. 
Assume the dispersion function to vanish at the source point. For what minimum 
photon wavelength would the vertical electron beam size appear diffraction limited 
if the emittance coupling is 10 %? 


11.5. Consider an electron beam in an isomagnetic 6GeV storage ring with a 
bending radius of p = 20m . Calculate the rms energy spread o,/Eo and the 
synchrotron oscillation damping time T;. 
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Chapter 12 
Vlasov and Fokker—Planck Equations* 


Mathematical tools have been derived in previous chapters to describe the dynamics 
of singly charged particles in electromagnetic fields. While the knowledge of 
single-particle dynamics is essential for the development of particle beam transport 
systems, we are still missing a formal treatment of the behavior of multiparticle 
beams. In principle a multiparticle beam can be described simply by calculating the 
trajectories of every single particle within this beam, a procedure that is obviously 
too inefficient to be useful for the description of any real beam involving a very large 
number of particles. 

In this paragraph we will derive concepts to describe the collective dynamics of a 
beam composed of a large number of particles and its evolution along a transport line 
utilizing statistical methods that lead to well defined descriptions of the total beam 
parameters. Mathematical problems arise only when we have a particle beam with 
neither few particles nor very many particles. Numerical methods must be employed 
if the number of particles are of importance and where statistical methods would 
lead to incorrect results. 

The evolution of a particle beam has been derived based on Liouville’s theorem 
assuring the constancy of the particle density in phase space. However, this concept 
has not allowed us to determine modifications of particle distributions due to 
external forces. Particle distributions are greatly determined by particle source 
parameters, quantum effects due to synchrotron radiation, nonlinear magnetic fields, 
collisions with other particles in the same beam, with particles in another beam 
or with atoms of the residual gases in the beam environment to name only a few 
phenomena that could influence that distribution. In this chapter, we will derive 
mathematical methods that allow the determination of particle distributions under 
the influence of various external electromagnetic forces. 
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12.1 The Vlasov Equation 


To study the development of a particle beam along a transport line, we will 
concentrate on the evolution of a particle density distribution function W(r, p,t) in 
six-dimensional phase space where every particle is represented by a single point. 
We consider a volume element of phase space that is small enough that we may 
assume the particle density to be constant throughout that element and determine its 
evolution in time. In doing so, we will further assume a large, statistically significant 
number of particles in each volume element and only a slow variation of the particle 
density from one volume element to any adjacent volume element. To simplify 
the equations, we restrict the following discussion to two-dimensional phase space 
(w, pw) and use exclusively normalized coordinates. The derivation is exactly the 
same for other coordinates. 

The dynamics of a collection of particles can be studied by observing the 
evolution of their phase space. Specifically, we may choose a particular phase space 
element and follow it along its path taking into account the forces acting on it. To 
do this, we select a phase space element in form of a rectangular box defined by the 
four corner points P; in Fig. 12.1. 

At the time ¢ these corners have the coordinates 


Py (wv, Pw) J 

Po(w + Aw, pw), (12.1) 

P3(w+ Aw, py + Apw), 

P4(w, Pw + Apw) : 
A short time At later, this rectangular box will have moved and may be deformed 
into a new form of a quadrilateral (Qi, Q2, Q3,Q4) as shown in Fig. 12.1. In 
determining the volume of the new box at time ¢ + At we will assume the 


conservation of particles allowing no particles to be generated or getting lost. To 
keep the derivation general the rate of change in the conjugate variables is defined by 


w = fol, Pw; t) > 
Pw = gwlw, Pw; t) ; 


(129) 


Fig. 12.1 Two-dimensional 
motion of a rectangle in phase 
space 
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where w = dw/dt and p,, = dp,,/dt and the time interval Af is small enough 
to allow linear expansion of the particle motion. In other words, the time interval 
shall be chosen such that no physical parameters of the dynamical system change 
significantly and a Taylor’s expansion up to linear terms can be applied. The new 
corners of the volume element are then given by 


Qi [w+ fulW, Pw. t) At, Pw + 8w(W, Pw, t) Ad] , 

Qo [w+ Aw + fu(w + Aw, py, t) At, 
Pw + 8wlw + Aw, py, t) At] , 

Q3[w + Aw + fu(w + Aw, py + Apw, 2) At, (12.3) 
Pw t+ Apwt gw(w+ Aw, pw + Apy,t) Ad] , 

Os [w + fw (W, Pw + Apw,t) At, 
Pw+ Apw + 8w(W, Pw + Ap, t) At] . 


The goal of our discussion is now to derive an expression for the particle density 
W(w, pw, t) after a time At. Because of the conservation of particles we have 


W(w +h At, Pw an 8w At, t=F At) AAg = Ww, Pws t) AAp > (12.4) 
where AAp and AAg are the areas in phase space as defined by the corner points 
P; and Q;, respectively. From Fig. 12.1 and (12.1) we derive an expression for the 
phase space areas which are at the starting time rf 


AAp = Aw Apy (12.5) 


and at the time t + Ar from (12.3) 


AAg = Aw Apy | 1+ (+: + “i ) At} , (12.6) 
dws OD 


where Taylor’s expansions have been used for the functions f,, and g,, retaining 
only linear terms. To prove (12.6) we note that the area AAp has the form of a 
quadrilateral with its sides determined by two vectors and the area, therefore, is 
equal to the determinant formed by these two vectors. In our case these vectors are 
P,, = (Aw, 0) pointing from P; to Pz and p, = (0, Ap») pointing from P; to Py. 
The area therefore is 


Aw 0 


= Aw Ap, = AA 12.7 
0 Ap, w Ap P (12.7) 


Pw»Ppl = | 


in agreement with (12.5). A time interval At later these vectors will have changed as 
determined by (12.2). Each of the corner points P; is moving although with different 
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speed thus distorting the rectangle P; into the shape Q; of Fig. 12.1. To calculate 
the new vectors defining the distorted area we expand the functions f, and gy in 
a Taylor’s series at the point (w, p,,). While, for example, the w-component of the 
movement of point P; along the w coordinate is given by f,, At the same component 
for Pz changes by f,, At+ Tho Aw At. The w-component of the vector q,, = Q1 —Q2 
therefore becomes Aw + She Aw At. Similarly, we can calculate the p-component 
of this vector as well as both components for the vector g, = Q; — Q4. The phase 
space area of the distorted rectangle (Q;, Q2, Q3, Q4) at time t+ At with these vector 
components is then given by 


Aw+ % Aw At  #* Ap, At 


OPw 
Sw 
ae Apy At 


| = AAg. 12.8 
de Aw At  Apy + 4 . ve) 


lQw> Ql = 


Dropping second-order terms in At we get indeed the expression (12.6). Obviously, 
the phase space volume does not change if 


Of Ow 
=0 12.9 
dw 7 OPw ( ) 


in agreement with the result obtained in Chap. 8, where we have assumed that the 
Lorentz force is the only force acting on the particle. In this paragraph, however, 
we have made no such restrictions and it is this generality that allows us to 
derive, at least in principle, the particle distribution under the influence of any 
forces. Equation (12.9) tells us that there is no damping if the velocity w = fi, 


is independent of the position and the forces p = g, are independent of the 
momentum. 
The factor 
0 Ww ) Ww 
4 (Se 4 28") ay (12.10) 
dw OP 


in (12.6) is the general Wronskian of the transformation and is not necessarily equal 
to unity. We have such an example in the form of adiabatic damping. Indeed we 
have damping or anti-damping whenever the Wronskian is different from unity. 

To illustrate this, we use the example of a damped harmonic oscillator, which is 
described by the second-order differential equation wv + 2a,,% + ww = 0, or in 
form of a set of two linear differential equations 


W = WoPw = fw, Pw. ), 


: (12.11) 
Pw = —WoWw — Cy Pw = Sw, Pws t). 
From this we find indeed the relation 
fy O08 
iv 5 Bw oy (12.12) 


ow Pw 
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where @,, is the damping decrement! of the oscillator. We have obtained on a general 
basis that the phase space density for harmonic oscillators will vary only if damping 
forces are present. Here we use the term damping in a very general way including 
excitation depending on the sign of the damping decrement a,,. The designation 
a,, for the damping decrement may potentially lead to some confusion with the 
same use for the betatron function a = —4 ’. However, we choose here to rather 
require some care than introduce against common use new designations for the 
damping decrement or the betatron functions. We also note that for all cases where 
the damping time is long compared to the oscillation time, and we consider here 
only such cases, the damping occurs for both conjugate trajectories. 

The derivation in two-dimensional phase space can easily be generalized to six- 
dimensional phase space with the generalized volume element 


AVp = ArAp (12.13) 
at time ¢ and a time interval Af later 
AVo = ArAp[l + V.f At+ Vpg At]. (12.14) 


The Nabla operators are defined by 


vu (2 2 2) wa vaf(h 22 — 
" \ dw’ dv’ du sa PN apy’ dpy” dp, J? : 


where (w, v, u) are normalized variables and the vector functions f and g are defined 
by the components f = (fv, fv.fu) and g = (gw, 8v, Bu)- 

Equation (12.4) can now be reduced further after applying a Taylor’s expansion 
to the density function YW. With (12.5), (12.6) and keeping only linear terms 


fe eg | Ot ap (12.16) 
a aw ny aw apy)” 


It is straightforward to generalize this result again to six-dimensional phase space 


aw 
ar td Vey So (V.f+Vpg) VY, (12.17) 


which is called the Vlasov equation. If there is no damping the r.h.s. of the Vlasov 
equation vanishes and we have 


aw 
a +E VY +eV,¥ =0. (12.18) 


'The letter a, is used here for the damping decrement. Since in beam dynamics @, is also used 
to identify a lattice function, a mixup of the quantities could occur. We have chosen not to use 
a different nomenclature, however, since this choice is too deeply entrenched in the community. 
With some care, confusion can be avoided. 
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This is simply the total time derivative of the phase space density W telling us 
that in the absence of damping it remains a constant of motion. The preservation of 
the phase space density is Liouville’s theorem and we have demonstrated in this 
paragraph the validity of this theorem for a Hamiltonian system with vanishing 
dissipating forces (Vf + Vpg) = 0. 

Equation (12.18) describes the evolution of a multiparticle system in phase space 
where the physics of the particular particle dynamics is introduced through the 
functions f (r, p, t) and g(r, p, t). The definition of these functions in (12.2) appears 
similar to that for the Hamiltonian equations of motion. In case r and p are canonical 
variables we may indeed derive these functions from the Hamiltonian 


ravi =7, 
(12.19) 
Pp = —V_-H =8, 


where H is the Hamiltonian of the system. We are therefore, at least in principle, 
able to solve the evolution of a multiparticle system in phase space if its Hamiltonian 
is known. It should be emphasized, however, that the variables (w, p) need not be 
canonical to be used in the Vlasov equation. 

It is interesting to apply the Vlasov equation to simple one-dimensional harmonic 
oscillators with vanishing perturbation. Introducing the canonical variable p through 
w = vp, the Hamiltonian becomes Ho = up" + suw and the equations of motion 
are 


ie + = pas, 
(12.20) 
p= — #Ho =-Vvw=g. 


It is customary for harmonic oscillators and similarly for particle beam dynamics 
to use the oscillation phase as the independent or “time” variable. Since we have not 
made any specific use of the real time in the derivation of the Vlasov equation, we 
choose here the phase as the “time” variable. For the simple case of an undamped 
harmonic oscillator a = 0 and it = 0 and consequently the Vlasov equation 
becomes from (12.16) with (12.20) 


ow ow ow 
+ vp —vw =0. (12.21) 
0) ow op 


In cylindrical phase space coordinates (w = rcos 6, p = rsin@, @) this reduces to 
the simple equation 


——y—=0. (12.22) 
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Fig. 12.2. Beam motion in 
phase space 


Y(w.Pws®) 


Any differentiable function with the argument (r,@ + vg) can be a solution 
of (12.22) describing the evolution of the particle density W with time 


Y(w, Pw. 9) = F(r,8 + vg), (12.23) 


Any arbitrary particle distribution in (w, p,,)-phase space merely rotates about the 
center with the frequency v and remains otherwise unchanged as shown in Fig. 12.2. 
This is just another way of saying that an ensemble of many particles behaves like 
the sum of all individual particles since any interaction between particles as well as 
damping forces have been ignored. In (x, x’)-phase space this rotation is deformed 
into a “rotation” along elliptical trajectories. The equation of motion in (w, p,)- 
phase space is solved by r = const indicating that the amplitude r is a constant of 
motion. In (x,x’)-phase space we set w = x/,/B andp = \/Bx’ + WF x and get 


2 


from r? = w* + p2 for this constant of motion 


Bx? + 2axx’ + yx = const (12.24) 


which is the Courant-Snyder invariant. The Vlasov equation allows us to generalize 
this result collectively to all particles in a beam. Any particular particle distri- 
bution a beam may have at the beginning of the beam transport line or circular 
accelerator will be preserved as long as damping or other statistical effects are 
absent. 
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12.1.1 Betatron Oscillations and Perturbations 


The Vlasov equation will prove to be a useful tool to derive particle beam 
parameters. Specifically, it allows us to study the influence of arbitrary macroscopic 
fields on particle density in phase space and on the characteristic frequency of 
particle motion. To demonstrate this, we expand the example of the harmonic 
oscillator to include also perturbation terms. For such a perturbed system the 
equation of motion is 


iv + vow = vB? Dp, Bw", (12.25) 


n>0 


where the coefficients p, are the strength parameters for the nth order perturbation 
term and the amplitude w is the normalized betatron oscillation amplitude. The 
Vlasov equation allows us to calculate the impact of these perturbation terms on 
the betatron frequency. We demonstrate this first with a linear perturbation term 
(n = 1) caused by a gradient field error p) = —6k in a quadrupole. In this case the 
equation of motion is from (12.25) 


iv + vow = —v6B*dkw (12.26) 
or 
w+ vo(1 + B76k)w = 0. (12.27) 


This second-order differential equation can be replaced by two first-order 
differential equations which is in general the most straight forward way to obtain 
the functions (12.2) 


W = vo/1+ B*dkp 
p= —voV1+ Brbkw. 


Here it is assumed that the betatron function 6 and the quadrupole field error 
6k are uniformly distributed along the beam line and therefore can be treated as 
constants. This approach is justified since we are interested only in the average 
oscillation frequency of the particles and not in fast oscillating terms. The desired 
result can be derived directly from (12.28) without any further mathematical 
manipulation by comparison with (12.20). From there the oscillating frequency for 
the perturbed system is given by 


(12.28) 


v = v9 V1 + 26k & vo (1 + 48764), (12.29) 
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for small perturbations. The betatron frequency shift can be expressed by the lowest 
order harmonic of the Fourier expansion for the periodic perturbation function 
Vo B? 5k to give 


2m vo (B*sk), = f v0p?3kap = Bak dc (12.30) 


making use of the definition for the betatron phase dg =dz/vof. The tune shift dv 
due to quadrupole field errors is therefore from (12.29) 


bv=v-Vvy= a Bokae. (12.31) 
4n 


in agreement with (15.64). Again, the Vlasov equation confirms this result for 
all particles irrespective of the distribution in phase space. This procedure can be 
expanded to any order of perturbation. From the differential equation (12.25) one 
gets in analogy to the equations of motion (12.28) 


y= vy _ B3/? Spl" Dp, 


n>0 


(12.32) 


Pp = —V te B3/2 Se pnBr/2w"! w. 
n>0 
For small perturbations the solution for the unperturbed harmonic oscillator 
w(~) = Wo Sin(vog + 6) may be used where 6 is an arbitrary phase constant. The 
tune shift Av = v — vo is thus while integrating over all perturbations around a 
circular accelerator 


1 n+1 = oye 
Av=-)) x ppb Z wo | sin”! [vop(z) + 4] dz, (12.33) 


n>0 


where we have changed the independent variable from ¢ to z by dz = vofdg. 

Not all perturbation terms contribute to a tune variation. All even terms n = 2m, 
where m is an integer, integrate, for example, to zero in this approximation and 
a sextupole field therefore does not contribute to a tune shift or tune spread. 
This conclusion must be modified, however, due to higher-order approximations 
which become necessary when perturbations cannot be considered small anymore. 
Furthermore, we find from (12.33) that the tune shift is independent of the particle 
oscillation amplitude only for quadrupole field errors n = 1. For higher-order 
multipoles the tune shift becomes amplitude dependent resulting in a tune spread 
within the particle beam rather than a coherent tune shift for all particles of the 
beam. 
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In a particular example, the tune spread caused by a single octupole (n = 3) ina 
circular accelerator is given by 


Ew 


Av3 = -= b pb? dz, (12.34) 


where Wo = €, is the emittance of the beam. Similar results can be found for higher- 
order multipoles. 


12.1.2. Damping 


At the beginning of this section we have decided to ignore damping and have 
used the undamped Vlasov equation (12.18). Damping or anti-damping effects do, 
however, occur in real systems and it is interesting to investigate if the Vlasov 
equation can be used to derive some general insight into damped systems as well. 
For a damped oscillator we use (12.11), (12.12) to form the Vlasov equation in the 
form of (12.16). Instead of the phase we now use the real time as the independent 
variable to allow the intuitive definition of the damping decrement as the relative 
decay of the oscillation amplitude with time 


ow ow ow 
— + WoPw— — (Mow + 2Qypyw)— = +20. (12.35) 
ot aw OPw 


This partial differential equation can be solved analytically in a way similar to 
the solution of the undamped harmonic oscillator by using cylindrical coordinates. 
For very weak damping we expect a solution close to (12.23) where the amplitude 
r in phase space was a constant of motion. For a damped oscillator we try to form a 
similar invariant from the solution of a damped harmonic oscillator 


w = woe cos fw), —a2t = re cosd. (12.36) 


With the conjugate component wo py, = w, we form the expression 


0 Pw + yw = : = : 
eS —woe ' sin \/@2 — a2 t = —re~" sin 6 (12.37) 


2 2 
Wp — &, 


and eliminate the phase 6 from (12.36), (12.37) keeping only terms linear in the 
damping decrement a,, to obtain the “invariant” 


Ay 
ee 2%wt yr 4 i as 27 WPw . (12.38) 
0 
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Obviously if we set a, = 0 we have the invariant of the harmonic oscillator. The 
time dependent factor due to finite damping modifies this “invariant”. However, 
for cases where the damping time is long compared to the oscillation period we 
may still consider (12.38) a quasi invariant. The phase coordinate 6 can be derived 
from (12.36), (12.37) as a function of w and p,, as may be verified by insertion 
into the differential equation (12.35). The solution for the phase space density of a 
damped oscillator is of the form 


W(w, py t) = e"'F(r, ®), (12.39) 


where F(r, ®) is any arbitrary but differentiable function of r and @ and the phase 
@ is defined by 


P= 0+ fap — a2, t = arctan Mica + 4/@5 — 021. (12.40) 
2 


— 
Wy — Of,w 


For very weak damping a, — 0 and the solution (12.39) approaches (12.23) 
where a, = 0 and vg = got as expected. Therefore even for finite damping 
a particle distribution rotates in phase space although with a somewhat reduced 
rotation frequency due to damping. The particle density WY, however, changes 
exponentially with time due to the factor e*“’. For damping a, > 0, we get an 
increase in the phase space density at the distance R from the beam center. At 
the same time the real particle oscillation amplitudes (w, p,,) are being reduced 
proportional to e~%”’ and the increase in the phase space density at R reflects 
the concentration of particles in the beam center from larger amplitudes due to 
damping. 

In conclusion we found that in systems where velocity dependent forces exist, 
we have damping (a, > 0) or anti-damping (a, < 0) of oscillation amplitudes. 
As has been discussed such forces do exist in accelerators leading to damping. 
Mostly, however, the Vlasov equation is applied to situations where particles interact 
with self or external fields that can lead to instabilities. It is the task of particle 
beam dynamics to determine the nature of such interactions and to derive the 
circumstances under which the damping coefficient a,,, if not zero, is positive for 
damping or negative leading to beam instability. 


12.2. Damping of Oscillations in Electron Accelerators 


In electron accelerators we are concerned mainly with damping effects caused 
by the emission of synchrotron radiation. All six degrees of freedom for particle 
motion are damped. Damping of energy oscillations occurs simply from the fact 
that the synchrotron radiation power is energy dependent. Therefore a particle 
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with a higher energy than the reference particle radiates more and a particle 
with less energy radiates less. The overall effect is that the energy deviation is 
reduced or damped. Damping of the transverse motion is principally a geometric 
effect. The photons of synchrotron radiation are emitted into the direction of 
the particle motion. Therefore part of the energy loss is correlated to a loss in 
transverse momentum. On the other hand, the lost energy is restored through 
accelerating fields with longitudinal components only. The overall effect of an 
energy loss during the course of betatron oscillations is therefore a loss of transverse 
momentum which leads to a reduction in the transverse oscillation amplitude, an 
effect we call damping. In the next section, we will discuss the physics leading 
to damping and derive the appropriate damping decrement for different modes of 
oscillations. 


12.2.1 Damping of Synchrotron Oscillations 


In a real beam particles are spread over a finite distribution of energies close to the 
reference energy. The magnitude of this energy spread is an important parameter 
to be considered for both beam transport systems as well as for experimental 
applications of particle beams. In general, an energy spread as small as possible 
is desired to minimize chromatic aberrations and for improved accuracy of experi- 
mental observation. We will therefore derive the parametric dependence of damping 
and discuss methods to reduce the energy spread within a particle beam. 

To do this, we consider a beam of electrons being injected with an arbitrary 
energy distribution into a storage ring ignoring incidental beam losses during the 
injection process due to a finite energy acceptance. Particles in a storage ring 
undergo synchrotron oscillations which are oscillations about the ideal momentum 
and the ideal longitudinal position. Since energy and time or equivalently energy 
and longitudinal position are conjugate phase space variables, we will investigate 
both the evolution of the energy spread as well as the longitudinal distribution or 
bunch length of the particle beam. 

The evolution of energy spread or bunch length of the particle beam will depend 
very much on the nature of particles and their energy. For heavy particles like 
protons or ions there is no synchrotron radiation damping and therefore the phase 
space for such beams remains constant. As a consequence, the energy spread or 
bunch length also stays a constant. A similar situation occurs for electrons or 
positrons at very low energies since synchrotron radiation is negligible. Highly 
relativistic electrons, however, produce intense synchrotron radiation leading to a 
strong damping effect. 

The damping decrement q,, is defined in the Vlasov equation by 


af . ag 


Sf ss 12.41 
ow ~ dp . ( ) 
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Fig. 12.3 Longitudinal reference par- 


particle position ticle 


L—1t>0 — 


particle bunch 


and can be calculated with the knowledge of the functions f and g. For the 
conjugate variables (w, p,,) we use the time deviation of a particle with respect to the 
synchronous particle w = t as shown in Fig. 12.3 and the difference of the particle’s 


energy E from the synchronous or reference energy Ep and set p,, = € = E — Ep. 
Since f = a =tandg= de = € we have to determine the rate of change for 


the conjugate variables. The rate of change of t is from (9.17) with cpp ~ Eo 


dt € 
— =-7,h—, (12.42) 
dt Eo 

where we have replaced the phase by the time Vv = cfhkot and the relative 


momentum error by the relative energy error since we consider here only highly 
relativistic particles. The latter replacement is a matter of convenience since we will 
be using the energy gain in accelerating fields. 

The energy rate of change € is the balance of the energy gained in accelerating 
fields and the energy lost due to synchrotron radiation or other losses 


es 2 (eVe(z, +2) Ue, +O). (12.43) 


Here T is the time it takes the particles to travel the distance L. The energy 
gain within the distance L for a particle traveling a time t behind the reference 
or synchronous particle is eV,s(ts + 7) and U is the energy loss to synchrotron 
radiation along the same distance of travel. here we assume the energy gain or loss 
to be distributed evenly over the length of L. 

Before we go on, we apply these expressions to the simple situation of a linear 
accelerator of length L where the momentum compaction factor vanishes a, = 0 
and where there is no energy loss due to synchrotron radiation U = 0. Furthermore, 
we ignore for now other energy losses and have with n, = 1/y? 


= 8 a1 
f=Ht= p2 2? 
(12.44) 
g=eé= reVet (ts + T). 


Inserted into (12.41) we find the damping decrement to vanish which is consistent 
with the constancy of phase space. From the Vlasov equation we learn that in the 
absence of damping the energy spread € stays constant as the particle beam gets 
accelerated. 
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The Vlasov equation still can be used to also describe adiabatic damping but we 
need to use the relative energy spread as one of the variables. Instead of the second 
equation (12.44) we have then with 6 = 5 


d 
=) = ; 12.4 
. i At ( >) 


where Eo and E, are the energies time fo and t = fo+dr, respectively. We choose 
the time interval dt small enough so that the energy increase dE = adt «< Ep and 
get 


g=-—e. (12.46) 


OS 2 Og ys (12.47) 
— = —-— = —-2y = = — a 
06 Eo 6 dt 
and after integration 
a ae Si SF a (12.48) 
3 etait! — =— —_ = n— 5 
6 50 Eo Eo E; 
or 
6 Ep 
>=>- 12.49 
i e ( ) 


The relative energy spread in the beam is reduced during acceleration inversely 
proportional to the energy. The reduction of the relative energy spread is called 
adiabatic damping. This name is unfortunate in the sense that it does not 
actually describe a damping effect in phase space as we just found out but 
rather describes the variation of the relative energy spread with energy which 
is merely a consequence of the constant phase space density or Liouville’s 
theorem. 

Returning to the general case (12.43) we apply a Taylor’s expansion to the rf- 
voltage in (12.44) and get for terms on the r.h.s. keeping only linear terms 


e Vit(ts + T) = eV e(t,) + € ave v., (12.50) 
—U(E, + €) = —U(E,)— oy €. (12.51) 
OE E, 
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Since the energy gain from the rf-field eV,«(t,) for the synchronous particle just 
compensates its energy loss U(E;), we have instead of (12.43) now 


fa Viliem © 
Tg ae 


( (12.52) 
E; 


where we have set V,- = ane The synchrotron oscillation damping decrement can 


now be derived from (12.41) with (12.44), (12.52) to give 


Q 


U 


a= +4 (12.53) 


NI 
Q 


We will now derive the damping decrement for the case that the energy loss 
is only due to synchrotron radiation. The energy loss along the transport line L is 
given by 


1 L 
Us = -f P,ds, (12.54) 
c Jo 
where P,, is the synchrotron radiation power and the integration is fake lee the 
actual anaiele oie aaa s. If o(z) is the bending radius along z, we have $ =1 bes =. 
With x = xg + NE and averaging over many betatron oscillations, we ee (xp) = ‘0 
and 
ds ne 


es oe 12: 
ele, (12.55) 


This asymmetric averaging of the betatron oscillation only is permissible if 
the synchrotron oscillation frequency is much lower than the betatron oscillation 
frequency as is the case in circular accelerators. With ds = [1 + (y/p)(€/E,)|dz 
in (12.54), the energy loss for a particle of energy E, + € is 


Us(Es + €) = -[? (1 + 1) dz (12.56) 
L s 


or after differentiation with respect to the energy 


1 dP, n 1 
= Py dz. (12.57) 
ER ¢ dE PEs Jr, 
; L 


The synchrotron radiation power is proportional to the square of the energy and 
the magnetic field P, ~ E2Bj which we use in the expansion 


aU, 
OE 


dP, dP, aP,dB Py, Py aB ax 
= ay ae 12.58 
dE 0E | OBE E, | B OxdE Mee) 
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The variation of the synchrotron radiation power with energy depends directly 
on the energy but also on the magnetic field if there is a field gradient 9B and a finite 


dispersion function 7 = E,%. The magnetic field as well as the field ‘gradient is to 


JE" 
be taken at the reference orbit. Collecting all these terms and setting Be = pkwe 
get for (12.57) 
aU; 1 
= [Cz ey) = p+ 7.2) dz (12.59) 
OE |p ¢ P/IE, 
‘ : 
= * f pn(++20k)| 4 
7 E, cU, vt p - Es : 
L 
where we have made use of U,; = Ff 1 Py (Es) dz. Recalling the expressions for 
the synchrotron radiation power and energy loss Py = Cy, Et/ p’ and U, = 
CE; jf dz/p”, we may simplify (12.59) for 
dU Us 
—| =—(24+?%), 12.60 
3E |, E, | +0) ( ) 


where the -parameter has been introduced in (11.25). We finally get from (12.53) 
with (12.60) the damping decrement for synchrotron oscillations 


OF (P,) 
ee Ly ae 12.61 
OTE, = OTe, <= “on, 7 wee) 


Ae = 


in full agreement with results obtained earlier. Since all parameters except 3 are 
positive we have shown that the synchrotron oscillations for radiating particles are 
damped. A potential situation for anti-damping can be created if 3 < —2. 

To calculate the damping decrement, we assume accelerating fields evenly 
distributed around the ring to restore the lost energy. In practice this is not true 
since only few rf-cavities in a ring are located at one or more places around the ring. 
As long as the revolution time around the ring is small compared to the damping 
time, however, we need not consider the exact location of the accelerating cavities 
and may assume an even and uniform distribution around the ring. 


12.2.2. Damping of Vertical Betatron Oscillations 


Particles orbiting in a circular accelerator undergo transverse betatron oscillations. 
These oscillations are damped in electron rings due to the emission of synchrotron 
radiation. First we will derive the damping decrement for the vertical betatron oscil- 
lation. In a plane accelerator with negligible coupling this motion is independent 
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of other oscillations. This is not the case for the horizontal betatron motion which 
is coupled to the synchrotron oscillation due to the presence of a finite dispersion 
function. We will therefore derive the vertical damping decrement first and then 
discuss a very general theorem applicable for the damping in circular accelerators. 
This theorem together with the damping decrement for the synchrotron and vertical 
betatron oscillations will enable us to derive the horizontal damping in a much 
simpler way than would be possible in a more direct way. 
In normalized coordinates the functions f and g are for the vertical plane 


io t+vp =f(v,p.¢), (12.62) 
p 
d 
“P = vw = g(w,p,¢9), (12.63) 
de 
where v = vy,w = y//By, ra = /Byy' - Tp and vy~y = Wy is the vertical 


betatron phase. 

Due to the emission of a synchrotron radiation photon alone the particle does not 
change its position y nor its direction of propagation y’. With this we derive now 
the damping within a path element Az which includes the emission of photons as 
well as the appropriate acceleration to compensate for that energy loss. Just after the 
emission of the photon but before the particle interacts with accelerating fields let 
the transverse momentum and total energy be p, and E;,respectively. The slope of 
the particle trajectory is therefore (Fig. 12.4) 

CPL 


= ‘ 12.64 
Y= Fe (12.64) 


Energy is transferred from the accelerating cavity to the particle at the rate of the 
synchrotron radiation power P,, and the particle energy increases in the cavity of 
length Az from E, to E, + P, oe and the slope of the particle trajectory becomes at 
the exit of the cavity of length Az due to this acceleration 


P, Az 
y, za Ve (1- a - ). (12.65) 


BE, +P,“ BE 


Fig. 12.4 Acceleration and 
damping 
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We are now in a position to express the functions f and g in terms of physical 
parameters. The function f is expressed by 


A - oA 
poe ee 2 = v4/B,%, (12.66) 


Ag JByAg  /B, AY 


where we made use of Ag = Az/(vf). The damping decrement will depend on 
the derivation aw which can be seen from (12.66) to vanish since f does not depend 
on w 


Of 


12.67 
em ( ) 
The variation of the conjugate variable p with phase is from (12.62) 
dw} dwo 
A — —_ 
or (12.68) 
Ag v Ag 


From linear beam dynamics, we find 


dw, dwo 


Ten ae = VBI =) 5 FEO) (12.69) 


and get with (12.65), (12.66) 


Ap — ~VByx 0 + FO) 
BWP, 9) = F = . 


12.70 
cae (12.70) 


The function F(y) is a collection of y-dependent terms that become irrelevant for 
our goal. Damping will be determined by the value of the derivative = which with 
Ve 1 dw 


"Ew 


+ 5B)3 zY0 becomes 


a P, A 
ge 2 (12.71) 
op ase BcE, Ag 


In the derivation of (12.71) we have used the betatron phase as the “time” and 
get tor the damping per unit betatron phase advance. Transforming to the real 


time with 25> = 3 and (12.41) 


= 2a, (12:72) 
° IU 
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and solving for the vertical damping decrement 


(12.73) 


In this last equation, we have used the average synchrotron radiation power which 
is the appropriate quantity in case of a non-isomagnetic ring. The damping of the 
vertical betatron function is proportional to the synchrotron radiation power. This 
fact can be used to increase damping when so desired by increasing the synchrotron 
radiation power from special magnets in the lattice structure. 


12.2.3 Robinson’s Damping Criterion 


The general motion of charged particles extends over all six degrees of freedom 
in phase space and therefore the particle motion is described in six-dimensional 
phase space as indicated in the general Vlasov equation (12.17). It is, however, 
a fortunate circumstance that it is technically possible to construct accelerator 
components in such a fashion that there is only little or no coupling between 
different pairs of conjugate coordinates. As a consequence, we can generally treat 
horizontal betatron oscillations separate from the vertical betatron oscillations and 
both of them separate from synchrotron oscillations. Coupling effects that do occur 
will be treated as perturbations. There is some direct coupling via the dispersion 
function between synchrotron and particularly the horizontal betatron oscillations 
but the frequencies are very different with the synchrotron oscillation frequency 
being in general much smaller than the betatron oscillation frequency. Therefore in 
most cases the synchrotron oscillation can be ignored while discussing transverse 
oscillations and we may average over many betatron oscillations when we discuss 
synchrotron motion. 

A special property of particle motion in six-dimensional phase space must be 
introduced allowing us to make general statements about the overall damping effects 
in a particle beam. We start from the Vlasov equation (12.17) 


ow 
a Th VW + eVph =— (Vif + Vpg)¥ (12.74) 


and define a total damping decrement a by setting 
VS + Veg = —20 . (12.75) 


The total damping decrement is related to the individual damping decrements of 
the transverse and longitudinal oscillations but the precise dependencies are not yet 
obvious. In the derivation of (12.17), we have expanded the functions f and g in 
a Taylor series neglecting all terms of second or higher order in time and got as a 
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result the simple expression (12.75) for the overall damping. Upon writing (12.75) in 
component form, we find from the components of the I.h.s. that the overall damping 
decrement a is just the sum of all three individual damping decrements and we may 
therefore set 


Vif + Vpg = —20, = —2 (aly + ty + Me). (12.76) 


From this equation and the linearity of the functions f and g describing the 
physics of the dynamical system general characteristics of the damping process can 
be derived. The damping decrement does not depend on the dynamic variables of 
the particles and coupling terms do not contribute to damping. The damping rate 
is therefore the same for all particles within a beam. In the following paragraphs, 
we will discuss in more detail the general characteristics of synchrotron radiation 
damping. Specifically, we will determine the functions f and g and derive an 
expression for the total damping. 

We consider a small section of a beam transport line or circular accelerator 
including all basic processes governing the particle dynamics. These processes are 
focusing, emission of photons and acceleration. All three processes are assumed 
to occur evenly along the beam line. The six-dimensional phase space to be 
considered is 


(x,x’,y,y,T,€). (12.77) 


During the short time At some of the transverse coordinates change and it is 
those changes that determine eventually the damping rate. Neither the emission 
of a synchrotron radiation photon nor the absorption of energy in the accelerating 
cavities causes any change in the particle positions x, y, and t. Indicating the initial 
coordinates by the index 9 and setting BcAt = Az we get for the evolution of the 
particle positions within the length element Az in the three space dimensions 


X= x9 + x Az, 


y=yoty Az, (12.78) 
T=Tot+ €9 Az 
~ OT TE Be’ 


The conjugate coordinates vary in a somewhat more complicated way. First 
we note that the Vlasov equation does not require the conjugate coordinates to 
be canonical variables. Indeed this derivation will become simplified if we do 
not use canonical variables but use the slopes of the particle trajectories with the 
reference path and the energy deviation. The change of the slopes due to focusing 
is proportional to the oscillation amplitude and vanishes on average. Emission of a 
synchrotron radiation photon occurs typically within an angle of £1/y causing a 
small transverse kick to the particle trajectory. In general, however, this transverse 
kick will be very small and we may assume for all practical purposes the slope 
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of the transverse trajectory not to be altered by photon emission. Forces parallel 
to the direction of propagation of the particles can be created, however, through 
the emission of synchrotron radiation photons. In this case, the energy or energy 
deviation of the particle will be changed like 


Az Az 
€=€0-P,— + Pr. (12.79) 
Be Be 

Here we use the power P,, to describe the synchrotron radiation energy loss rate 
a particle may suffer during the time BcAt = Az. No particular assumption has 
been made about the nature of the energy loss except that during the time Af it be 
small compared to the particle energy. To compensate this energy loss the particles 
become accelerated in rf-cavities. The power P,¢ is the energy flow from the cavity 
to the particle beam, not to be confused with the total power the rf-source delivers 
to the cavity. 

The transverse slopes x’ and y’ are determined by the ratio of the transverse to 
the longitudinal momentum uw’ = p,/p, where u stands for x or y, respectively. 
During acceleration in the rf-cavity the transverse momentum does not change but 
the total kinetic energy increases from EF, to E, + Pa and the transverse slope of 
the trajectory is reduced after a distance Az to 


Py A 
fe ie eS ul; (12.80) 
cp: + Pap sé E; Bc 


Explicitly, the transverse slopes vary now like 


x =xH- Ed 
E, Be 0” 
if if Pt Az.y (12.81) 


Y = Yo FE, Be¥0° 


All ingredients are available now to formulate expressions for the functions f and g 
in component form 


t= ( Yo: nek)» 


(12.82) 

g= (—28x, — ty), —P, + Pi). 

With these expressions we evaluate (12.76) and find that V,f = 0. For the 

determination of V,g we note that the power P,- from the cavity is just equal to the 

average radiation power (P,) and the derivative of the radiation power with respect 
to the particle energy is 


= = =. (12.83) 
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Finally, we note that the rf-power P,- is equal to the average radiation power (Py) 
and get from (12.76) 


Py 
E, 


— 


ith eo so (12.84) 


The sum of all damping decrements is a constant, a result which has been 
derived first by Robinson [1] and is known as the Robinson criterion. The total 
damping depends only on the synchrotron radiation power and the particle energy 
and variations of magnetic field distribution in the ring keeping the radiation power 
constant will not affect the total damping rate but may only shift damping from one 
degree of freedom to another. 


12.2.4 Damping of Horizontal Betatron Oscillations 


With the help of the Robinson criterion, the damping decrement for the horizontal 
betatron oscillation can be derived by simple subtraction. Inserting (12.61), (12.75) 
into (12.84) and solving for the horizontal damping decrement we get 


_ (Py) 
a, = nt — 9). (12.85) 


The damping decrements derived from the Vlasov equation agree completely 
with the results obtained in Sect. 11.2 by very different means. 

No matter what type of magnet lattice we use, the total damping depends only on 
the synchrotron radiation power and the particle energy. We may, however, vary the 
distribution of the damping rates through the %-parameter to different oscillation 
modes by proper design of the focusing and bending lattice in such a way that 
one damping rate is modified in the desired way limited only by the onset of 
anti-damping in another mode. Specifically, this is done by introducing gradient 
bending magnets with a field gradient such as to produce the desired sign of the 3 
parameter. 


12.3. The Fokker—Planck Equation 


From the discussions of the previous section it became clear that the Vlasov 
equation is a useful tool to determine the evolution of a multiparticle system 
under the influence of forces depending on the physical parameters of the system 
through differentiable functions. If, however, the dynamics of a system in phase 
space depends only on its instantaneous physical parameters where the physics of 
the particle dynamics cannot be expressed by differentiable functions, the Vlasov 
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equation will not be sufficient to describe the full particle dynamics. A process 
which depends only on the state of the system at the time ¢ and not on its history is 
called a Markoff process. 

In particle beam dynamics we have frequently the appearance of such processes 
where forces are of purely statistical nature like those caused, for example, by 
the quantized emission of synchrotron radiation photons or by collisions with 
other particles within the same bunch or residual gas atoms. To describe such a 
situation we still have variations of the coordinates per unit time similar to those 
in (12.2) but we must add a term describing the statistical process and we set 
therefore 


Ww = folW.Pwit) + > & 8-4), (12.86) 
Dw = 8wlW.Pw.t) + >. 1 5(t— fi) , (12.87) 


where &; and z; are instantaneous statistical changes in the variables w and p, with 
a Statistical distribution in time ¢; and where 6(t— 1;) is the Dirac delta function. The 
probabilities P,,(€) and P,(z) for statistical occurrences with amplitudes € and z 
be normalized and centered 


f PuG)d =1, f PulE)E dE = 0, 
{P,@)d0 =1,. {Pardee =0, (12.88) 


The first equations normalize the probability amplitudes and the second equa- 
tions are true for symmetric statistical processes. The sudden change in the 
amplitude by Aw; or in momentum by Ap,,; due to one such process is given by 


Aw; = fs b(t — ti) dt = § ; (12.89a) 
Apwi os fx b(t — ti) dt = 77;. (12.89b) 


Analogous to the discussion of the evolution of phase space in the previous 
section, we will now formulate a similar evolution including statistical processes. 
At the time t+ Af, the particle density in phase space is taken to be W(w, py, t+ At) 
and we intend to relate this to the particle density at time ¢. During the time interval 
At there are finite probabilities P,,(&), P,(zt) that the amplitude (w — &) or the 
momentum (p,, — 7) be changed by a statistical process to become w or p,, at time 
t. This definition of the probability function also covers the cases where particles 
during the time Af either jump out of the phase space area AAp or appear in the 
phase space area AAo. 

To determine the number of particles ending up within the area AAg, we look 
at all area elements AAp which at time ¢ are a distance Aw = & and Ap,, = a 
away from the final area element AAg at time t + Af. As a consequence of our 
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assumption that the particle density is only slowly varying in phase space, we may 
assume that the density W is uniform within the area elements AAp eliminating the 
need for a local integration. We may now write down the expression for the phase 
space element and the particle density at time ¢ + At by integrating over all values 
of € and z 


+00 +00 
l= Ar [ / W(w — E, pw — 1, t) Py(E) Pp(x) dédz , (12.90) 


where we used the abbreviation = W(w + fy, At, py + gywAt, t+ At) AAg. 
The volume elements AAp and AAg are given by (12.5), (12.6), respectively. 
The statistical fluctuations may in general be of any magnitude. In particle beam 
dynamics, however, we find that the fluctuations with reasonable probabilities are 
small compared to the values of the variables w and p,,. The phase space density 
can therefore be expanded into a Taylor series where we retain linear as well as 
quadratic terms in € and z 


dW OW 


W(w— §, Py — 1, t) = Wo — baa (12.91) 
OPw 
0? 07% a7 
1¢2 ee 
as dw? won dp? ver dwdpy 
where YW = W(w, py, t) and we finally get for the integrals with (12.88) 
WY ow 

1=%+1—> fe P(E) dé +4 7 [ PP pmar, (12.92) 


For simplicity, we leave off the integration limits which are still from —oo to +o. 
If we now set NV to be the number of statistical occurrences per unit time we may 
simplify the quadratic terms on the r.h.s. of (12.92) by setting 


| EP, (E) dE = 5 (NGE*) Ar, (12.93) 


sf n?P, (a) da = 5(N,77) At, (12.94) 


and get similarly to the derivation of the Vlasov equation in Sect. 12.1 


0 W dW OW _ Of gw 
: + fw ay + 8w ta (F a tne WY (12.95) 
a? 7 ary 
+ ENG?) + 2 Nar?) 


Op2, 


w 
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This partial differential equation is identical to the Vlasov equation except for the 
statistical excitation terms and is called the Fokker—Planck equation [2]. We define 
diffusion coefficients describing the flow in € and z space by 

Dz = 5 (NOE), (12.96) 


(Nw) , (12.97) 


NIF NIE 


D, = 
and the Fokker—Planck equation becomes finally 


2a,W + D = +D 7 
= 2Ay ae Waly 
Ow op?, 


aw 
ot 


ow ow 
+ fon + 8w (12.98) 
ow 


OPw 


For the case of damped oscillators the Fokker—Planck equation can be derived 
similar to (12.35) and is 


ow ow ow ow aw 
or ®WoPw > — (@ow ae 20wPw) = 2a + D: a... a D; a 
Pw ow? Op, 


aw (12.99) 


This form of the Fokker—Planck equation will be very useful to describe a particle 
beam under the influence of diffusion processes. In the following section, we will 
derive general solutions which will be applicable to specific situations in accelerator 
physics. 


12.3.1 Stationary Solution of the Fokker—Planck Equation 


A unique stationary solution exists for the particle density distribution described by 
the partial differential equation (12.98). To derive this solution we transform (12.98) 
to cylindrical coordinates (w,py) — (r,0@) with w = rcos@ and py = rsiné 
and note terms proportional to derivatives of the phase space density with respect 
to the angle @. One of these terms wpW exists even in the absence of diffusion 
and damping and describes merely the betatron motion in phase space while the 
other terms depend on damping and diffusion. The diffusion terms will introduce 
a statistical mixing of the phases 6 and after some damping times any initial 
azimuthal variation of the phase space density will be washed out. We are here only 
interested in the stationary solution and therefore set all derivatives of the phase 
space density with respect to the phase @ to zero. In addition we find it necessary to 
average square terms of cos 6 and sin 8. With these assumptions the Fokker—Planck 
Equation (12.98) becomes after some manipulations in the new coordinates 


D\ ow ow 
\> q (12.100) 


cad 20, + + 
Ts = 4hy Ayr — a> 
dt r) or or2 
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where we have defined a total diffusion coefficient 
D = 5(De + Dy). (12.101) 


Equation (12.100) has some similarity with, for example, wave equations in 
quantum mechanics which are solved by the method of separation of variables 
and we expect the stationary solution for the phase space density to be of the form 
W(r,t) = >°, F,(t) G,(r). The solution G,,(r) must meet some particular boundary 
conditions. Specifically, at time t = 0, we may have any arbitrary distribution of the 
phase space density G,o(r). Furthermore, we specify that there be a wall at r = R 
beyond which the phase space density drops to zero and consequently, the boundary 
conditions are 


G,(r < R) = G0 (7) , 


G,(r > R) =0. (12.102) 


By the method of separation of the constants we find for the functions F’,(f) 
F,,(t) = const.e°"", (12.103) 


where the quantity —q, is the separation constant. The general form of the 
solution for (12.100) may now be expressed by a series of orthogonal functions or 
eigenmodes of the distribution G,,(r) which fulfill the boundary conditions (12.102) 


Wry=) eoine (12.104) 


n=O 


The amplitudes c, in (12.104) are determined such as to fit the initial density 
distribution 


Wo(r,t = 0) =) enGro(r) - (12.105) 


n>0 


With the ansatz (12.104) we get from (12.100) for each of the eigenmodes the 
following second-order differential equation: 


0Gn 1 Ay OGn Ay Xn 
2 G, = 0. 12.106 
or? +(54 er) or = D ( +2) ( ) 


All terms with a coefficient a, > 0 vanish after some time due to damping (12.103). 
Negative values for the damping decrements aw, < 0 define instabilities which 
we will not consider here. Stationary solutions, therefore require the separation 
constants to be zero aw, = O. Furthermore, all solutions G, must vanish at the 
boundary r = R where R may be any value including infinity if there are no physical 
boundaries at all to limit the maximum particle oscillation amplitude. In the latter 
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case where there are no walls, the differential equation (12.106) can be solved by 
the stationary distribution 


W(r,t) = > Cn Gn(r) & exp (-2°) (12.107) 


an=0 


which can easily be verified by backinsertion into (12.106). The solution for the 
particle distribution in phase space under the influence of damping @,, and statistical 
fluctuations D is a Gaussian distribution with the standard width 


D 
oe le (12.108) 


Aw 


Normalizing the phase space density the stationary solution of the Fokker— 
Planck equation for a particle beam under the influence of damping and statistical 
fluctuations is 


1 
V 200, 


Eigenfunctions for which the eigenvalues @, are not zero, are needed to describe 
an arbitrary particle distribution, e.g., a rectangular distribution at time t = 0. The 
Fokker—Planck equation, however, tells us that after some damping times these 
eigensolutions have vanished and the Gaussian distribution is the only stationary 
solution left. The Gaussian distribution is not restricted to the r-space alone. 
The particle distribution in equilibrium between damping and fluctuations is also 
Gaussian in the normalized phase space (w, py) as well as in real space. With 
row + p>, we get immediately for the density distribution in (w, p,,)-space 


Wr) = et /20r (12.109) 


1 
Ww, pw) = ——$ 2% @Pwe/ ry , (12.110) 
270 Op, 
where we have set oy, = Op,, = / 2. The standard deviation in w and p,, is the same 


as for r which is to be expected since all three quantities have the same dimension 
and are linearly related. 
In real space we have for u = x or y by ane u = Bw and p = ~ where 


w= ae On the other hand, p = //B,x' — fae and inserted into (12. 107) we get 


the density distribution in real space 


2 Bray 12 
pa Rais we Heal i (12.111) 


Wu, u’ _ 
(u Wo exp ( 292 
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This distribution describes the particle distribution in real phase space where 
particle trajectories follow tilted ellipses. Note that we carefully avoid replacing the 
derivative of the betatron function with 6’ = —2a because this would lead to a 
definite confusion between the damping decrement and the betatron function. To 
further reduce confusion we also use the damping times 1; = a; !. Integrating the 
distribution (12.111) for all values of the angles u’, for example, gives the particle 
distribution in the horizontal or vertical midplane. Using the mathematical relation 


l Oe Pe Lardy = ve ef /(4p") [3], we get 


1 


—u? /202 
gt? /205 (12.112) 
V2 0 V/ Budw 


VWou)= 


where the standard width of the horizontal Gaussian particle distribution is 


Gy Sf PO = sf Ba BD (12.113) 


The index ,, has been added to the diffusion and damping terms to indicate that 
these quantities are in general different in the horizontal and vertical plane. The 
damping time depends on all bending magnets, vertical and horizontal, but only on 
the damping-partition number for the plane under consideration. Similar distinction 
applies to the diffusion term. 

In a similar way, we get the distribution for the angles by integrating (12.111) 
with respect to u 


72 
W(u'’) = vB o| PE (12.114) 


Vim J1 +4 Boy 2(1+3 BY) a; 


where the standard width of the angular distribution is 


4+ Bp’? 4+ pr? 
o, = iP Sa B Di. (12.115) 
4p 4p 


We have not made any special assumption as to the horizontal or vertical plane 
and find in (12.112)-(12.115) the solutions for the particle distribution in both 
planes. 

In the longitudinal phase space the equations of motion are mathematically equal 
to Eq. (12.11). First we define new variables 


Qo 
Ne 


T, (12.116) 


w= 
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where {2,9 is the synchrotron oscillation frequency, 7. the momentum compaction 
and t the longitudinal deviation of a particle from the reference particle. The 
conjugate variable we define by 


é 


= 12.117 
P Ey ( ) 


where € is the energy deviation from the reference energy Ey. After differentiation 
of (12.52) and making use of (12.53) and the definition of the synchrotron oscillation 


frequency, we use these new variables and obtain the two first-order differential 
equations 


w= +2sp, (12.118) 

Dp = —Q yw — 2aep. (12.119) 

These two equations are of the same form as (12.11) and the solution of the 
longitudinal Fokker—Planck equation is therefore similar to (12.112)—(12.115). The 


energy distribution within a particle beam under the influence of damping and 
statistical fluctuations becomes with p = 6 = €/Ep 


W(5) = eres, (12.120) 


V 205 
where the standard value for the energy spread in the particle beam is defined by 


or 

= = /t-De. (12.121) 
0 

Ina similar way, we get for the conjugate coordinate t with w = Gs t the distribution 


1 
V 210, 


The standard width of the longitudinal particle distribution is finally 


= ue Jude. (12.123) 


W(t) = et 20 (12.122) 


The deviation in time t of a particle from the synchronous particle is equivalent 
to the distance of these two particles and we may therefore define the standard value 
for the bunch length from (12.123) by 


og = piel iD. (12.124) 
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By application of the Fokker—Planck equation to systems of particles under the 
influence of damping and statistical fluctuations, we were able to derive expressions 
for the particle distribution within the beam. In fact, we were able to determine 
that the particle distribution is Gaussian in all six degrees of freedom. Since such 
a distribution does not exhibit any definite boundary for the beam, it becomes 
necessary to define the size of the distributions in all six degrees of freedom by 
the standard value of the Gaussian distribution. Specific knowledge of the nature 
for the statistical fluctuations are required to determine the numerical values of the 
beam sizes. 

In Chap.13 we will apply these results to determine the equilibrium beam 
emittance in an electron positron storage ring where the statistical fluctuations are 
generated by quantized emission of synchrotron radiation photons. 


12.3.2 Particle Distribution within a Finite Aperture 


The particle distribution in an electron beam circulating in a storage ring is a 
Gaussian if we ignore the presence of walls containing the beam. All other modes of 
particle distribution are associated with a finite damping time and vanish therefore 
after a short time. In a real storage ring we must, however, consider the presence of 
vacuum chamber walls which cut off the Gaussian tails of the particle distribution. 
Although the particle intensity is very small in the far tails of a Gaussian distribution, 
we cannot cut off those tails too tight without reducing significantly the beam 
lifetime. Due to quantum excitation, we observe a continuous flow of particles from 
the beam core into the tails and back by damping toward the core. A reduction of 
the aperture into the Gaussian distribution absorbs therefore not only those particles 
which populate these tails at a particular moment but also all particles which reach 
occasionally large oscillation amplitudes due to the emission of a high energy 
photon. The absorption of particles due to this effect causes a reduction in the beam 
lifetime which we call the quantum lifetime. 

The presence of a wall modifies the particle distribution especially close to the 
wall. This modification is described by normal mode solutions with a finite damping 
time which is acceptable now because any aperture less than an infinite aperture 
absorbs beam particles thus introducing a finite beam lifetime. Cutting off Gaussian 
tails at large amplitudes will not affect the Gaussian distribution in the core and 
we look therefore for small variations of the Gaussian distribution which become 
significant only quite close to the wall. Instead of (12.107) we try the ansatz 


W(r,t) =e DB" (rye, (12.125) 


where 1 /q is the time constant for the distribution, with the boundary condition that 
the particle density be zero at the aperture or acceptance defining wall r = A or 


W(A, t) =0. (12.126) 
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Equation (12.125) must be a solution of (12.100) and back insertion of (12.125) 
into (12.100) gives the condition on the function g(r) 


1 
+ ( : =) e+ = 0, (12.127) 
ro dy o 


Since g(r) = | in case there is no wall, we expand the correction into a power series 


r 


an =14+ >> x*, where ors 


k>1 


(12.128) 


Inserting (12.128) into (12.127) and collecting terms of equal powers in r we derive 
the coefficients 


=k 
_.4 _ _«k- 1! 
Gul ee Te XxX, (12.129) 


Qa 


where X = 5—- < 1. The approximation X < 1 is justified since we expect the 
vacuum chamber wall to be far away from the beam center such that the expected 
quantum lifetime 1/a is long compared to the damping time 1/a,, of the oscillation 


under consideration. With these coefficients (12.128) becomes 


a 1 


ieee 
alr) 2a, — kk! 
k>1 


(12.130) 


For x = A?/(20*) >> 1 where A is the amplitude or amplitude limit for the 
oscillation w, the sum in (12.130) can be replaced by an exponential function 


1 , e 
Si aww =. (12.131) 


From the condition g(A) = 0 we finally get for the quantum lifetime tz = 1/a 


e 


a= atv (12.132) 
where 
Az 
x= (12.133) 


The quantum lifetime ty is related to the damping time. To make the quantum 
life time very large of the order of 50 or more hours, the aperture must be at least 
about 7o,, in which case x = 24.5 and e*/x = 1.8 x 10°. 
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The aperture A is equal to the transverse acceptance of a storage ring for a one- 
dimensional oscillation like the vertical betatron oscillation while longitudinal or 
energy oscillations are limited through the maximum energy acceptance allowed by 
the rf-voltage. Upon closer look, however, we note a complication for horizontal 
betatron oscillations and synchrotron oscillations because of the coupling from 
energy oscillation into transverse position due to a finite dispersion function. We 
also have assumed that w/(2a,,) < 1 which is not true for tight apertures of less 
than one sigma. Both of these situations have been investigated in detail [4, 5] and 
the interested reader is referred to those references. 

Specifically, if the acceptance A of a storage ring is defined at a location where 
there is also a finite dispersion function, Chao [4] derives a combined quantum 
lifetime of 


er /2 1 


o> J 2ra,n3 (1+ r)/r (1 —r) , 


where n = A/o,, Oo. =o? + gers r= nos / er A the transverse aperture, 7 the 
dispersion function at the same location where the aperture is A, 0, the transverse 
beam size and os = o, /E the standard relative energy width in the beam. 


(12.134) 


12.3.3 Particle Distribution in the Absence of Damping 


To obtain a stationary solution for the particle distribution it was essential that there 
were eigensolutions with vanishing eigenvalues a, = 0. As a result, we obtained an 
equilibrium solution where the statistical fluctuations are compensated by damping. 
In cases where there is no damping, we would expect a different solution with 
particles spreading out due to the effect of diffusion alone. This case can become 
important in very high energy electron positron linear colliders where an extremely 
small beam emittance must be preserved along a long beam transport line. The 
differential equation (12.106) becomes in this case 


oe Gn 1 0G, Qn 
or r or D 


G, =0. (12.135) 


We will assume that a beam with a Gaussian particle distribution is injected into 
a damping free transport line and we therefore look for solutions of the form 


W,(r,t) = cnGr(r)e *", (12.136) 
where 


G,(r) = e"/290 (12.137) 
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with oo being the beam size at t = 0. We insert (12.137) into (12.135) and obtain an 
expression for the eigenvalues a, 


aa, (12.138) 


The time dependent solution for the particle distribution now becomes 


2D r 2D 
Wir, t) = Aexp (-3) exp| (-] (1 = =) | : (12.139) 
orn 205 05 


Since nowhere a particular mode is used we have omitted the index n. The 
solution (12.139) exhibits clearly the effect of the diffusion in two respects. The 
particle density decays exponentially with the decrement 2D/o,. At the same time 
the distribution remains to be Gaussian although being broadened by diffusion. The 
time dependent beam size o is given by 


o2 

2 0 ?) 

o-(t) = xoj(l+—st), 12.140 
(t) 1a 7 3( >) ( ) 
% 


where we have assumed that the diffusion term is small (2D/o))t « 1. Setting 
= a = ¢€,f, for the plane u where f,, is the betatron function at the observation 
point of the beam size o,,. The time dependent beam emittance is 


2D 
€u = €40 + 3 (12.141) 


or the rate of change 


de, 2D De+Dr 
dt Bi Bu , 


Due to the diffusion coefficient D we obtain a continuous increase of the beam 
emittance in cases where no damping is available. 

The Fokker—Planck diffusion equation provides a tool to describe the evolution 
of a particle beam under the influence of conservative forces as well as statistical 
processes. Specifically, we found that such a system has a stationary solution in 
cases where there is damping. The stationary solution for the particle density is 
a Gaussian distribution with the standard width of the distribution o given by the 
diffusion constant and the damping decrement. 

In particular, the emission of photons due to synchrotron radiation has the 
properties of a Markoff process and we find therefore the particle distribution to be 
Gaussian. Indeed we will see that this is true in all six dimensions of phase space. 


(12.142) 
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Obviously not every particle beam is characterized by the stationary solution of 
the Fokker—Planck equation. Many modes contribute to the particle distribution and 
specifically at time t = 0 the distribution may have any arbitrary form. However, 
it has been shown that after a time long compared to the damping times only one 
nontrivial stationary solution is left, the Gaussian distribution. 


Problems 


12.1 (S). Derive from the Vlasov equation an expression for the synchrotron 
frequency while ignoring damping. A second rf-system with different frequency 
can be used to change the synchrotron tune. Determine a system that would reduce 
the synchrotron tune for the reference particle to zero while providing the required 
rf-voltage at the synchronous phase. What is the relationship between both voltages 
and phases? Is the tune shift the same for all particles? 


12.2 (S). Formulate an expression for the equilibrium bunch length in a storage 
ring with two rf-systems of different frequencies to control bunch length. 


12.3 (S). Energy loss of a particle beam due to synchrotron radiation provides 
damping. Show that energy loss due to interaction with an external electromagnetic 
field does not provide beam damping. 


12.4 (S). An arbitrary particle distribution of beam injected into a storage ring 
damps out while a Gaussian distribution evolves with a standard width specific to 
the ring design. What happens if a beam from another storage ring with a different 
Gaussian distribution is injected? Explain why this beam changes its distribution to 
the ring specific Gaussian distribution. 


12.5 (S). Consider a 1.5GeV electron storage ring with a bending field of 
1.5T. The circumference may be covered to 60% by bending magnets. Let 
the bremsstrahlung lifetime be 100h, the Coulomb scattering lifetime 50h and 
the Touschek lifetime 60h. Calculate the total beam lifetime including quantum 
excitation as a function of aperture. How many “sigma’s” (A/o) must the apertures 
be in order not to reduce the beam lifetime by more than 10% due to quantum 
excitation? 


12.6. To reduce coupling instabilities between bunches of a multibunch beam it is 
desirable to give each bunch a different synchrotron tune. This can be done, for 
example, by employing two rf-systems operating at harmonic numbers / andh + 1. 
Determine the ratio or required rf-voltages to split the tunes between successive 
bunches by Av/v,. 


12.7. Attempt to damp out the energy spread of a storage ring beam in the following 
way. At a location where the dispersion function is finite one could insert a TM) 10- 
mode cavity. Such a cavity produces accelerating fields which vary linear with 
the transverse distance of a particle from the reference path. This together with a 
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linear change in particle energy due to the dispersion would allow the correction 
of the energy spread in the beam. Derive the complete Vlasov equation for such an 
arrangement and discuss the six-dimensional dynamics. Show that it is impossible 
to achieve a monochromatic stable beam. 


12.8. Derive an expression for the diffusion due to elastic scattering of beam 
particles on residual gas atoms. How does the equilibrium beam emittance of an 
electron beam scale with gas pressure and beam energy? Determine an expression 
for the required gas pressure to limit the emittance growth of a proton or ion beam 
to no more than 1 % per hour and evaluate numerical for a proton emittance of 
10~° rad-m at an energy of 300GeV. Is this a problem if the achievable vacuum 
pressure is | nTorr? Concentrating the allowable scattering to one location of 10cm 
length (gas jet as a target) in a ring of 4km circumference, calculate the tolerable 
pressure of the gas jet. 


12.9. For future linear electron colliders it may be desirable to provide a switching 
of the beams from one experimental detector to another. Imagine a linear collider 
system with two experimental stations separated transversely by 50m. To guide 
the beams from the linear accelerators to the experimental stations use translating 
FODO cells and determine the parameters required to keep the emittance growth of 
a beam to less than 10 % (beam emittance 10~!! rad-m at 500 GeV). 
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Chapter 13 
Equilibrium Particle Distribution* 


The wide variety of particle beam applications require often very specific beam 
characteristics in terms of say cross section, divergence, energy spread or pulse 
structure. To a large extend such parameters can be adjusted by particular appli- 
cation of focusing and other forces. In this chapter, we will discuss some of these 
methods of beam optimization and manipulation. 


13.1 Particle Distribution in Phase Space 


The beam emittance of particle beams is primarily defined by characteristic 
source parameters and source energy. Given perfect matching between different 
accelerators and beam lines during subsequent acceleration, this source emittance 
is reduced inversely proportional to the particle momentum by adiabatic damping 
and stays constant in terms of normalized emittance. This describes accurately the 
situation for proton and ion beams, for nonrelativistic electrons and electrons in 
linear accelerators. 

The beam emittance for relativistic electrons, however, evolves fundamentally 
different in circular accelerators. Relativistic electron and positron beams passing 
through bending magnets emit synchrotron radiation, a process that leads to 
quantum excitation and damping. As a result, the original beam emittance at the 
source is completely replaced by an equilibrium emittance that is unrelated to the 
source characteristics. 
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13.1.1 Diffusion Coefficient and Synchrotron Radiation 


Emission of a photon causes primarily a change of the particle energy but the 
characteristics of the particle motion is changed as well. Neither position nor the 
direction of the particle trajectory is changed during the emission of photons. From 
beam dynamics, however, we know that different reference trajectories exist for 
particles with different energies. Two particles with energies cpg and cp; follow two 
different reference trajectories separated at the position z along the beam transport 
line by a distance 


Cc 1G 
Ae gy (13.1) 
CPo 


where 7(z) is the dispersion function and cpo the reference energy. Although 
particles in general do not exactly follow these reference trajectories they do perform 
betatron oscillations about them. The sudden change of the particle energy during 
the emission of a photon leads to a sudden change in the reference path and thereby 
to a sudden change in the betatron oscillation amplitude. 

Following the discussion of the Fokker-Planck equation in Chap. 12, we may 
derive a diffusion coefficient from these sudden changes in the coordinates. Using 
normalized coordinates w = x/ if B. , the change in the betatron amplitude at the 
moment a photon of energy €, is emitted becomes 


Apso 2 =, (13.2) 
Vv Bx Fo 
Similarly, the conjugate coordinate w = V Bx Xp + a, xg changes by 
€ a € 
Avan = —/ 4a = a, (13.3) 
Ban JB. Eo 


The frequency at which these statistical variations occur is the same for € and z 
and is equal to the number of photons emitted per unit time 


Ne = Na =N. (13.4) 


From (13.2), (13.3) we get 


Ptr = (2) v4 (va + sn) = (&) «. (13.5) 
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where we have defined a special lattice function 


H = Ben? + 2annn +)? . (13.6) 


We are interested in the average value of the total diffusion coefficient (12.101) 
= IN @ + 27)). = SIN (6) H):, (13.7) 


where the average (---), is to be taken along the whole transport line or the whole 
circumference of a circular accelerator. Since photon emission does not occur 
outside of bending magnets, the average is taken only along the length of the 
bending magnets. To account for the variation in photon energies, we use the rms 
value of the photon energies (€)). The theory of synchrotron radiation is discussed 
in much detail in Chap. 23 and we take in the following paragraph only relevant 
results of this theory. 

The number of photons emitted per unit time with frequencies between w and 
@-+da is simply the spectral radiation power at this frequency divided by the photon 
energy iw. Here, we consider only bending magnet radiation and treat radiation 
from insertion devices as perturbations. Of course, this approach must be modified 
if a significant part of radiation comes from non-bending magnet radiation. The 
spectral photon flux from a single electron is from (25.132) with the synchrotron 
radiation power (24.34) 


dn(w) 1 dP(@) _ Py 9V3 
do hw dw hw? 8x 


/ Ks5/3(x) dx, (13.8) 
¢ 


where € = w/w, and the critical photon energy defined in (24.49). The total photon 
flux is by integration over all frequencies 


nal 2 f° i K(x) dxdt (13.9) 


ho, 81 


which becomes with GR(6.561.16) and (1/6) /(1/6) = 5/3 after integration 
by parts from AS(6.1.17) 


Py 9V3 15V3 Py 
ae * Ksja(o) a = = 8 he (13.10) 


The rms value of the photon energy (€)) can be derived in the usual way from 
the spectral distribution n (w) by 


hr ce, 9/3P,ho, f% oo 
(ef) = x | w°n(w) dw = eae e | Ksj3(x)dxd€- (13.11) 
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and is after integration by parts 


Piha. 9/31 
(~)= ag 3 / €?Ks/3(6) de. (13.12) 


The integral of the modified Bessel’s function in (13.12) is from GR[6.561.16] 
41°(2 + 2) (2 — 2) where we use again AS(6.1.17) for (7?) (4) = 2x. 
Collecting all terms 


55 
N (2) = ——P, ho, 13.13 
( y ) 944/3 y ( ) 
and the diffusion coefficient (13.7) becomes 
55 (Py hwH )- 
D=3(N(@4+2’)). = (13.14) 


— 48V3— ES 


The stationary solution for the Fokker-Planck equation has been derived describ- 
ing the equilibrium particle distribution in phase space under the influence of 
quantum excitation and damping. In all six dynamical degrees of freedom the 
equilibrium distribution is a Gaussian distribution and the standard value of the 
distribution width is determined by the damping time and the respective diffusion 
coefficient. In this chapter, we will be able to calculate quantitatively the diffusion 
coefficients and from that the beam parameters. 


13.1.2 Quantum Excitation of Beam Emittance 


High energy electron or positron beams passing through a curved beam transport 
line suffer from quantum excitation which is not compensated by damping since 
there is no acceleration. In Sect. 12.3.3 we have discussed this effect and found 
the transverse beam emittance to increase linear with time (12.142) and we get 
with (13.14) 


de, de, 55 reh 
é, €, Te he (7) (13.15) 


edt dz a 24,/3 mc? y Pie 


There is a strong energy dependence of the emittance increase along the beam 
transport line and the effect becomes significant for high beam energies as proposed 
for linear collider systems. Since the emittance blow up depends on the lattice 
function H, we would choose a very strong focusing lattice to minimize the dilution 
of the beam emittance. For this reason, the beam transport system for the linear 
collider at the Stanford Linear Accelerator Center [1] is composed of very strongly 
focusing combined bending magnets. 
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Particle distributions become modified each time we inject a beam into a circular 
accelerator with significant synchrotron radiation. Arbitrary particle distributions 
can be expected from injection systems before injection into a circular accelerator. If 
the energy in the circular accelerator is too small to produce significant synchrotron 
radiation the particular particle distribution is preserved according to Liouville’s 
theorem while all particles merely rotate in phase space as discussed in Sect. 12.1. 
As the beam energy is increased or if the energy is sufficiently high at injection 
to generate significant synchrotron radiation, all modes in the representation of the 
initial particle distribution vanish within a few damping times while only one mode 
survives or builds up which is the Gaussian distribution with a standard width given 
by the diffusion constant and the damping time. In general, any deviation from this 
unique equilibrium solution and be it only a mismatch to the correct orientation of 
the beam in phase space will persist for a time not longer than a few damping times. 


13.2 Equilibrium Beam Emittance 


In circular electron accelerators, as in electron storage rings, quantum excitation 
is counteracted by damping. Since quantum excitation is not amplitude dependent 
but damping is, there is an equilibrium beam emittance when both effects are 
equally strong. In the presence of quantum fluctuations Liouville’s theorem is not 
applicable strictly anymore. In the case of an electron beam in equilibrium the 
phase space density for a beam in equilibrium is preserved, although in a different 
way. While Liouville’s theorem is based on Hamiltonian mechanics and demands 
that no particle should escape its phase space position we allow in the case of an 
electron beam in equilibrium that a particle may escape its phase space position but 
be replaced instantly by another particle due to damping. 


13.2.1 Horizontal Equilibrium Beam Emittance 


The horizontal beam size is related to damping and diffusion coefficient 
from (12.113) like 


o2 


— 
Damping times have been derived in Sect. 12.2 and with (13.7) the horizontal 
beam size 0, at a location where the value of the betatron function is 6, becomes 


T Dy . (13.16) 


4 (13.17) 
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The ratio o7/, is consistent with our earlier definition of the beam emittance. 
For a particle beam which is in equilibrium between quantum excitation and 
damping, this ratio is defined as the equilibrium beam emittance being equivalent 
to the beam emittance for all particles within one standard value of the Gaussian 
distribution. For further simplification, we make use of the expression (13.13) and 
get with the radiation power (24.34) and the critical frequency (24.49) the horizontal 
beam emittance equation 


3 
& = Cyy? HAP he (13.18) 


Je (1/p?) 2’ 


where we adopted Sands’ [2] definition of a quantum excitation constant 
C, = —=—> = 3.84« 10 * m. (13.19) 


The equilibrium beam emittance scales like the square of the beam energy 
and depends further only on the bending radius and the lattice function #. From 
the definition of H the horizontal equilibrium beam emittance depends on the 
magnitude of the dispersion function and can therefore be adjusted to small or large 
values depending on the strength of the focusing for the dispersion function. 


13.2.2 Vertical Equilibrium Beam Emittance 


The vertical beam emittance follows from (13.18) considering that the dispersion 
function and therefore vanishes. Consequently, the equilibrium vertical beam 
emittance seems to be zero because there is only damping but no quantum 
excitation. In this case we can no longer ignore the fact that the photons are emitted 
into a finite although very small angle about the forward direction of particle 
propagation. Each such emission causes both a loss in the particle energy and a 
transverse recoil deflecting the particle trajectory. The photons are emitted typically 
within an angle 1/y generating a transverse kick without changing the betatron 
oscillation amplitude. With dy = 0 and dy = mi we get for the statistical 
variations 


&=0, 
—— 1 &y \2 
1s = bys (#) . 


Following a derivation similar to that for the horizontal beam emittance, we get 
for the vertical beam emittance equation 


(13.20) 
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This is the fundamentally lower limit of the equilibrium beam emittance due 
to the finite emission angle of synchrotron radiation. For an isomagnetic ring the 
vertical beam emittance 


B=C, (Py) (13.22) 


Jy |p| 


does not depend on the particle energy but only on the bending radius and the 
average value of the betatron function. In most practical circular accelerator designs, 
both the bending radius and the betatron function are of similar magnitude and the 
fundamental emittance limit therefore is of the order of Cy = 10713 radian meter, 
indeed very small compared to actually achieved beam emittances. 

The assumption that the vertical dispersion function vanishes in a flat circular 
accelerator is true only for an ideal ring. Dipole field errors, quadrupole misalign- 
ments and any other source of undesired dipole fields create a vertical closed orbit 
distortion and an associated vertical dispersion function. This vertical dispersion 
function, often called spurious dispersion function, is further modified by orbit 
correction magnets but it is not possible to completely eliminate it because the 
location of dipole errors are not known. 

Since the diffusion coefficient D is quadratic in the dispersion function (13.7) we 
get a contribution to the vertical beam emittance from quantum excitation similar 
to that in the horizontal plane. Indeed, this effect on the vertical beam emittance is 
much larger than that due to the finite emission angle of photons discussed above 
and is therefore together with coupling the dominant effect in the definition of the 
vertical beam emittance. 

The contribution to the vertical beam emittance is in analogy to the derivation 
leading to (13.18) 


Hy/|p°|)- 
Agy = Cy? eho (13.23) 


Jy(1/p?)z 


where 


Hy — | Byn? + 2oynynl, + ym, 
lel” lel” : 


(13.24) 


To minimize this effect, orbit correction schemes must be employed which not 
only correct the equilibrium orbit but also the perturbation to the dispersion function. 
Of course, the same effect with similar magnitude occurs also in the horizontal plane 
but is in general negligible compared to ordinary quantum excitation. 
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13.3. Equilibrium Energy Spread and Bunch Length 


The statistical processes caused by the emission of synchrotron radiation photons 
affect not only the four transverse dimensions of phase space but also the energy- 
time phase space. Particles orbiting in a circular accelerator emit photons with a 
statistical distribution of energies while only the average energy loss is replaced in 
the accelerating cavities. 


13.3.1 Equilibrium Beam Energy Spread 


This leaves a residual statistical distribution of the individual particle energies 
which we have derived in Sect. 12.3 to be Gaussian just like the transverse particle 
distribution with a standard width given by (12.121). The conjugate coordinate is the 
“time” w = 2 where t is the deviation in time of a particle from the synchronous 
particle, and € the energy deviation of a particle from the reference energy Ep. 

The emission of a photon will not change the position of the particle in time and 
therefore £ = 0. The conjugate coordinate being the particle energy will change 
due to this event by the magnitude of the photon energy and we have 2 = €,/Ep. 
Comparing with (13.5), we note that we get the desired result analogous to the 
transverse phase space by setting # = | and using the correct damping time for 
longitudinal motion. The equilibrium energy spread becomes then from (12.121) in 
analogy to (13.18) 


o, (I1/e°l): 
5 =Qy——, 13.25 
ES o Je(1/p?)z : 
which in a separated function lattice depends only on the particle energy and the 
bending radius. In a fully or partially combined function lattice, the partition number 
J. can be modified providing a way to vary the energy spread. 


13.3.2 Equilibrium Bunch Length 


There is also a related equilibrium distribution in the longitudinal dimension which 
defines the length of the particle bunch. This distribution is also Gaussian and the 
standard bunch length is from (12.123), (12.124) 


op = ch — —. (13.26) 
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The equilibrium bunch length not only depends on the particle energy and the 
bending radius but also on the focusing lattice through the momentum compaction 
factor and the partition number as well as on rf-parameters included in the syn- 
chrotron oscillation frequency §2,. To exhibit the scaling, we introduce lattice and 
rf-parameters into (13.26) to get with (13.25) and the definition of the synchrotron 
frequency (9.32) an expression for the equilibrium bunch length 


2. 2mCq ncE\R (11/7 ])- 
oy = 2)2 C 22 
(mc*)* J.heVo cos Ww, (1/7): 


(13.27) 


where R is the average radius of the ring. The bunch length can be modified 
through more parameters than any other characteristic beam parameter in the six- 
dimensional phase space. Lattice design affects the resulting bunch length through 
the momentum compaction factor and the partition number. Strong focusing results 
in a small value for the momentum compaction factor and a small bunch length. 
Independent of the strength of the focusing, the momentum compaction factor can 
in principle be adjusted to any value including zero and negative values by allowing 
the dispersion function to change sign along a circular accelerator because the 
momentum compaction factor is the average of the dispersion function a, = (7/p). 
In this degree of approximation the bunch length could therefore be reduced to 
arbitrarily small values by reducing the momentum compaction factor. However, 
close to the transition energy phase focusing to stabilize synchrotron oscillations is 
lost. 

Introduction of gradient magnets into the lattice modifies the partition numbers 
as we have discussed in Sect. 12.2.1. As a consequence, both, the energy spread 
and bunch length increase or decrease at the expense of the opposite effect on 
the horizontal beam emittance. The freedom to adjust any of these three beam 
parameters in this way is therefore limited but nonetheless an important means to 
make small adjustments if necessary. Obviously, the rf-frequency as well as the 
rf-voltage have a great influence on the bunch length. The bunch length scales 
inversely proportional to the square root of the rf-frequency and is shorter for higher 
frequencies. Generally, no strong reasons exist to choose a particular rf-frequency 
but might become more important if control of the bunch length is important for the 
desired use of the accelerator. The bunch length is also determined by the rate of 
change of the rf-voltage in the accelerating cavities at the synchronous phase 


V(Ws) = Sai = Voosy,. (13.28) 
dy v= 


For a single frequency rf-system the bunch length can be shortened when the 
rf-voltage is increased. To lengthen the bunch the rf-voltage can be reduced up to a 
point where the rf-voltage would fail to provide a sufficient energy acceptance. 
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13.4 Phase-Space Manipulation 


The distribution of particles in phase space is given either by the injector character- 
istics and injection process or in the case of electron beams by the equilibrium of 
quantum excitation due to synchrotron radiation and damping. The result of these 
processes are not always what is desired and it is therefore useful to discuss some 
method to modify the particle distribution in phase space within the validity of 
Liouville’s theorem. 


13.4.1 Exchange of Transverse Phase-Space Parameters 


In beam dynamics we are often faced with the desire to change the beam size in 
one of the six phase-space dimensions. Liouville’s theorem tells us that this is not 
possible with macroscopic fields unless we let another dimension vary as well so as 
not to change the total volume in six-dimensional phase space. 

A very simple example of exchanging phase-space dimensions is the increase or 
decrease of one transverse dimension at the expense of its conjugate coordinate. A 
very wide and almost parallel beam, for example, can be focused to a small spot 
size where, however, the beam divergence has become very large. Obviously, this 
process can be reversed too and we describe such a process as the rotation of a beam 
in phase space or as phase-space rotation. 

A more complicated but often very desirable exchange of parameters is the 
reduction of beam emittance in one plane at the expense of the emittance in the 
other plane. Is it, for example, possible to reduce say the vertical beam emittance to 
zero at the expense of the horizontal emittance? Although Liouville’s theorem would 
allow such an exchange other conditions in Hamiltonian theory will not allow this 
kind of exchange in multidimensional phase space. The condition of symplecticity is 
synonymous with Liouville’s theorem only in one dimension. For n dimensions the 
symplecticity condition imposes a total of n(2n — 1) conditions on the dynamics 
of particles in phase space [3]. These conditions impose an important practical 
limitation on the exchange of phase space between different degrees of freedom. 
Specifically, it is not possible to reduce the smaller of two phase-space dimensions 
further at the expense of the larger emittance, or if the phase space is the same in 
two dimensions neither can be reduced at the expense of the other. 


13.4.2. Bunch Compression 


Longitudinal phase space can be exchanged also by special application of magnetic 
and rf-fields. Specifically, we often face the problem to compress the bunch to a very 
short length at the expense of energy spread. 
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For linear colliders the following problem exists. Very small transverse beam 
emittances can be obtained only in storage rings specially designed for low equilib- 
rium beam emittances. Therefore, an electron beam is injected from a conventional 
source into a damping ring specially designed for low equilibrium beam emittance. 
After storage for a few damping times the beam is ejected from the damping ring 
again and transferred to the linear accelerator to be further accelerated. During the 
damping process in the damping ring, however, the bunch length will also reach its 
equilibrium value which in practical storage rings is significantly longer than could 
be accepted in, for example, an S-band or X-band linear accelerator. The bunch 
length must be shortened. 

This is done in a specially designed beam transport line between the damping 
ring and linear accelerator consisting of a non-isochronous transport line and an 
accelerating section installed at the beginning of this line (Fig. 13.1). 

The accelerating section is phased such that the center of the bunch or the 
reference particle does not see any field while the particles ahead of the reference 
particle are accelerated and the particles behind are decelerated. Following this 
accelerating section, the particles travel through a curved beam transport system 
with a finite momentum compaction factor a, = L is : dz where Lo is the length 
of the beam transport line. Early particles within a bunch, having been accelerated, 
follow a longer path than the reference particles in the center of the bunch while the 
decelerated particles being late with respect to the bunch center follow a shortcut. 
All particles are considered highly relativistic and the early particles fall back toward 
the bunch center while late particles catch up with the bunch center. If the parameters 
of the beam transport system are chosen correctly the bunch length reaches its 
minimum value at the desired location at, for example, the entrance of the linear 
accelerator. From that point on the phase-space rotation is halted because of lack of 
momentum compaction in a straight line. Liouville’s theorem is not violated because 
the energy spread in the beam has been increased through the phase dependent 
acceleration in the bunch-compression system. 
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Fig. 13.1 Bunch-compressor system (schematic) 
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Formulating this bunch compression in more mathematical terms, we start from 
a particle distribution in longitudinal phase space described by the phase ellipse 


G6 er =e, =e . (13.29) 


where a is the longitudinal emittance and T is the particle location along the bunch 
measured from the bunch center such that t > 0 if the particle trails the bunch 
center. In the first step of bunch compression, we apply an acceleration 


Ae = —eVo sin @yt & —eVo Weft « (13.30) 


The particle energy is changed according to its position along the bunch. Replacing 
€ in (13.29) by € + Ae and sorting we get 


tie? — 27) eVompet + (HeVi or + G4)? =a’, (13.31) 


where the appearance of the cross term indicates the rotation of the ellipse. The 
second step is the actual bunch compression in a non-isochronous transport line of 
length L and momentum compaction Az/L = n,. €/(cpo). Traveling though this 
beam line, a particle experiences a shift in time of 


_ Az = NL € 
Be Be cpo- 


Again, the time t in (13.31) is replaced by t + Art to obtain the phase ellipse at 
the end of the bunch compressor of length L. The shortest bunch length occurs when 
the phase ellipse becomes upright. The coefficient for the cross term must therefore 
be zero giving a condition for minimum bunch length 


At 


(13.32) 
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evo = — : 
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(13.33) 


From the remaining coefficients of €? and t?, we get the bunch length after 
compression 


é € 
T= (13.34) 
Vir rf 
and the energy spread 
é = To Ort eV, (13.35) 


where we used the approximation ty eVo @¢ >> €o. This is justified because we must 
accelerate particles at the tip of the bunch by much more than the original energy 
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spread to obtain efficient bunch compression. Liouville’s theorem is obviously kept 
intact since 


ET = €y To. (13.36) 


For tight bunch compression, a particle beam with small energy spread is required 
as well as an accelerating section with a high rf-voltage and frequency. Of course, 
the same parameters contribute to the increase of the energy spread which can 
become the limiting factor in bunch compression. If this is the case, one could 
compress the bunch as much as is acceptable followed by acceleration to higher 
energies to reduce the energy spread by adiabatic damping, and then go through a 
bunch compression again. 

The momentum compaction factor a. =f f paz is often referred to as the 
Rs¢ of the compression lattice. This designation comes from the TRANSPORT 
nomenclature where a 6 x 6-transformation matrix is defined for the variables 
(x,x’, y,y’,s,5). Here s is the individual particle path length and 6 the relative 
energy deviation. The correlation of s with 5 is the Rs_ element and in linear 
De = $9 +R566. Recalling the definition of the momentum compaction 

1) 


factor a, = Ap/p, We Fecognize the identity a, = Lo - az = 


13.4.3 Alpha Magnet 


Bunch compression requires two steps. First, an accelerating system must create a 
correlation between particle energy and position. Then, we utilize a non-isochronous 
magnetic transport line to rotate the particle distribution in phase space until the 
desired bunch length is reached. 

The first step can be eliminated in the case of an electron beam generated in an 
rf-gun. Here the electrons emerge from a cathode which is inserted into an rf-cavity 
[4]. The electrons are accelerated immediately where the acceleration is a strong 
function of time because of the rapidly oscillating field. In Fig. 13.2 the result from 
computer simulations of the particle distribution in phase space [5] is shown for an 
electron beam from a 3 GHz rf-gun [6, 7] (Fig. 13.3). 

For bunch compression we use an alpha magnet which got its name from 
the alpha like shape of the particle trajectories. This magnet is made from a 
quadrupole split in half where the other half is simulated by a magnetic mirror plate 
at the vertical midplane. While ordinarily a particle beam would pass through a 
quadrupole along the axis or parallel to this axis this is not the case in an alpha 
magnet. The particle trajectories in an alpha magnet have very unique properties 
which were first recognized by Enge [8]. Most obvious is the fact that the entrance 
and exit point can be the same for all particles independent of energy. The same 
is true also for the total deflection angle. Borland [9] has analyzed the particle 
dynamics in an alpha magnet in detail and we follow his derivation here. Particles 
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Fig. 13.2 Particle distribution in phase space for an electron beam from an rf gun with thermionic 
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Fig. 13.3 Cross section of a microwave electron gun [6, 7] 


entering the alpha magnet fall under the influence of the Lorentz force 


F, = cE + e[v x BI, 


(13.37) 
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Fig. 13.4 Alpha magnet and 
particle trajectories 
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where we ignore the electrical field. Replacing the magnetic field by its gradient 
B = (gu3,0,gu1), we get in the coordinate system of Fig. 13.4 the equation of 
motion, 


du > | du 
— =-o°|— xu], (13.38) 
dz? dz 
where the scaling factor 
T 
o? (m~) = <5 = 5.86674 x 10° g(T/m) | (13.39) 
me? By By 


and the coordinate vector uw = (uy, U2, U3). 
By introducing normalized coordinates U = ou and path length S = oz, 
Eq. (13.38) becomes 


dU dU 
OF ew 0.U;)|. 13.4 
42 E x (U3 u1)| (13.40) 


The remarkable feature of (13.40) is the fact that it does not exhibit any 
dependence on the particle energy or the magnetic field. One solution for (13.40) is 
valid for all operating conditions and beam energies. The alpha shaped trajectories 
are similar to each other and scale with energy and field gradient according to the 
normalization introduced above. 

Equation (13.40) can be integrated numerically and in doing so, Borland obtains 
for the characteristic parameters of the normalized trajectory in an alpha magnet [9] 


64 = 0.71052 rad, Sy = 4.64210, 


A (13.41) 
= 40.70991 deg, U; = 1.81782, 
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where 6, is the entrance and exit angle with respect to the magnet face, S, is the 
normalized path length and U, is the apex of the trajectory in the alpha magnet. 
We note specifically that the entrance and exit angle 6, is independent of beam 
energy and magnetic field. It is therefore possible to construct a beam transport line 
including an alpha magnet. 

Upon introducing the scaling factor (13.39), (13.41) becomes equation 


ae Sy. By 
Sa(m) = J = 0.19165 y Feryay> (13.42) 
a(m) = & = 0.07505 ,/ 8. 


Bunch compression occurs due to the functional dependence of the path length 
on the particle energy. Taking the derivative of (13.42) with respect to the particle 
momentum po = fy, one gets the compression equation 


dsa(m) 0.07505 wad 


djo ¥228(T/m) po 


For bunch compression, higher momentum particles must arrive first because 
they follow a longer path and therefore fall back with respect to later particles. For 
example, an electron beam with the phase-space distribution from Fig. 13.2 becomes 
compressed as shown in Fig. 13.5. 

Because of the small longitudinal emittance of the beam it is possible to generate 
very short electron bunches of some 100 f-sec (rms) duration which can be used to 
produce intense coherent far infrared radiation [10]. 
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For high energy physics experimentation, it is sometimes important to have beams 
of transversely or longitudinally polarized particles. It is possible, for example, to 
create polarized electron beams by photoemission from GaAs cathodes [11]. From 
a beam dynamics point of view, we are concerned with the transport of polarized 
beams through a magnet system and the resulting polarization status. The magnetic 
moment vector of a particle rotates about a magnetic field vector. An electron with 
a component of a longitudinal polarization traversing a vertical dipole field would 
experience a rotation of the longitudinal polarization about the vertical axis. On the 
other hand, the vertical polarization would not be affected while passing through a 
horizontally bending magnet. This situation is demonstrated in Fig. 13.6. 

Similarly, longitudinal polarization is not affected by a solenoid field. In linear 
collider facilities, specific spin rotators are introduced to manipulate the electron 
spin in such a way as to preserve beam polarization and obtain the desired spin 
direction at an arbitrarily located collision point along the beam transport line. For 
the preservation of beam polarization, it is important to understand and formulate 
spin dynamics. 

Electron and positron beams circulating for a long time in a storage ring 
can become polarized due to the reaction of continuous emission of transversely 
polarized synchrotron radiation. The evolution of the polarization has been studied 
in detail by several researchers [12-15] and the polarization time is given by [15] 


i 5/3 rechny? 


a (13.44) 


Tpol 8 mc?p 


with a theoretically maximum achievable polarization of 92.38 %. The polarization 
time is a strong function of beam energy and is very long for low energies. At 


beam 


beam path 


Fig. 13.6 Precession of the particle spin in a transverse or longitudinal magnetic field 
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energies of several GeV, however, this time becomes short compared to the storage 
time of an electron beam in a storage ring. 

This build up of polarization is counteracted by nonlinear magnetic field errors 
which cause precession of the spin depending on the betatron amplitude and 
energy of the particle thus destroying polarization. Again, we must understand spin 
dynamics to minimize this depolarization. Simple relations exist for the rotation of 
the spin while the particle passes through a magnetic field. To rotate the spin by a 
magnetic field, there must be a finite angle between the spin direction and that of the 
magnetic field. The spin rotation angle about the axis of a transverse field depends 
on the angle between the spin direction o, (|o,| = 1) and magnetic field B, and is 
given by [14] 


1 
v= C1 (1+—) Jo. Bile, (13.45) 
where 

g—2 

Ne = oe = 0.00115965, (13.46) 
eNg =e 

Cre = a 0.0068033 (r m ) (13.47) 
mc 


g the gyromagnetic constant and B,¢ the integrated transverse magnetic field 
strength. Apart from a small term 1/y, the spin rotation is independent of the energy. 
In other words, a spin component normal to the field direction can be rotated by 90° 
while passing though a magnetic field of 2.309 Tm and it is therefore not important 
at what energy the spin is rotated. 

Equation (13.45) describes the situation in a flat storage ring with horizontal 
bending magnets only unless the polarization of the incoming beam is strictly 
vertical. Any horizontal or longitudinal polarization component would precess while 
the beam circulates in the storage ring. As long as this spin is the same for 
all particles the polarization would be preserved. Unfortunately, the small energy 
dependence of the precession angle and the finite energy spread in the beam would 
wash out the polarization. On the other hand the vertical polarization of a particle 
beam is preserved in an ideal storage ring. Field errors, however, may introduce a 
depolarization effect. Horizontal field errors from misalignments of magnets, for 
example, would rotate the vertical spin. Fortunately, the integral of all horizontal 
field components in a storage ring is always zero along the closed orbit and the net 
effect on the vertical polarization is zero. Nonlinear fields, however, do not cancel 
and must be minimized to preserve the polarization. 

A transverse spin can also be rotated about the longitudinal axis of a solenoid 
field and the rotation angle is 


e y 
== (ter — | leew? 13.48 
Vi =( +n) la x Bi| ( ) 


13.5 Polarization of a Particle Beam 455 


In a solenoid field it is therefore possible to rotate a horizontal polarization 
into a vertical polarization, or vice versa. Spin rotation in a longitudinal field is 
energy dependent and such spin rotations should therefore be done at low energies 
if possible. 

The interplay between rotations about a transverse axis and the longitudinal 
axis is responsible for a spin resonance which destroys whatever beam polarization 
exists. To show this, we assume a situation where the polarization vector precesses 
just by 277, or an integer multiple n thereof, while the particle circulates once around 
the storage ring. In this case yy = n 27, eB, C/E = 27, and we get from (13.45) 


n=ng(1+y). (13.49) 


For n = 1, resonance occurs at a beam energy of E = 440.14MeV. At this 
energy any small longitudinal field interacts with the polarization vector at the same 
phase, eventually destroying any transverse polarization. This resonance occurs at 
equal energy intervals of 


E, (MeV) = 440.14 + 440.65(n — 1) (13.50) 


and can be used in storage rings as a precise energy calibration by observing the loss 
of polarization due to spin-resonances at E,, while the beam energy is changed. 

In Fig. 13.7 spin dynamics is shown for the case of a linear collider where 
a longitudinally polarized beam is desired at the collision point. For example, a 
longitudinally polarized beam is generated at the source and accelerated in a linear 
accelerator. No rotation of the polarization direction occurs because no magnetic 
fields are involved yet. At an energy of 1.2 GeV the beam is transferred to a damping 
ring to reduce the beam emittance. To preserve polarization in the damping ring 
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the polarization must be vertical. In Fig. 13.7, we assume a longitudinal polarized 
beam coming out of the linear accelerator. A series of transverse fields amounting 
to 5 x 2.309 Tm creating a total deflection angle of 5 x 32.8°. The longitudinal 
spin from the linac is rotated by 5 x 90° to a horizontal spin as shown in Fig. 13.7 
by the open arrows. A solenoid of 6.34Tm rotates this spin about the beam axis to 
become a vertical spin which survives the storage time in the damping ring. After 
emittance reduction in a few damping times the beam is ejected again with vertical 
spin. Now we have to decide which spin orientation we need at the collision point of 
the linear collider. there are two cases. In the first case (left side of the ejected beam 
in Fig. 13.7 a solenoid of 6.34Tm rotates the spin to become transverse followed 
by a 32.8° bending section to make the spin longitudinal. This beam is injected 
back into the linac for collisions with longitudinal spin. of course, any bending 
downstream must be carefully implemented to preserve the spin. In the second case 
we turn the first solenoid after ejection off and the beam with vertical spin arrives at 
the second solenoid unaffected by the 32.8° bend. in the second solenoid the spin is 
rotated to become a horizontal spin which is then reinjected into the linac. Note in 
both cases there is some spin rotation after the second solenoid yet in both cases the 
effect on the spin is just what is desired to have a transverse or longitudinal spin in 
the linac. 

To rotate the longitudinal into a horizontal spin, followed by a solenoid field 
which rotates the horizontal into a vertical spin, is used in the transport line to the 
damping ring to obtain the desired vertical spin orientation. This orientation is in 
line with all magnets in the damping ring and the polarization can be preserved. 

To obtain the desired rotation in the beam transport magnets at a given energy, 
the beam must be deflected by a specific deflection angle which is from (13.45) 


j= pe (13.51) 
Ne lt+y 

Coming out of the damping ring the beam passes through a combination of two 
solenoids and two transverse field sections. Depending on which solenoid is turned 
on, we end up with a longitudinal or transverse polarization at the entrance of the 
linac. By the use of both solenoids any polarization direction can be realized. 


Problems 


13.1 (S). Show that the horizontal damping partition number is negative in a fully 
combined function FODO lattice as employed in older synchrotron accelerators. 
Why, if there is horizontal anti-damping in such synchrotrons, is it possible to retain 
beam stability during acceleration? What happens if we accelerate a beam and keep 
it orbiting in the synchrotron at some higher energy? 
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13.2. Use the high energy, linear part of the particle distribution in Fig. 13.2 and 
specify an alpha magnet to achieve best bunch compression at the observation point 
2m downstream from the magnet. By how much is the bunchlength increased if you 
now also include the variation of velocities. What are the alpha magnet parameters? 
Sketch the particle distribution at the entrance and exit of the alpha magnet. 


13.3. Specify relevant parameters for an electron storage ring made of FODO cells 
with the goal to produce a very short equilibrium bunch length of og = 1 mm. Use 
superconducting cavities for bunch compression with a maximum field of 10 MV/m 
and a total length of not more than 10 % of the ring circumference. 


13.4. Describe spin rotations in bending magnets in matrix formulation. 


13.5. Consider an electron storage ring for an energy of 30GeV and a bending 
radius of p = 500m and calculate the polarization time. The vertical polarization 
will be perturbed by spurious dipole fields. Use statistical methods to calculate the 
rms change of polarization direction per unit time and compare with the polarization 
time. Determine the alignment tolerances to get a polarized beam. 
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Chapter 14 
Beam Emittance and Lattice Design 


The task of lattice design for proton and ion beams can be concentrated to a 
pure particle beam optics problem. Transverse as well as longitudinal emittances 
of such beams are constants of motion and therefore do not depend on the 
particular design of the beam transport or ring lattice. This situation is completely 
different for electron and positron beams in circular accelerators where the emission 
of synchrotron radiation determines the particle distribution in six-dimensional 
phase space. The magnitude and characteristics of synchrotron radiation effects 
can, however, be manipulated and influenced by an appropriate choice of lattice 
parameters. We will discuss optimization and scaling laws for the transverse beam 
emittance of electron or positron beams in circular accelerators. 

Originally electron storage rings have been designed, optimized and constructed 
for the sole use as colliding beam facilities for high energy physics. The era of 
electron storage rings for experimentation at the very highest particle energies 
has, however, reached a serious limitation due to excessive energy losses into 
synchrotron radiation. Of course, such a limitation does not exist for proton and ion 
beams with particle energies up to the order of some tens of TeV’s and storage rings 
are therefore still the most powerful and productive research tool in high-energy 
physics. At lower and medium-energies electron storage rings with specially high 
luminosity still serve as an important research tool in high energy physics to study 
more subtle phenomena which could not be detected on earlier storage rings with 
lower luminosity like t- and B-factories. 

To overcome the energy limitation in electron colliding beam facilities, the idea 
of linear colliders which avoids energy losses into synchrotron radiation [1, 2] 
becomes increasingly attractive to reach ever higher center of mass energies for 
high-energy physics. Even though electron storage rings are displaced by this 
development as the central part of a colliding beam facility they play an important 
role for linear colliders in the form of damping rings to prepare very small emittance 
particle beams. 


© Springer International Publishing Switzerland 2015 459 
H. Wiedemann, Particle Accelerator Physics, Graduate Texts in Physics, 
DOI 10.1007/978-3-319-18317-6_14 


460 14 Beam Emittance and Lattice Design 


The single purpose of electron storage rings for high-energy physics has been 
replaced by a multitude of applications of synchrotron radiation from such rings in 
a large variety of basic and applied research disciplines. It is therefore appropriate 
to discuss specific design and optimization criteria for electron storage rings. 

Synchrotron radiation sources have undergone significant transitions and modi- 
fications over past years. Originally, most experiments with synchrotron radiation 
were performed parasitically on high energy physics colliding beam storage rings. 
Much larger photon fluxes could be obtained from such sources compared to any 
other source available. The community of synchrotron radiation users grew rapidly 
and so did the variety of applications and fields. By the time the usefulness of 
storage rings for high energy physics was exhausted some of these facilities were 
turned over to the synchrotron radiation community as fully dedicated radiation 
sources. Those are called first generation synchrotron radiation sources. They were 
not optimized for minimum beam emittance and maximum photon beam brightness. 
Actually, the optimization for high energy physics called for a maximum beam 
emittance to maximize collision rates for elementary particle events. The radiation 
sources were mostly bending magnets although the development and use of insertion 
devices started in these rings. Typically, the beam emittance is in the 100s of 
nanometer. 

As the synchrotron radiation community further grew, funds became available 
to construct dedicated radiation facilities. Generally, these rings were designed as 
bending magnet sources but with reduced beam emittance (< 100 nm) to increase 
photon brightness. The design emittances were much smaller than those in first 
generation rings but still large by present day standards. The use of insertion devices 
did not significantly affect the storage ring designs yet. These rings are called second 
generation rings. 

Third generation synchrotron radiation sources were and are being designed, 
constructed and operated now. These rings are specifically designed for insertion 
device radiation with minimum beam emittances below 20 nm down to 0.5 nm for 
maximum photon beam brightness. As such, they exhibit a large number of magnet- 
free insertion straight sections. 

Finally, fourth generation synchrotron radiation sources are the latest develop- 
ment for synchrotron radiation sources. Such sources are based on linear accelerator 
technology and the principle of single pass FELs where a high energy and high 
quality electron beam passing through a long undulator produces coherent undulator 
radiation in the X-ray regime. 

Whatever the applications, in most cases it is the beam emittance which will 
ultimately determine the usefulness of the storage ring design for a particular 
application. We will derive and discuss physics and scaling laws for the equilibrium 
beam emittance in storage rings while using basic phenomena and processes of 
accelerator physics as derived in previous sections. 
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The equilibrium beam emittance in electron storage rings is determined by the 
counteracting effects of quantum excitation and damping as has been discussed 
earlier. Significant synchrotron radiation occurs only in bending magnets and the 
radiation from each bending magnet contributes independently to both quantum 
excitation and damping. The contribution of each bending magnet to the equilibrium 
beam emittance can be determined by calculating the average values for ( |x? 7) and 


(x?) by 
Cc 
(| PH), = =/ |«? (z)| H(z) dz, (14.1) 
0 


where H is defined by (11.52) and C is the circumference of the storage ring. 
Obviously, this integral receives contributions only where there is a finite bending 
radius and therefore the total integral is just the sum of individual integrals over each 
bending magnet. 


14.1.1 FODO Lattice 


We consider here briefly the FODO lattice because of its simplicity and its ability to 
give us a quick feeling for the scaling of beam emittance with lattice parameters. The 
beam emittance can be manipulated at design time by adjusting (#/) to the desired 
value. To calculate the average value (H) in a FODO lattice is somewhat elaborate. 
Here, we are interested primarily in the scaling of the beam emittance with FODO 
lattice parameters. Recollecting the results for the symmetric solutions of the lattice 
functions in a FODO lattice (10.3), (10.5), (10.74) we notice the following scaling 
laws 


BoL, (14.2) 
B’ « L’, (14.3) 
nxL’/p, (14.4) 
n xL/p, (14.5) 


where L is the distance between the centers of adjacent quadrupoles. All three terms 
in the function H(z) = y(z) n? + 2a(z) nn’ + B(z) 7” scale in a similar fashion like 


Le ge i Se 
: [L—- e (14.6) 
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and the equilibrium emittance for a FODO lattice scales then like 


3 3 
U7 xP 07, (14.7) 


where © = ¢,/p is the deflection angle in each bending magnet. The proportionality 
factor depends on the beam focusing. A minimum can be reached for a focal length 
of |f| ~ 1.06Z in each half-quadrupole resulting in a minimum beam emittance 
achievable in a FODO lattice given in practical units by 


€, = Cyy 


e(radm) = 107!'E?(GeV) ©3 (deg?) , (14.8) 


where y = 277/Ny, Nm the number of bending magnets in the ring and Ny / 2 the 
total number of FODO cells in the ring. This result is significant because it exhibits 
a general scaling law of the beam emittance proportional to the square of the beam 
energy and the cube of the deflecting angle in each bending magnet, which is valid 
for all lattice types. The coefficients, though, vary for different lattices. While the 
beam energy is primarily driven by the desired photon spectrum, we find that high 
brightness photon beams from low emittance electron beams require a storage ring 
design composed of many lattice cells with a small deflection angle per magnet. Of 
course, there are some limits on how far one can go with this concept due to other 
limitations, not the least being size and cost of the ring which both grow with the 
number of lattice cells. 


14.1.2. Minimum Beam Emittance 


While the cubic dependence of the beam emittance on the bending angle is a 
significant design criterion, we discuss here a more detailed optimization strategy. 
The emittance is determined by the beam energy, the bending radius and the 
H-function. Generally, we have no choice on the beam energy which is mostly 
determined by the desired critical photon energy of bending magnet and insertion 
device radiation or cost. Similarly, the bending radius is defined by the ring 
geometry, desired spectrum etc. Interestingly, it is not the bending radius but rather 
the bending angle which influences the equilibrium beam emittance. The main 
process to minimize the beam emittance is to adjust the focusing such that the 
lattice functions in the bending magnets generate a minimum value for (Hy)-. The 
equilibrium beam emittance (13.18) depends only on the lattice function H(z) 
inside bending magnets. Independent of any lattice type, we may therefore consider 
this function only within bending magnets. For the purpose of this discussion we 
assume a regular periodic lattice, where all bending magnets are the same and 
all lattice functions within each bending magnet are the same. That allows us to 
concentrate our discussion just on one bending magnet. The contribution of any 
individual bending magnet to the beam emittance can be determined by calculation 
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of the average value for 


a 


1 
(Hp), = > Hy(z) dz, (14.9) 
Ly Jo 


where £, is the length of the bending magnet and the bending radius is assumed to 
be constant within a magnet. From here on, we ignore the index x since we assume 
a flat storage ring in the horizontal plane. All lattice functions are therefore to be 
taken in the horizontal plane. 

In evaluating the integral (14.1) we must include all contributions. The emission 
of photons depends only on the bending radius regardless of whether the bending 
occurs in the horizontal or vertical plane. Since for the calculation of equilibrium 
beam emittances only the energy loss due to the emission of photons is relevant 
it does not matter in which direction the beam is bent. The effect of the emission 
of a photon on the particle trajectory, however, is different for both planes because 
dispersion functions are different resulting in a different quantum excitation factor 
H. For a correct evaluation of the equilibrium beam emittances in the horizontal and 
vertical plane (14.1) should be evaluated for both planes by determining 1, and H, 
separately but including in both calculations all bending magnets in the storage ring. 

The integral in (14.1) can be evaluated for each magnet if the values of the 
lattice functions at the beginning of the bending magnet are known. With these 
initial values the lattice functions at any point within the bending magnet can be 
calculated assuming a pure dipole magnet. With the definitions of parameters from 
Fig. 14.1, we find the following expressions for the lattice functions in a bending 
magnet where z is the distance from the entrance of the magnet 


B(z) = Bo — 2a0z + yz’, 

a(z) = a — Voz, 

y(z) = Yo, (14.10) 
n(z) = No + Mz + p (1 — cos 8), 

n! (z) = ng + sin @. 
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Fig. 14.1 Lattice functions in a bending magnet 
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Here the deflection angle is 6 = z/p and Bo, a, Yo. No, No are the values of the 
lattice functions at the beginning of the magnet. Before we use these equations we 
assume lattices where no = nj = 0. The consequences of this assumption will be 
discussed later. Inserting (14.10) into (14.1) we get for small deflection angles after 
integration over one dipole magnet 


(Hy), = 407 Bo — a0 p70? + pp’ pO* +00), (14.11) 


where we have assumed the bending radius to be constant within the length , of 
the magnet. In a storage ring with dipole magnets of different strength, contributions 
from all magnets must be added to give the average quantum excitation term for the 
whole ring of length C 


(|x? | Ho), = Go DAI Po.) Svs (14.12) 


i 


where we sum over all magnets i with length ¢,;. In an isomagnetic ring the 
factor ( \«>| Hp/(k?)). becomes simply |k| (H,), and the equilibrium beam emit- 
tance is 


y? 


€iso = CF ~ lel (Hy): - (14.13) 


Inserting (14.11) into (14.13) we get for the beam emittance in the lowest order 
of approximation 


1 1 
€iso = Cay 263 E Bo — -ao + 5) ts +0(0%), (14.14) 
b 


where yo = y(Zo) 1s one of the lattice functions not to be confused with the particle 
energy y. 

Here we have assumed a separate function lattice where the damping partition 
number J, = 1. For strong bending magnets or sector magnets this assumption 
is not always justified due to focusing in the bending magnets and the damping 
partition number should be corrected accordingly. 

The result (14.14) shows clearly a cubic dependence of the beam emittance 
on the deflection angle © of the bending magnets which is a general lattice 
property since we have not yet made any assumption on the lattice type yet. 
Equation (14.14) exhibits minima with respect to both a@ and Bo. We solve the 
derivation d(H) /da = 0 for a and the derivative 0(H)/08o = 0 for Bo and get 
the optimum values for the Twiss functions at the entrance to the bending magnet 


[12 
Bovopt = rae (14.15a) 


oop: = V15 (14.15b) 
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and the minimum value for (H) is 


_ Sp 
4/15 


With this, the minimum obtainable beam emittance in any lattice is from (13.18) 


(HL) main (14.16) 


> (Hp(z)/p?) ei (eae 8} 
(7p), avis 


The results are very simple for small deflection angles but for angles larger than 
about 30° per bending magnet the error for (H),,., exceeds 10 % and higher order 
terms must be included. 

For simplicity, we assumed that the dispersion functions no = O and no = 0. 
This a desirable feature, because it means that the dispersion function is also zero 
in the insertion devices (ID) of a synchrotron radiation source. A finite dispersion 
function in IDs can lead to an undesirable increase of the beam emittance. 

In summary it has been demonstrated that for certain optimum lattice functions 
in the bending magnets the equilibrium beam emittance becomes a minimum. No 
assumption about a particular lattice has been made. Another observation is that the 
beam emittance is proportional to the third power of the magnet deflection angle 
and proportional to the square of the beam energy. Therefore many small deflection 
magnets interspersed within quadrupoles should be used to achieve a small beam 
emittance. Low emittance storage rings, therefore, are characterized by many short 
magnet lattice cells. 

This approach has been used for a number of third generation synchrotron light 
sources. However, soon it was apparent that modification of the dispersion function 
could produce even smaller beam emittance in spite of the effect of IDs. Only, as 
it became possible in recent years to reach sub-nm beam emittances with sufficient 
dynamic aperture did the choice of finite dispersions in the IDs become undesirable 
again. 


Edba,min © Cay (14.17) 


min 


14.2 Absolute Minimum Emittance 


In the previous section we found conditions which lead to a minimum beam 
emittance in an isomagnetic ring 


Nie 


L 
ee = =Cyy' H(z) dz (14.18) 
Pe Je 


1 
2 


The H-function in (14.1) is a nonlinear function of z and therefore any asym- 
metry of the Twiss functions lead to larger values of the H-integral. We may 
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therefore assume that a symmetric H-function may actually give the smallest value 
for the integral and emittance [3]. Following Teng we first discuss the case when 
No = No = 0, and ao = 0 at the center of the bending magnet. Note that this 
condition is different from the previous assumption for the Twiss functions. At a 


distance z from the magnet center the Twiss functions are for small deflection angles 
(© < 30°) 


2 


B (z) = Bo + a (z) = y (2) = Yo (14.19a) 
Bo ae 
n(z) © No + a 7 (z) = = (14.19b) 
p p 


The H-function (11.52) becomes then with @ the length of the bending magnet 
(H), = ii, (Bn? + 2ann! + yn?) dz, where the three integrals are 


ene 
tJIo 3p2 4 | 5 Bop? 16’ 
4 no e 8 e 
— | (2ann’) dz = — - 
A, Pann) de = 35s ~ Bop? 32 


1h? 9% mn, 2 & 2 £ 
_ dz= 2 
al (ym) de = e+ sap 8 t 2OBop? 32 


to result finally in 


¥ f 
No Bo No 3 1 5 
dz= L - L —————— 14.20 
| ees +(e aia) + 3206p? a 


First, we consider the case in which we set 79 = O and 


Bo_, 1s & ( bo c 
dz = = 
/ Hi@hde= api + aa0paat = pe Line + F008, 


This integral has a minimum versus fo and from a J Hi(z) dz = 0 the optimum 
value for Bo is 
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With this optimum f 71; (z) dz = was and the minimum beam emittance is 
Cy oe 1 
aan Nica 


This result has been derived by L. Teng in [3] and immediately rejected as 
“absolute minimum but useless”. This judgement was based on the realization that 
the dispersion function at either end of the bending magnet is not zero and must 
therefore be of finite value at the insertion straight section too. This is not good as 
discussed above because insertion devices will enhance the emittance where n 4 0 
and will also lead to an increased effective emittance for the synchrotron radiation 
users. This becomes a serious problem for very small beam emittances as can be 
obtained now about 30 years after his note. However there is a way out. 

If we cut one bending magnet in a cell into two pieces and install them as the first 
and last bending magnet we get a zero dispersion function for all straight sections 
without change of the beam emittance. There may be an arbitrary number of such 
bending magnets between those half-magnets and there are enough quadrupoles 
between the last bending magnet and the center of the straight section to match 
the horizontal betatron function to any desired value while the dispersion function 
is now zero in the IDs. The vertical betatron function does not contribute to the 
emittance and may be matched any way possible within reason. Within the unit 
cell we expect a periodic matching section between magnets. Incidentally, the same 
result can be obtained if we set @ = 0 in (14.11) and look again for the optimum 
fo. However, we must replace the total deflection angle by its half. 

Just to be complete in this discussion we assume for a moment that no 4 0 


ys 
No Bo 43 1 5 10 73 
dz = L e — e. 
| Me@Qa= Bet wo2° + s70Boe — 12Bop 


= (14.21) 


From ii J Ho(z) dz = 0 the optimum betatron function is 


Bo _ 3, 12np* _ nop 
2° 30° «4 2° 


Furthermore there is also an optimum dispersion function and evaluating 


a [ H(z) dz = 0 we get an optimum dispersion function in the middle of the 
bending magnet of 


e £ 
No — ea —0O 
24p 24 
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for which f H2(z) dz = aia and the minimum beam emittance 
el 
& = e (14.22) 


C, 2 
a Ee. 15/15 


which is even smaller. 

The reduction in emittance by a factor 3/2 looks desirable but now we have 
again a finite although small dispersion function in the long straight section. The 
dispersion function scales like the square root of the betatron functions and for a 
betatron phase of 90°, for example, the dispersion function 7* in the middle of the 


straight section is y* = ~ No = ve 4s . For the users the effective emittance 


#252 


: : 93 
€etf = €0,/1 + ORF? where the relative energy spread is & = Cy’? = € ave, 


Finally, the effective beam emittance is 


Sige 

€0 12 03 
To keep the effective beam emittance close to the natural emittance the deflection 
angle in the bending magnets must be large. In other words, the effective beam 
emittance for finite values of the dispersion function in insertion devices is much 
larger for modern low emittance storage rings with small deflection angles per 


bending magnet. for an emittance increase of a factor /2 the deflection angle per 
bending magnet must be © > 0.75 or 42.8°. 


14.3 Beam Emittance in Periodic Lattices 


To achieve a small particle beam emittance a number of different basic magnet 
storage ring lattice units are available and in principle most any periodic lattice 
unit can be used to achieve as small a beam emittance as desired. More practical 
considerations, however, will limit the minimum beam emittance achievable in a 
lattice. While all lattice types to be discussed have been used in existing storage 
rings and work well at medium to large beam emittances, differences in the 
characteristics of particular lattice types become more apparent as the desired 
equilibrium beam emittance is pushed to very small values. 

Of the large variety of magnet lattices that have been used in existing storage 
rings the most commonly used ones are based on the double bend achromat (DBA) 
and derivatives thereof. In the DBA lattice the straight sections are separated by 
two bending magnets forming an achromat. In more recent years this approach has 
been modified into a multi-bend achromat where several bending magnets form an 
achromat between the straight sections. This trend was stimulated by the desire to 
minimize the beam emittance ever more while utilizing the ©? scaling. However, 
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at the same time the ring circumference would grow equally because of the higher 
number of long insertion straight sections unless there are several bending magnets 
between straight sections, thus limiting circumference and costs. 


14.3.1 The Double Bend Achromat Lattice (DBA) 


The double bend achromat or DBA lattice is designed to make full use of the 
minimization of beam emittance by the proper choice of lattice functions as 
discussed earlier. In Fig. 14.2 the basic layout of this lattice is shown. 

A set of two or three quadrupoles provides the matching of the lattice functions 
into the bending magnet to achieve the minimum beam emittance. The central part of 
the lattice between the bending magnets may consist of one or more quadrupoles and 
its only function is to focus the dispersion function such that it is matched again to 
zero at the end of the next bending magnet resulting necessarily in a phase advance 
from bending magnet to bending magnet of close to 180°. This lattice type has been 
proposed first by Panofsky [4] and later by Chasman and Green [5] as an optimized 
lattice for a synchrotron radiation source. In Fig. 14.3 an example of a synchrotron 
light source based on this type of lattice is shown representing the solution of the 
design study for the European Synchrotron Radiation Facility ESRF [6]. 


V LAN 
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Fig. 14.3, European synchrotron radiation facility, ESRF [6] (one half of 16 superperiods). The 
lattice is asymmetric to provide a mostly parallel beam in one insertion and a small beam cross 
section in the other 
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The ideal minimum beam emittance (14.17) in this lattice type for small bending 
angles and an isomagnetic ring with J, = | is 


Cy ag3 
€é.. = oO (14.23) 
DBA 4 Ji 
or in more practical units 
Eppa (rad m) = 5.036 x 10-3 E?(GeV”) ©3 (deg?) . (14.24) 


This lattice type can be very useful for synchrotron light sources where many 
component and dispersion free straight sections are required for the installation of 
insertion devices. For damping rings this lattice type is not quite optimum since it 
is a rather “open” lattice with a low bending magnet fill factor and consequently a 
long damping time. Other more compact lattice types must be pursued to achieve in 
addition to a small beam emittance also a short damping time. 


14.3.2. The FODO Lattice 


The FODO lattice, shown schematically in Fig. 14.4 is the most commonly used and 
best understood lattice in storage rings optimized for high-energy physics colliding 
beam facilities where large beam emittances are desired. This choice is obvious 
considering that the highest beam energies can be achieved while maximizing the 
fill factor of the ring with bending magnets. 

This lattice provides the most space for bending magnets compared to other 
lattices. The usefulness of the FODO lattice, however, is not only limited to high- 
energy large emittance storage rings. By using very short cells very low beam 
emittances can be achieved as has been demonstrated in the first low emittance 
storage ring designed [7] and constructed [8] as a damping ring for the linear collider 
SLC to reach an emittance of 11 x 10~° mat 1 GeV. 

The lattice functions in a FODO structure have been derived and discussed in 
detail and are generally determined by the focusing parameters of the quadrupoles. 


Fig. 14.4 FODO lattice —____— cell length: 2L 
(schematic) 


I 
11/2QF B QD B 1/2QF 
I 


14.3 Beam Emittance in Periodic Lattices 471 


Fig. 14.5 Electron beam 7] 
emittance of a FODO lattice <H>/ (p cD) 
as a function of the betatron 100 + — 

phase advance per half cell in 
the deflecting plane 
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Since FODO cells are not achromatic the dispersion function is in general not zero 
at either end of the bending magnets. 

The beam emittance can be derived analytically in thin lens approximation by 
integrating the quantum excitation factor along the bending magnets. The result is 
shown in Fig. 14.5 where the function [(H) /(o@°)] is plotted as a function of the 
betatron phase advance per FODO half cell which is determined by the focal length 
of the quadrupoles. 

The beam emittance for an isomagnetic FODO lattice is given by [9] 


ty (H) 
=cC io _— -—., 
vi fy. pO? 


(14.25) 


€ropo 


where f)9 is the actual effective length of one bending magnet and 2¢, the length 
of a FODO cell. From Fig. 14.5 it becomes apparent that the minimum beam 
emittance is reached for a betatron phase of about 136.8° per FODO cell. In this 
case (H)/(p @?) = 1.25 and the minimum beam emittance in such a FODO lattice 
in practical units is 


l 
Eropo (tad m) = 97.53 x 10° > B°(Gev") @3 (deg?) . (14.26) 
b,0 


Comparing the minimum beam emittance achievable in various lattice types 
the FODO lattice seems to be the least appropriate lattice to achieve small beam 
emittances. This, however, is only an analytical distinction. FODO cells can be made 
much shorter than the lattice units of other structures and for a given circumference 
many more FODO cells can be incorporated than for any other lattice. As a 
consequence, the deflection angles per FODO cell can be much smaller. For very 


472 14 Beam Emittance and Lattice Design 


low emittance storage ring, therefore, it is not a priori obvious that one lattice 
is better than another. However, additional requirements like number of desired 
insertion straight sections for a particular application must be included in the 
decision for the optimum storage ring lattice. 


14.3.3, Optimum Emittance for Colliding Beam Storage Rings 


The single most important parameter of colliding beam storage rings is the 
luminosity and most of the design effort for such facilities is aimed at maximizing 
the collision rate. As a consequence of the beam-beam effect, the beam emittance 
must be chosen to be as large as possible for maximum luminosity as will be 
discussed in Sect. 21.2.2. Since for most high energy storage rings a FODO lattice is 
employed it is clear that for maximum emittance the phase advance per cell should 
be kept low as indicated in Fig. 14.5. Of course, there is a practical limit given by 
increasing magnet apertures and associated costs. 

In linear colliders there is no beam stability concern due to the beam-beam effect 
like in a storage ring and a much smaller beam cross section can be chosen. The 
limit here is the total beam-beam disruption due to the large electromagnetic fields 
at the surface of the colliding beams. Strong synchrotron radiation introduce, for 
example, significant energy losses which jeopardize the analysis of high energy 
physics events. 


Problems 


14.1 (S). Derive an approximate expression of the beam emittance in an isomag- 
netic FODO lattice as a function of phase per cell and determine the minimum value 
of the emittance. Use a lattice which is symmetric in both planes and assume that 
the bending magnets are as long as the half cells (€) = L). 


14.2 (S). Consider a storage ring made of FODO cells at an energy of your choice. 
How many bending magnets or half cells do you need to reach a beam emittance of 
no more than €, = 5-107? m? 
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Part V 
Perturbations 


Chapter 15 
Perturbations in Beam Dynamics 


The study of beam dynamics under ideal conditions is the first basic step toward 
the design of a beam transport system. In the previous sections we have followed 
this path and have allowed only the particle energy to deviate from its ideal 
value. In a real particle beam line or accelerator we may, however, not assume 
ideal and linear conditions. More sophisticated beam transport systems require the 
incorporation of nonlinear sextupole fields to correct for chromatic aberrations. 
Deviations from the desired field configurations can be caused by transverse or 
longitudinal misplacements of magnets with respect to the ideal beam path. Of 
similar concern are errors in the magnetic field strength, undesirable field effects 
caused in the field configurations at magnet ends, or higher order multipole fields 
resulting from design, construction, and assembly tolerances. Still other sources of 
errors may be beam-beam perturbations, magnetic detectors for high energy physics 
experiments, insertion devices in beam transport systems or accelerating sections, 
which are not part of the magnetic lattice configurations. The impact of such errors 
is magnified in strong focusing beam transport systems as has been recognized soon 
after the invention of the strong focusing principle. Early overviews and references 
can be found for example in [1, 2]. 

A horizontal bending magnet has been characterized as a magnet with only a 
vertical field component. This is true as long as this magnet is perfectly aligned, 
in most cases perfectly level. Small rotations about the magnet axis result in the 
appearance of horizontal field components which must be taken into account for 
beam stability calculations. 

We also assumed that the magnetic field in a quadrupole or higher multipole 
vanishes at the center of magnet axis. Misalignments of any multipole generates all 
lower order perturbations which is known as “spill-down”’. 

In addition, any multipole can be rotated by a small angle with respect to the 
reference coordinate system. As a result we observe the appearance of a small 
component of a rotated or skew quadrupole causing coupling of horizontal and 
vertical betatron oscillations. 
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Although such misalignments and field errors are unintentional and undesired, 
we have to deal with their existence since there is no way to avoid such errors 
in a real environment. The particular effects of different types of errors on beam 
stability will be discussed. Tolerance limits on these errors as well as corrective 
measures must be established to avoid destruction of the particle beam. Common to 
all these perturbations from ideal conditions is that they can be considered small 
compared to forces of linear elements. We will therefore discuss mathematical 
perturbation methods that allow us to determine the effects of perturbations and 
to apply corrective measures for beam stability. The equations of motion with 
perturbations are 


ul + (k+ kz) U = isn (Z) XY" where u = x or y (15.1) 


andr,s = 0,1,2... withr+s5-+ 1 =n andz the order of perturbation. In the 
remainder of this text whenever r = 0 or s = 0 we use p, instead of Paar 


15.1 Magnet Field and Alignment Errors 


First we consider field errors created by magnet misalignments like displacements 
or rotations from the ideal positions. Such magnet alignment errors, however, are 
not the only cause for field errors. External sources like the earth magnetic field, 
the fields of nearby electrical current carrying conductors, magnets connected to 
vacuum pumps or ferromagnetic material in the vicinity of beam transport magnets 
can cause similar field errors. For example electrical power cables connected to 
other magnets along the beam transport line must be connected such that the 
currents in all cables are compensated. This occurs automatically for cases, where 
the power cables to and from a magnet run close together. In circular accelerators 
one might, however, be tempted to run the cables around the ring only once to save 
the high material and installation costs. This, however, causes an uncompensated 
magnetic field in the vicinity of cables which may reach as far as the particle beam 
pipe. The economic solution is to seek electrical current compensation among all 
magnet currents by running electrical currents in different directions around the 
ring. Careful design of the beam transport system can in most cases minimize the 
impact of such field perturbations while at the same time meeting economic goals. 

Multipole errors in magnets are not the only cause for perturbations. For beams 
with large divergence or a large cross section, kinematic perturbation terms may 
have to be included. Such terms are neglected in paraxial beam optics discussed 
here, but will be derived in detail later. 


'Note that for this definition py) = 0 and therefore there is no conflict with the momentum po. 
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15.1.1 Self Compensation of Perturbations 


The linear superposition of individual dipole contributions to the dispersion function 
can be used in a constructive way. Any contribution to the dispersion function by a 
short magnet can be eliminated again by a similar magnet located 180° in betatron 
phase downstream from the first magnet. If the betatron function at the location of 
both magnets is the same, the magnet strengths are the same too. For quantitative 
evaluation we assume two dipole errors introducing a beam deflection by the angles 
6; and 6, at locations with betatron functions of 6; and B2 and betatron phases Wy 
and 2, respectively. Since the dispersion function or fractions thereof evolve like 
a sine like function, we find for the variation of the dispersion function at a phase 


Wz) = Wo 
AD(2) = 6: VBA: sin [W (2) — Wi] + 62-V/Bpo sin [W(2) — Wo]. (15.2) 


For the particular case where 0; = 02 and 6; = 2 we find 


AD(z)=0 for w-W = Qn I)z. (15.3) 


AD(z)=0 for W—wW, = 2nz, (15.4) 


where 7 is an integer. This property of the dispersion function can be used in periodic 
lattices if, for example, a vertical displacement of the beam line is desired. In this 
case we would like to deflect the beam vertically and as soon as the beam reaches 
the desired elevation a second dipole magnet deflects the beam by the same angle 
but opposite sign to level the beam line parallel to the horizontal plane again. In an 
arbitrary lattice such a beam displacement can be accomplished without creating a 
residual dispersion outside the beam deflecting section if we place two vertical or 
rotated bending magnets of opposite sign at locations separated by a betatron phase 
of 27. 

Similarly, a deflection in the same direction by two dipole magnets does not 
create a finite dispersion outside the deflecting section if both dipoles are separated 
by a betatron phase of (2n + 1)z. This feature is important to simplify beam- 
transport lattices since no additional quadrupoles are needed to match the dispersion 
function. 

Sometimes it is necessary to deflect the beam in both the horizontal and vertical 
direction. This can be done in a straightforward way by a sequence of horizontal and 
vertical bending sections leading, however, to long beam lines. In a more compact 
version, we combine the beam deflection in both planes within one or a set of 
magnets. To obtain some vertical deflection in an otherwise horizontally deflecting 
beam line, we may rotate a whole arc section about the beam axis at the start of 
this section to get the desired vertical deflection. Some of the horizontal deflection 
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is thereby transformed into the vertical plane. At the start of such a section we 
introduce by the rotation of the coordinate system a sudden change in all lattice 
functions. Specifically, a purely horizontal dispersion function is coupled partly 
into a vertical dispersion. If we rotate the beam line and coordinate system back 
at a betatron phase of 2nz downstream from the start of rotation, the coupling of the 
dispersion function as well as that of other lattice functions is completely restored. 
For that to work without further matching, however, we require that the rotated 
part of the beam line has a phase advance of 2nz in both planes as, for example, 
a symmetric FODO lattice would have. This principle has been used extensively 
for the terrain following beam transport lines of the SLAC Linear Collider to the 
collision point. 


15.2 Dipole Field Perturbations 


Dipole fields are the lowest order magnetic fields and therefore also the lowest order 
field perturbations. The equation of motion (15.1) is in this case 


u" + (k+xK,)u=pi(z), (15.5) 


where pj, (z) represents any dipole field error, whether it be chromatic or not. In 
trying to establish expressions for dipole errors due to field or alignment errors, 
we note that the bending fields do not appear explicitly anymore in the equations 
of motions because of the specific selection of the curvilinear coordinate system 
and it is therefore not obvious in which form dipole field errors would appear in 
the equation of motion (15.5). In (6.95) or (6.96) we note, however, a dipole field 
perturbation due to a particle with a momentum error 6. This chromatic term kK, 6 is 
similar to a dipole field error as seen by a particle with the momentum BEo(1 + 4). 
For particles with the ideal energy we may therefore replace the chromatic term 
KO by a field error —Ax. Perturbations from other sources may be obtained by 
variations of magnet positions (Ax, Ay) or magnet strengths. Up to second order, 
the horizontal dipole perturbation terms due to magnet field (Ax) and alignment 
errors (Ax, Ay) are from (6.95) 


Pi,x(Z) = —Akyo + (k2y + ko) Ax + (2k Ako + Ak) Ax (15.6) 
1 2 2, 
— zm (Ax — 2x,Ax — Ay” + 2y,Ay) + O(3), 


where we used x = xg + x, — Ax and y = yg + y, — Ay with (xg, yg) the betatron 
oscillations and (x, y.) the closed orbit deviation in the magnet. In the presence of 
multipole magnets the perturbation depends on the displacement of the beam with 
respect to the center of multipole magnets. 
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There is a similar expression for vertical dipole perturbation terms and we get 
from (6.96) ignoring vertical bending magnets (Kyo = 0) but not vertical dipole 
errors, Akyo # 0, 


Piy(Z) = —Akyo — ko Ay — mx Ay + yeAx) + O(3). (15.7) 


Such dipole field errors deflect the beam from its ideal path and we are interested 
to quantify this perturbation and to develop compensating methods to minimize the 
distortions of the beam path. In an open beam transport line the effect of dipole field 
errors on the beam path can be calculated within the matrix formalism. 

A dipole field error at point z, deflects the beam by an angle 6. If M(zm|zx) is the 
transformation matrix of the beam line between the point z,, where the kick occurs, 
and the point zm, where we observe the beam position, we find a displacement of 
the beam center line, for example, in the x-plane by 


Ax=M0, (15.8) 


where Mj» is the element of the transformation matrix in the first row and the 
second column. Due to the linearity of the equation of motion, effects of many 
kicks caused by dipole errors can be calculated by summation of individual beam 
center displacements at the observation point z,, for each kick. The displacement of 
a beam at the location z,, due to many dipole field errors is then given by 


Ax(Zm) = Y> Mi2(emlzk) Ox. (15.9) 
k 


where 6 are kicks due to dipole errors at locations z% < Zp and Mj2(Zm|zx) the 
M)2-matrix element of the transformation matrix from the perturbation at z, to the 
monitor at Zm. 

Generally, we do not know the strength and location of errors. Statistical methods 
are applied therefore to estimate the expectation value for beam perturbation and 
displacement. With Mj2(zm|zk) = WV BmPx sin(Wm— Wx) we calculate the root-mean- 
square of (15.9) noting that the phases 7% are random and cross terms involving 
different phases cancel. With (07) = 07 and (Au’) = o; we get finally from (15.9) 
the expectation value of the path distortion o,, at zm due to statistical errors with a 
standard value og 


Ou = VB (Bx) V Nooo . (15.10) 


where (fx) is the average betatron function at the location of errors and Ng the 
number of dipole field errors. Random angles are not obvious, but if we identify 
the potential sources, we may be in a better position to estimate og. For example, 
alignment errors 04, of quadrupoles are related to og by 09 = klgoa,, where i. 


f 
k£, are the inverse focal lengths of the quadrupoles. 
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15.2.1 Dipole Field Errors and Dispersion Function 


The dispersion function of a beam line is determined by the strength and placement 
of dipole magnets. As a consequence, dipole field errors also contribute to the 
dispersion function and we determine such contributions to the dispersion function 
due to dipole field errors. First, we note from the general expression for the linear 
dispersion function that the effect of dipole errors adds linearly to the dispersion 
function by virtue of the linearity of the equation of motion. We may therefore 
calculate separately the effect of dipole errors and add the results to the ideal 
solution for the dispersion function. 


15.2.2 Perturbations in Open Transport Lines 


While these properties are useful for specific applications, general beam dynamics 
requires that we discuss the effects of errors on the dispersion function in a more 
general way. To this purpose we use the general equation of motion up to linear 
terms in 6 and add constant perturbation terms. In the following discussion we use 
only the horizontal equation of motion, but the results can be immediately applied to 
the vertical plane as well. The equation of motion with only linear chromatic terms 
and a quadratic sextupole term is then 


x" + (k+«,)x = kx — bmx’ (1 — 8) — Ak,(1 — 8) + O2). (15-11) 


We observe two classes of perturbation terms, the ordinary chromatic terms 
and those due to field errors. Taking advantage of the linearity of the solution we 
decompose the particle position into four components 


x= xptxetm5 + 0,8, (15.12) 


where xg is the betatron motion, x, the distorted beam path or orbit, 7, the ideal 
dispersion function and v, the perturbation of the dispersion that derives from field 
errors. The individual parts of the solution then are determined by the following set 
of differential equations: 


xpt (k+ K2)xp = — pay + mxpxe. ad 
x! pie K2) Xe = —Ak, — 5mxe, (15.13b) 
ni dee K2) Ne = Ky, (15.13c) 


UY + (KA KR) vy = FAK, + dmxz + ke — mxeNy. (15.13d) 
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In the ansatz (15.12) we have ignored the energy dependence of the betatron 
function since it will be treated separately as an aberration and has no impact on the 
dispersion. We have solved (15.13a)—(15.13c) before and concentrate therefore on 
the solution of (15.13d). Obviously, the field errors cause distortions of the beam 
path x, which in turn cause additional variations of the dispersion function. The 
principal solutions are 


C(z) = VB(2)/Bocos[W(z) — Wol . (15.14) 
S(2) = VB@Bosin [W(2) — vol. (15.15) 


and the Greens function becomes 
G(z,0) = S(2)C() — S(o)C() = VB@BO)sinY@-Wo)]. (15.16) 
With this the solution of (15.13d) is 
Ux(Z) = —Xe(Z) (15.17) 


+ VB / (k— mn) VB@xe(0) sinlys(2) — valO)Nde. 


Here, we have split off the solution for the two last perturbation terms in (15.13d) 
which, apart from the sign, is exactly the orbit distortion (15.13b). In a closed lattice 
we look for a periodic solution of (15.17), which can be written in the form 


Bx (z) 


2 sina Vv, 


Vy (Z) = —Xe(Z) + (15.18) 


z+Lp 
x i (k — mng) VBa©)xe() c08 {vs [pe(2) — gel) + m]} dE, 


where x,(z) is the periodic solution for the distorted orbit and L, the length of the 
orbit. In the vertical plane we have exactly the same solution except for a change in 
sign for some terms 


Uy(Z) = —ye(z) — eas (15.19) 


Z+Lp 
x / (k — mnx) By(S)ye(S) COs [vy ((z) = py(S) + m)| dé. 


For reasons of generality we have included here sextupoles to permit chromatic 
corrections in long curved beam lines with bending magnets. The slight asymmetry 
due to the term mn, in the vertical plane derives from the fact that in real 


484 15 Perturbations in Beam Dynamics 


accelerators only one orientation of the sextupoles is used. Due to this orientation 
the perturbation in the horizontal plane is —5mx? ( 1 — 5) and in the vertical plane 
mxy(1 — 6). In both cases we get the term m7, in the solution integrals. 

Again we may ask how this result varies as we add acceleration to such a 
transport line. Earlier in this section we found that the path distortion is independent 
of acceleration under certain periodic conditions. By the same arguments we can 
show that the distortion of the dispersions (15.18) and (15.19) are also independent 
of acceleration and the result of this discussion can therefore be applied to any 
periodic focusing channel. 


15.2.3 Existence of Equilibrium Orbits 


Particles orbiting around a circular accelerator perform in general betatron oscilla- 
tions about the equilibrium orbit and we will discuss properties of this equilibrium 
orbit. Of fundamental interest is of course that such equilibrium orbits exist at all. 
We will not try to find conditions for the existence of equilibrium orbits in arbitrary 
electric and magnetic fields but restrict this discussion to fields with midplane 
symmetry as they are used in particle beam systems. The existence of equilibrium 
orbits can easily be verified for particles like electrons and positrons because these 
particles radiate energy in form of synchrotron radiation as they orbit around the 
ring. 

We use the damping process to find the eventual equilibrium orbit in the presence 
of arbitrary dipole perturbations. To do this, we follow an orbiting particle starting 
with the parameters x = 0 and x’ = 0. This choice of initial parameters will not 
affect the generality of the argument since any other value of initial parameters is 
damped independently because of the linear superposition of betatron oscillations. 

As an electron orbits in a circular accelerator it will encounter a number of kicks 
from dipole field errors or field errors due to a deviation of the particle energy from 
the ideal energy. After one turn the particle position is the result of the superposition 
of all kicks the particle has encountered in that turn. Since each kick leads to a 
particle oscillation given by 


x(z) = V B(z)Bo9 sin[ve(z) — vg] 


we find for the superposition of all kicks in one turn 


x(z) = VB >~ VBi6;sin[ve(2) — veil. (15.20) 


where the index i indicates the location of the kicks. We ask ourselves now what 
is the oscillation amplitude after many turns. For that we add up the kicks from all 
past turns and include damping effects expressed by the factor e~*”°/* on the particle 
oscillation amplitude, where Tp is the revolution time, kT is the time passed since 
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the kick occurred k turns ago, and t the damping time. The contribution to the 


betatron oscillation due to kicks k turns ago, is then given by 


J B(z)e eo!" > V Bi9; sin. vk + vo(z) — vgi). (15.21) 


Ax, (2) = 


Adding the contributions from all past turns results in the position x(z) of the particle 
(15.22) 


x / B@e "lt 2 V B:6; sin[2avk + vo(z) — vei]. 


k=0 


X(z) = 


After some rearranging (15.22) becomes 


CO CO 
x(z) = Co > e *T0/T sin(2avk) + So e 70/7 cos(2avk) (15.23) 
k=0 k=0 
where 
Co = Di VB@)Bi6; cos[y(z) — gi), 
. (15.24) 
So = Vi; VB(@)B:6; sin|p(z) — gil. 
With the definition g = e~7°/* we use the mathematical identities 
= sin 27 v 
Ye!" sin(Qavk) = zs (15.25) 
an! 1 —2qcos2rv + q? 
and 
= 1—qcos2zv 
Y_ #0!" cos(2arvk) = 2 ; (15.26) 
p= 1 —2qcos2rv + q 
and get finally instead of (15.23) 
Coq sin22v + Se(1 — qceos2zrv) (15.27) 


ce 1 — 2qcos2rv + q? 


The revolution time is generally much shorter than the damping time Ty < T 
1. In this approximation we get after some manipulation and 


and therefore gq ~ 
using (15.24) 
(15.28) 


—— B@) dX V Bi; cos[vy(z) — vg; + vz]. 


@= 2sin wv 
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Equation (15.28) describes the particle orbit reached by particles after some 
damping times. The solution does not include anymore any reference to earlier turns 
and kicks except those in one turn and the solution therefore is a steady state solution 
defined as the equilibrium orbit . 

The cause and nature of the kicks 6; is undefined and can be any perturbation, 
random or systematic. A particular set of such errors are systematic errors in the 
deflection angle for particles with a momentum error 6 for which 6; = x;£j6 is the 
deflection angle of the bending magnet i. These errors are equivalent to those that 
led to the dispersion or n-function. Indeed, setting n(z) = x(z)/é in (15.28) we get 
the solution (10.91) for the 7 function. The trajectories x(z) = n(z)é therefore are 
the equilibrium orbits for particles with a relative momentum deviation 6 = Ap/po 
from the ideal momentum po. 

In the next subsection we will discuss the effect of random dipole field errors 6; 
on the beam orbit. These kicks, since constant in time, are still periodic with the 
periodicity of the circumference and lead to a distorted orbit which turns out to be 
again equal to the equilibrium orbit found here. 

To derive the existence of equilibrium orbits we have made use of the damping 
of particle oscillations. Since this damping derives from the energy loss of particles 
due to synchrotron radiation we have proof only for equilibrium orbits for radiating 
particles like electrons and positrons. The result obtained applies also to any other 
charged particle. The damping time may be extremely long, but is not infinite and 
a particle will eventually reach the equilibrium orbit. The concept of equilibrium 
orbits is therefore valid even though a proton or ion will never reach that orbit in a 
finite time but will oscillate about it. 


15.2.4 Closed Orbit Distortion 


The solution (15.28) for the equilibrium orbit can be derived also directly by solving 
the equation of motion. Under the influence of dipole errors the equation of motion 
is 


u’ + K(z)u = pi(z). (15.29) 


where the dipole perturbation po(z) is independent of coordinates (x, y) and energy 
error 6. This differential equation has been solved earlier in Sect. 5.5.4, where a 
dipole field perturbation was introduced as an energy error of the particle. Therefore, 
we can immediately write down the solution for an arbitrary beam line for which 
the principal solutions C(z) and S(z) are known 


u(z) = C(z) uo + S(z) up + P(z) 6 (15.30) 


15.2 Dipole Field Perturbations 487 
with 
PQ) = [rw [S()C(E) — S(Z)C()] de. (15.31) 


The result (15.30) can be interpreted as a composition of betatron oscillations 
with initial values (uo, uj) and a superimposed perturbation P(z) which defines the 
equilibrium trajectory for the betatron oscillations. In (15.31) we have assumed 
that there is no distortion at the beginning of the beam line, P(O) = O. If there 
were already a perturbation of the reference trajectory from a previous beam line, 
we define a new reference path by linear superposition of new perturbations to 
the continuation of the perturbed path from the previous beam line section. The 
particle position (uo, u,) is composed of the betatron oscillation (wog, uh p) and the 
perturbation of the reference path (uc, Up.) With up = ung + Uoe and uy = up pt Ude 
we get 


u(z) = [Hop + uhpS(2)] + [wocC(z) + up,S(z)] + P(z). (15.32) 


In a circular accelerator we look for a self-consistent periodic solution. Because 
the differential equation (15.29) is identical to that for the dispersion function, the 
solution must be similar to (10.91) and is called the closed orbit, reference orbit or 
equilibrium orbit given by 


v B(2) 


2sin wv 


Z+C _ 
f pilt)V/ BE) cos [ve(z) — ve(g) + va] dé, (15.33) 


Uc(Z) = 


where C is the circumference of the accelerator. We cannot anymore rely on a super- 
periodicity of length L, since the perturbations p;s,(¢) due to misalignment or field 
errors are statistically distributed over the whole ring. Again the integer resonance 
character discussed earlier for the dispersion function is obvious, indicating there 
is no stable orbit if the tune of the circular accelerator is an integer. The influence 
of the integer resonance is noticeable even when the tune is not quite an integer. 
From (15.33) we find a perturbation p;(z) to have an increasing effect the closer 
the tune is to an integer value. The similarity of the closed orbit and the dispersion 
function in a circular accelerator is deeper than merely mathematical. The dispersion 
function defines closed orbits for energy deviating particles approaching the real 
orbit (15.33) as 6 > 0. 

Up to second order the horizontal and vertical dipole perturbation terms due to 
magnet field and alignment errors are given by (15.6) and (15.7). In the presence of 
multipole magnets the perturbation depends on the displacement of the beam with 
respect to the center of multipole magnets. 

A vertical closed orbit distortion is shown in Fig. 15.1 for the PEP storage ring. 
Here, a Gaussian distribution of horizontal and vertical alignment errors with an rms 
error of 0.05 mm in all quadrupoles has been simulated. In spite of the statistical 
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Fig. 15.1 Simulation of the closed orbit distortion in the sixfold symmetric PEP lattice due to 
statistical misalignments of quadrupoles by an amount (Ax);ms = (Ay):ms = 0.05mm 
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Fig. 15.2 Closed orbit distortion of Fig. 15.2 in normalized coordinates as a function of the 
betatron phase g 


distribution of errors a strong oscillatory character of the orbit is apparent and 
counting oscillations we find 18 oscillations being equal to the vertical tune of PEP 
as we would expect from the denominator of (15.33). 

We also note large values of the orbit distortion adjacent to the interaction points 
(dashed lines), where the betatron function becomes large, again in agreement 
with expectations from (15.33) since ue « af B. A more regular representation 
of the same orbit distortion can be obtained if we plot the normalized closed 
orbit u.(z)/+/ B(z) as a function of the betatron phase w(z) shown in Fig. 15.2. In 
this representation the strong harmonic close to the tune becomes evident while 
the statistical distribution of perturbations appears mostly in the amplitude of the 
normalized orbit distortion. 

For the sake of simplicity terms of third or higher order as well as terms asso- 
ciated with nonlinear magnets have been neglected in both Eqs. (15.6) and (15.7). 
All terms in (15.6) and (15.7) are considered small perturbations and can therefore 
be treated individually and independent of other perturbations terms. Sextupole and 
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higher multipole perturbations depend on the orbit itself and to get a self-consistent 
periodic solution of the distorted orbit, iteration methods must be employed. 

Solutions for equilibrium orbits can be obtained by inserting the perturba- 
tion (15.6) or (15.7) into (15.33). First, we will concentrate on a situation, where 
only one perturbing kick exists in the whole lattice, assuming the perturbation to 
occur at z = z and to produce a kick & = f pi(¢) dé in the particle trajectory. The 
orbit distortion at a location z < z, in the lattice is from (15.33) 


uo(<) = +VB@BED &— eee VON (15.34) 


sin Vv 


If on the other hand we look for the orbit distortion downstream from the 
perturbation z > z, the integration must start at z, follow the ring to z = C and 
then further to z = z + C. The kick, therefore, occurs at the place C + z with the 
phase y(C) + 9(z) = 2% + (z,) and the orbit is given by 


uo= ora (15.35) 


sin wv 


This mathematical distinction of cases z < zy, and z > z is a consequence of 
the integration starting at z and ending at z + C and is necessary to account for the 
discontinuity of the slope of the equilibrium orbit at the location of the kick. At 
the point z = z, obviously both equations are the same. In Fig. 15.3 the normalized 
distortion of the ideal orbit due to a single dipole kick is shown. In a linear lattice this 
distortion is independent of the orbit and adds in linear superposition. If, however, 
sextupoles or other coupling or nonlinear magnets are included in the lattice, the 
distortion due to a single or multiple kick depends on the orbit itself and self- 
consistent solutions can be obtained only by iterations. 

In cases where a single kick occurs at a symmetry point of a circular accelerator 
we expect the distorted orbit to also be symmetric about the kick. This is expressed 
in the asymmetric phase terms of both equations. Indeed, since (zx) — g(z) = 
Ag for x > z and ¢(z) — o(%) = Ag for z > ~% the orbit distortion extends 
symmetrically in either direction from the location of the kick. 


Fig. 15.3 Distorted orbit due Au/VB Au/NB 
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The solution for the perturbed equilibrium orbit is specially simple at the place 
where the kick occurs. With g(z) = o(z,) the orbit distortion is 


Uu,= 5 BO cotzv. (15.36) 


In situations where a short bending magnet like an orbit correction magnet and 
a beam position monitor are at the same place or at least close together we may 
use these devices to measure the betatron function at that place z, by measuring the 
tune v of the ring and the change in orbit uw, due to a kick 6. Equation (15.36) can 
then be solved for the betatron function f, at the location z. This procedure can 
obviously be applied in both planes to experimentally determine f, as well as By. 


15.2.5 Statistical Distribution of Dipole Field 
and Alignment Errors 


In a real circular accelerator a large number of field and misalignment errors of 
unknown location and magnitude must be expected. If the accelerator is functional 
we may measure the distorted orbit with the help of beam position monitors and 
apply an orbit correction as discussed later in this section. During the design stage, 
however, we need to know the sensitivity of the ring design to such errors in order to 
determine alignment tolerances and the degree of correction required. In the absence 
of detailed knowledge about errors we use statistical methods to determine the most 
probable equilibrium orbit. All magnets are designed, fabricated, and aligned within 
statistical tolerances, which are determined such that the distorted orbit allows the 
beam to stay within the vacuum pipe without loss. An expectation value for the orbit 
distortion can be derived by calculating the root-mean-square of (15.33) 


z+C 
80 = FOG $™ ruornieyVBaV VEO (15.37) 
sin? rv 
x cos [v (9, — Ye + 1)]| cos [v (g, — Y, + 1)] do drt, 


where for simplicity g, = g(z) etc. This double integral can be evaluated by 
expanding the cosine functions to separate the phases g, and g,. We get terms like 
COS VY, COS VG, and sin vg, sin vp, or mixed terms. All these terms tend to cancel 
except when o = T since both the perturbations and their locations are statistically 
distributed in phase. Only for o = t will we get quadratic terms that contribute to a 
finite expectation value for the orbit distortion 


(pi(t) [cos v(~, + 11) cos” vy, + sin? v(g, + 27) sin? Vor | ) 
= (pi(t))[cos” vy, + 2)(cos* vy;) + sin’ v(g, + x){sin* v¢;)] 
= (pi())3, 
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and with this (15.37) becomes 


B(z) 


+o 
8 sin* av 


(ug(2)) = YP) Baie?) . (15.38) 


where the integrals have been replaced by a single sum over all perturbing fields 
of length £;. This can be done since we assume that the betatron phase does not 
change much over the length of individual perturbations. Equation (15.38) gives 
the expectation value for the orbit distortion at the point z and since the errors are 
statistically distributed we get from the central limit theorem a Gaussian distribution 
of the orbit distortions with the standard deviation o?(z) = (ug(z)) from (15.38). 
In other words if an accelerator is constructed with tolerances (pi (ai) there is a 


68 % probability that the orbit distortions are of the order ,/ (ug (z)) as calculated 
from (15.38) and a 98 % probability that they are not more than twice that large. 
As an example, we consider a uniform beam transport line, where all quadrupoles 
have the same strength and the betatron functions are periodic like in a FODO 
channel. This example seems to be very special since hardly any practical beam 
line has these properties, but it is still a useful example and may be used to simulate 
more general beam lines for a quick estimate of alignment tolerances. Assuming a 
Gaussian distribution of quadrupole misalignments with a standard deviation oy, 
and quadrupole strength k, the perturbations are p; (z) = koa, and the expected 


orbit distortion is 
y (ui(2)) = VB@Acau. (15.39) 


where A is called the error amplification factor defined by 
NB 


i 
8 sin? xv f? 


A? = —S __((ee,)) = 


= 15.40 
8 sin? rv ( ) 


((klq)*B) is taken as the average value for the expression in all N misaligned 
quadrupoles, f is the focal length of the quadrupoles, and B the average betatron 
function. 

The expectation value for the maximum value of the orbit distortion (#(z)) is 
larger. In (15.38) we have averaged the trigonometric functions 


(cos? vo(r)) = (sin? vg(z)) = } 
and therefore 
(ii) = 2 (up(2)) - (15.41) 


These methods obviously require a large number of misalignments to become 
statistically accurate. While this is not always the case for shorter beam lines it is 
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still useful to perform such calculations. In cases where the statistical significance is 
really poor, one may use 100 or more sets of random perturbations and apply them 
to the beam line or ring lattice. This way a better evaluation of the distribution of 
possible perturbations is possible. 

Clearly, the tolerance requirements increase as the average value of betatron 
functions, the quadrupole focusing, or the size of the accelerator or number of 
magnets WN is increased. No finite orbit can be achieved if the tune is chosen to be 
an integer value. Most accelerators work at tunes which are about one quarter away 
from the next integer to maximize the trigonometric denominator | sinzv| ~ 1. 
From a practical standpoint we may wonder what compromise to aim for between 
a large aperture and tight tolerances. It is good practice to avoid perturbations 
as reasonable as possible and then, if necessary, enlarge the magnet aperture to 
accommodate distortions which are too difficult to avoid. As a practical measure 
it is possible to restrict the uncorrected orbit distortion in most cases to 5S-10mm 
and provide magnet apertures that will accommodate this. 

What happens if the expected orbit distortions are larger than the vacuum 
aperture which is virtually sure to happen at least during initial commissioning 
of more sensitive accelerators? In this case one relies on fluorescent screens or 
electronic monitoring devices located along the beam line, which are sensitive 
enough to detect even small beam intensities passing by only once. By empirically 
employing corrector magnets the beam can be guided from monitor to monitor thus 
establishing a path and eventually a closed orbit. Once all monitors receive a signal, 
more sophisticated and computerized orbit control mechanism may be employed. 


15.2.6 Dipole Field Errors in Insertion Devices 


Periodic magnet arrays like wiggler and undulator magnets are used often in 
synchrotron radiation sources to produce specific radiation characteristics. The 
requirement for such insertion devices is that the total deflection angle be zero as to 
not affect the closed orbit in the storage ring 


+00 
i Bidz=0. 


co 


In reality that is not possible because of manufacturing tolerances. A real 
trajectory through an undulator may look like shown in Fig. 15.4 [3]. 

From Fig. 15.4 it is obvious that a particle entering the undulator on axis will exit 
the magnet with a large distance from the axis and with a significant angle. Both 
will contribute to the orbit distortions. The problem here is that this orbit distortion 
is gap dependent and as the experimental user changes the gap the orbit changes 
all around the storage ring moving at the same time the source position for all other 
users. It is therefore imperative to correct this distortion before it can affect the orbit. 
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Fig. 15.4 Trajectory through 20 + + + ; + ; ; 1 
an undulator without any : ; ( 
special corrections [3] 
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There are two quantities which must be corrected, both being called the first and 
second integral 


h= [se =0 and (15.42) 


n= [ad [ Bar=0, 


Both integrals should be zero because J; is proportional to the exit angle and 
In proportional to the position at the undulator exit. Both errors should and can be 
corrected by a steering magnet before the entrance and right after the undulator exit. 
By adjusting the entrance steering magnet the exit angle can be varied and the exit 
beam displacement can be made to be zero. After adjusting the exit beam position 
to zero the angle still may be wrong which can be adjusted to zero with the exit 
steering magnet. With this correction the undulator effect on the orbit is eliminated. 
Unfortunately, the first and second integral can be in a permanent magnet device 
gap-dependent. Therefore, before using the undulator the steering corrections must 
be determined experimentally as a function of gap size. This information is stored 
in the control computer and as the gap size is changed by the user the computer will 
also change the steering field such that the orbit stays constant during change of the 
gap. This procedure is know as feed-forward. With this correction the undulkator 
has become a true insertion device from an accelerator physics point of view. 

We notice, however, in Fig. 15.4 that the oscillating trajectory within the 
undulator is not along a straight line. In the particular case of Fig. 15.4 the trajectory 
resembles an arc which can reduce the radiation characteristics especially for higher 
harmonics. This can be corrected by two long coils one each around the full array of 
poles. This coil can deflect the beam on a dipole trajectory such that it compensates 
the average curvature within the undulator. Of course if this is done then the 
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correction of the first and second integral must be repeated again. The long coil 
current is also gap dependent and for successful feed-forward three tables must be 
prepared for the computer control of undulator gap changes. 


15.2.7 Closed Orbit Correction 


Due to magnetic field and alignment errors a distorted equilibrium orbit is generated 
as discussed in the previous section. Specifically for distinct localized dipole field 
errors at position z, 


VBR B(z) 


2sin wv 


ug(z) = vb VB cos[vy(z) — vg, + v7]. (15.43) 


Since orbit distortions reduce the available aperture for betatron oscillations and 
can change other beam parameters it is customary in accelerator design to include 
a special set of magnets for the correction of distorted orbits. These orbit correction 
magnets produce orbit kicks and have, therefore, the same effect on the orbit as 
dipole errors. However, now the location and the strength of the kicks are known. 
Before we try to correct an orbit it must have been detected with the help of beam 
position monitors. The position of the beam in these monitors is the only direct 
information we have about the distorted orbit. From the set of measured orbit 
distortions u; at the m monitors i we form a vector 


Un = (Uy, U2, U3,...,Um) (15.44) 


and use the correctors to produce additional “orbit distortions” at the monitors 
through carefully selected kicks 6; in orbit correction magnets which are also called 
trim magnets. For n corrector magnets the change in the orbit at the monitor i is 


Au; = _vBi_ > Visi. [v(9; -O +7), (15.45) 


2sin av 


where the index k refers to the corrector at z = zz. The orbit changes at the beam 
position monitors due to the corrector kicks can be expressed in a matrix equation 


Au, = M@,, (15.46) 


where Au,, is the vector formed from the orbit changes at all m monitors, 6, the 
vector formed by all kicks in the n correction magnets, and M the response matrix 


M = (Mix) with 


Ga 


- cos [v(g; — @ + 7)] . (15.47) 
2sin wv 
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The distorted orbit can be corrected at least at the position monitors with corrector 
kicks 6 chosen such that Au,, = —u,, or 


6,=—-M"u,. (15.48) 


Obviously, this equation can be solved exactly if n = mand also for n > m if not all 
correctors are used. Additional conditions could be imposed in the latter case like 
minimizing the corrector strength. 

While an orbit correction according to (15.48) is possible it is not always the 
optimum way to do it. A perfectly corrected orbit at the monitors still leaves finite 
distortions between the monitors. To avoid large orbit distortions between monitors 
sufficiently many monitors and correctors must be distributed along the beam line. 
A more sophisticated orbit correction scheme would only try to minimize the sum 
of the squares of the orbit distortions at the monitors 

(u,, — Aun)Yinin = Cm — M8 n)inin> (15.49) 
thus avoiding extreme corrector settings due to an unnecessary requirement for 
perfect correction at monitor locations. 

This can be achieved for any number of monitors m and correctors n although the 
quality of the resulting orbit depends greatly on the actual number of correctors and 
monitors. To estimate the number of correctors and monitors needed we remember 
the similarity of dispersion function and orbit distortion. Both are derived from 
similar differential equations. The solution for the distorted orbit, therefore, can also 
be expressed by Fourier harmonics similar to (10.99). With F,, being the Fourier 
harmonics of —?/?(z) A(z), the distorted orbit is 


+00 


°F, ing 
wos) = VB) >> a (15.50) 


n=—oo 


which exhibits a resonance for v = n. The harmonic spectrum of the uncorrected 
orbit uo(z) has therefore also a strong harmonic content for n ~ v. To obtain an 
efficient orbit correction both the beam position monitor and corrector distribution 
around the accelerator must have a strong harmonic close to the tune v of the 
accelerator. It is, therefore, most efficient to distribute monitors and correctors 
uniformly with respect to the betatron phase y(z) rather than uniform with z and 
use at least about four units of each per betatron wave length. 

With sufficiently many correctors and monitors the orbit can be corrected in 
different ways. One could excite all correctors in such a way as to compensate 
individual harmonics in the distorted orbit as derived from beam position measure- 
ment. Another simple and efficient way is to look for the first corrector that most 
efficiently reduces the orbit errors then for the second most efficient and so on. This 
latter method is most efficient since the practicality of other methods can be greatly 
influenced by errors of the position measurements as well as field errors in the 
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Fig. 15.5 Orbit of Fig. 15.2 
before and after correction 


uncorrected orbit 


corrected orbit 


1.0 


correctors. The single most effective corrector method can be employed repeatedly 
to obtain an acceptable orbit. Of similar practical effectiveness is the method of 
beam bumps. Here, a set of three to four correctors are chosen and powered in such 
a way as to produce a beam bump compensating an orbit distortion in that area. This 
method is a local orbit correction scheme while the others are global schemes. 

As a practical example, we show the vertical orbit in the storage ring PEP 
before and after correction (Fig. 15.5) in normalized units. The orbit distortions are 
significantly reduced and the strong harmonic close to the betatron frequency has all 
but disappeared. Even in normalized units the orbit distortions have now a statistical 
appearance and further correction would require many more correctors. The peaks at 
the six interaction points of the lattice, indicated by dashed lines, are actually small 
orbit distortions and appear large only due to the normalization to a very small value 
of the betatron function f, at the interaction point. 


15.2.8 Response Matrix 


In the last section we found the relation of the beam position response at each 
position monitor (BPM) due to a change in any of the steering magnets in the 
circular accelerator. The matrix made up of these relations (15.47) is called the 
response matrix. The elements of this response matrix gives us an inside look at 
perturbations, calibration errors and alignment and field tolerances. Each element 
of the response matrix is defined by the movement of the beam at a particular 
beam position monitor due to a known change in a particular steering magnet. 
That response is made up of all fields encountered by the beam from the steerer to 
the BPM, all ideal bending, quadrupole and sextupole fields, but also all undesired 
fields originating from manufacturing tolerances, alignment errors, field errors, stray 
fields etc. Also included are calibration and alignment errors (offset) of BPMs 
and steering magnets. Therefore these response matrix elements contain all the 
information of perturbations which we would like to know. Unfortunately such 
errors are not spelled out in clear text but must be assumed. To find such errors 
one uses a computer program like LOCO [4, 5] which allows the user to choose 
a specific source of suspected errors. The program then uses such errors and tries 
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a fit to measured response matrix data. Usually most if not all errors are found to 
be corrected in a series of different approximations. Of course, the measurement of 
some 5,000 to 10,000 response matrix elements takes time. It is a very repetitive 
measurement which is best left to a computer. Such programs are part of the 
Accelerator Tool box (AT) [6] which includes many more routines to analyse and 
optimize electron storage rings. 


15.2.9 Orbit Correction with Single Value Decomposition 
(SVD) 


Space age developments have had their impact on accelerator physics too. Transmis- 
sion of pictures from space craft and communication in general push for methods to 
get the information with a minimum of data transfers. Mathematicians found a way 
to invert a big matrix and determine the most significant eigenvalues. In accelerator 
physics, we know from the last section that a series of approximations based on 
the most significant corrector leads to a greatly improved closed orbit. Now, with 
the new approach, which is called Singe Value Decomposition (SVD) , we get the 
desired result in one application. The method of SVD provides us with the matrix 
inversion (15.48) such that all steering corrections are listed in order of magnitude. 


Single Value Decomposition (SVD) 


Assume an x m matrix A like the one in (15.46). This matrix can be decomposed 
into three matrices such that 


A=uwy'. (15.51) 


Here, the columns of U/ are the eigenvectors of 4A‘ and V is made up of rows 
which are the eigenvectors of ATA. Finally, W is a diagonal matrix with elements 
being the “singular values” equal to the square root of the eigenvalues of both AA? 
and A’ A. The inverse A™! is 


Alt=vw'tu' (15.52) 
and the desired corrector strength are from (15.48) 
6,=—A"'u,. (15.53) 


where 6, is the strongest and @,, the weakest corrector. 
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Fig. 15.6 3-D graph of the horizontal response matrix elements before correction (/eft) and after 
applying SVD (right). The values are shown versus the number of steering magnets and the number 
of BPMs each close to 100 


More visually, we show the process in some graphs from the synchrotron light 
source PLS-II in Pohang, Korea.” The response matrix before correction is shown 
in Fig. 15.6 in a 3-D graph (lower left) where the two axis are the number of steering 
magnets and number of BPMs. 

There are somewhat regular oscillation in the values of the matrix elements 
visible, which are due to the repetition of the betatron oscillation in each super- 
periods. Applying SVD the inverse response matrix is of diagonal form and shown 
in Fig. 15.6 (lower right). The strength of the steering magnets to correct the closed 
orbit is finally shown in Fig. 15.7 where it is quite obvious that only a few correctors 
are very effective. Of the 96 corrector magnets installed in PLS-II only about 30 to 
50 are effective. All others do not contribute to orbit correction but rather fight each 
other and therefore should not be used. 

While it is not known which corrector magnets and BPMs are the most effective 
a sufficient number of both, about 6 per betatron oscillation, should be installed. 
Eventually, correction of the orbit defines the most effective correctors and BPMs, 
about 4 each per betatron oscillation. In the vertical plane the process is exactly the 
same with some variation of numbers due to the different tune. 


71 thank Dr. Sheungwan Shin, PLS-II, Pohang, Korea for providing the data and graphs. 
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Fig. 15.7 Strength of the corrector magnets ordered from most to least effective 


15.3. Quadrupole Field Perturbations 


The dipole perturbation terms cause a shift in the beam path or closed orbit without 
affecting the focusing properties of the beam line. The next higher perturbation 
terms which depend linearly on the transverse particle offset from the ideal orbit 
will affect focusing because these perturbations act just like quadrupoles. Linear 
perturbation terms are of the form 


w" + (ky + kj) u = Po (2) u, (15.54) 


where u stand for x and y, respectively. More quantitatively, these linear perturba- 
tions are from (15.6) and (15.7) 


P2x (2) = —A (ky + 2) x—mAxx+... 
(15.55) 
Pry (2) = +Akyy+mAxy+... 


As a general feature, we recognize the “feed down” from misalignments of higher 
order multipoles. A misaligned sextupole, for example, generates dipole as well as 
gradient fields. Any misaligned multipole produces fields in all lower orders. 

Quadrupole fields determine the betatron function as well as the phase advance or 
tune in a circular accelerator. We expect therefore that linear field errors will modify 
these parameters and we will derive the effect of gradient errors on lattice functions 
and tune. 
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15.3.1 Betatron Tune Shift 


Gradient field errors have a first order effect on the betatron phase and tune. 
Specifically in circular accelerators we have to be concerned about the tune not 
to deviate too much from stable values to avoid beam loss. The effect of a linear 
perturbation on the tune can be easily derived in matrix formulation for one single 
perturbation. For simplicity we choose a symmetry point in the lattice of a circular 
accelerator and insert on either side of this point a thin half-lens perturbation with 
the transformation matrix 


1 0 
= , 15.56 
ws (<i l om 
where f—! = —5 J p2(z) dz and p2(z) is the total perturbation. Combining this with 


the transformation of an ideal ring (8.74) with 6 = Bo,a@ = a = Oand Wo = 279 


C(z) S(z) cosWo Bo sin Yo 
Mo => => 
C'(2) S’(z) — JZ, SiN Wo cos Wo 
we get for the trace of the total transformation matrix M = M, Mo Mp 
= Bo .. 
TrM = 2cos Ve~ 2 sin Yo , (15.57) 


where fo is the unperturbed betatron function at the location of the perturbation and 
Wo = 27 Vo the unperturbed phase advance per turn. The trace of the perturbed ring 
is TrM = 2cos yp and we have therefore 


cos W = cos Wo — - sin Wo. (15.58) 


With w = 27v = 279 +276v and cos ¥ = cos Wo cos 27dv—sin Wo sin 27dv 
we get for small perturbations the tune shift 


6p = —— = -2 | ne dz. (15.59) 


For more than a single gradient error one would simply add the individual 
contribution from each error to find the total tune shift. The same result can be 
obtained from the perturbed equation of motion (15.54). To show this, we introduce 
normalized coordinates w = u/ /B andg = [ & and (15.54) becomes 


iv + vow = vgB"(2) pr(z) w. (15.60) 
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For simplicity, we drop the index , and recognize that all equations must 
be evaluated separately for x and y. Since both the betatron function 6(z) and 
perturbations p2(z) are periodic, we may Fourier expand the coefficient of vow on 
the r.h.s. and get for the lowest, non-oscillating harmonic 


1 20 ‘5 1 
Fo= 5 [Bp @ dp = =f Bloppale) ez. (15.61) 
T Jo 20 


Inserting this into (15.60) and collecting terms linear in w we get 
iv + (vj — voFo)w = 0 (15.62) 


and the new tune v = vo + dv is determined by 


2 


v= Vo — vo Fo © be + 2vo9dv. (15.63) 


Solving for dv gives the linear tune perturbation 


1 
bv = LF) =~ BOpal) (15.64) 


in complete agreement with the result obtained in (15.59). The tune shift produced 
by a linear perturbation has great diagnostic importance. By varying the strength of 
an individual quadrupole and measuring the tune shift it is possible to derive the 
value of the betatron function in this quadrupole. 

The effect of linear perturbations contributes in first approximation only to a 
static tune shift. In higher approximation, however, we note specific effects which 
can affect beam stability and therefore must be addressed in more detail. To derive 
these effects we solve (15.60) with the help of a Green’s function as discussed in 
Sect. 5.5.4 and obtain the perturbation 


Q 
P(e) = / voB?(Dpa(xw(a) sin [vo(y — 20] dx, (15.65) 


where we have made use of the principal solutions. We select a particular, 
unperturbed trajectory, w(y) = wo cos (vx) with wo = O and get the perturbed 
particle trajectory 


g 
w(Vg) = wo Cos (voy) + wore f B’p2 cos (vox) sin[vo(y — x] dx, (15.66) 


where 6 = B(x) and po = p(x). If, on the other hand, we consider the 
perturbations to be a part of the lattice, we would describe the same trajectory by 


w(~) = wocos vg. (15.67) 
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Both solutions must be equal. Specifically the phase advance per turn must be 
the same and we get from (15.66), (15.67) after one turn g¢ = 27 for the perturbed 
tune v = v9 + bv 


2n 
cos 27 (vo + bv) = cos 2mv9 + Vo B°(¢)p2(@) cos(vog) sin [vo(2 — g)] dy, 
0 


(15.68) 
which can be solved for the tune shift dv. Obviously the approximation breaks down 
for large values of the perturbation as soon as the r.h.s. becomes larger than unity. 
For small perturbations, however, we expand the trigonometric functions and get 


1 
év= - p B(2)p2(z) dz (15.69) 


1 
— p B(@)pr(2) sin {2v9 [x — p@)]} dz. 
0 


Ar sin 27 v 


The first term is the average tune shift which has been derived before, while the 
second term is of oscillatory nature averaging to zero over many turns if the tune 
of the circular accelerator is not equal to a half integer or multiples thereof. We 
have found hereby a second resonance condition to be avoided which occurs for 
half integer values of the tunes 


vo # 5n. (15.70) 
This resonance is called a half integer resonance and causes divergent solutions 
for the lattice functions. 


15.3.2 Optics Perturbation Due to Insertion Devices 


The use of insertion devices in synchrotron light sources can introduce significant 
focusing perturbations. Undulators and wiggler magnets are a series of dipole 
magnets with end effects causing vertical focusing. This focusing will perturb the 
periodic betatron function in the storage ring and with it all correction that have 
been made. Because the perturbation scales like the square of the magnet field it is 
for most undulators too small to be significant. However, wiggler magnet may cause 
some problems. Like for orbit correction we do not like to spread the correction of 
the betatron functions all around the ring. To localize the correction to the vicinity of 
the insertion device we consider only the closest lattice quadrupoles on either side of 
the wiggler magnet which are not yet beyond the next insertion device. To minimize 
the number of quadrupoles needed for correction we start away from the wiggler 
where we expect the betatron functions to stay unperturbed, say in the middle of the 
next long straight section. Starting from there we adjust quadrupoles such that in the 


15.3 Quadrupole Field Perturbations 503 


middle of the wiggler magnet a, = 0 and 7’ = 0. That requires three quadrupoles 
for matching. If we match from the middle of the wiggler magnet to the middle of the 
next straight section we would need six quadrupoles to match f,,, oy, and 7’. For 
optimum localization of the perturbation one should use the three quadrupole closest 
to the wiggler magnet. This correction does not take care of the perturbations in the 
betatron phase. If two more quadrupoles are available one could try to use them for 
tune correction. This, however, is not always possible and one might therefore use 
two quadrupole families for the whole ring to readjust the tunes to the original value. 
Because there usually are many superperiods, the effect of a small tune correction 
is distributed around the ring and causes little variation in the beatron functions. 
The corrections in the quadrupoles are again for a permanent magnet wiggler gap- 
dependent and must be determined before general use to establish feed-forward of 
the computer control of the wiggler. 


15.3.3 Resonances and Stop Band Width 


Calculating the tune shift from (15.68) we noticed that there is no solution if the 
perturbation is too large such that the absolute value of the r.h.s. becomes larger than 
unity. In this case the tune becomes imaginary leading to ever increasing betatron 
oscillation amplitudes and beam loss. This resonance condition occurs not only at 
a half integer resonance but also in a finite vicinity. The region of instability is 
called the stop band and the width of unstable tune values is called the stop band 
width which can be calculated by using a higher approximation for the perturbed 
solution. Following the arguments of Courant and Snyder [1] we note that the 
perturbation (15.65) depends on the betatron oscillation w(g) itself and we now 
use in the perturbation integral the first order approximation (15.66) rather than the 
unperturbed solution to calculate the perturbation (15.65). Then instead of (15.68) 
we get with 


vob*(¢) px¢) = g (Y) (15.71) 


2n 
cos 27 (vo + dv) — cos 27V9 = + / g(~) cos(vog) sin vo(2 — y) dy 
0 
2n 
+ wo f g(X) sin vo(27 — x) (15.72) 
0 


Xx 
x / Crees Hci Hdl dy, 
0 


This expression can be used to calculate the stop band width due to gradient 
field errors which we will do for the integer resonance vg = n+ dv and for the half 
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integer resonance vo = n+1/2+6v where n is an integer and dv the deviation of the 
tune from these resonances. To evaluate the first integral /;, on the r.h.s. of (15.72) 
we make use of the relation 


cos (Vo@) sin [vo(2 — ~)] = 5 sin (27 v9) + 5 sin[2vo(a — @)] (15.73) 


and get with dz = vofdg and 


Qn 
$ BE) prt) d= / e(¢) dp = 2nFp (15.74) 


from (15.62) for the first integral 


2n 
I, = wFosin (27 v9) + sf g(~) sin [2vo(x — g)] dg. (15.75) 
0 


The second term of the integral J; has oscillatory character averaging to zero over 
many turns and with é < 1 


2n*Fy dv forvo =n+dv 
Tl, =aFosin2mv9 & : (15.76) 
—2n°Fy dv forvo =n+44+dv 


The second integral in (15.72) can best be evaluated while expressing 
the trigonometric functions in their exponential form. Terms like et!’@7~2” or 
e+iv@7—2s) vanish on average over many turns. With 


2n ¥ 2n 2n 
F(pay / reoat=1f se~arf sou 
0 0 0 0 


we get for the second integral 


2n 20 
ie -2 2x) / 3(6) (15.77) 
0 0 


s {(e2nv ae e270) = [ei2v0(7 A+) +e i2v9(x—y44 ah dé dy. 


Close to the integer resonance vo = n + dv and 


Vo 20 20 
bn= -7 f eco [ g() (15.78) 
0 0 


ss {(e2xdv $e i2m8vy _ feidnb—2) 4 e Pnb—0]} dédy 
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while in the vicinity of the half integer resonance vg = n + 5 + dv 


20 Qn 

Vo i2ndv —i2mdv 

Ll == / eco [ (6) {—(e?" + e278) 
° 16 Jo 0 


+ io 3)(-”) Ave int 5) a) dédy. 


The integralscan now be expressed in terms of Fourier harmonics of 
voB* (~) pi(~), where the amplitudes of the harmonics F, with integer gq > 0 
are given by 


Vo 20 


Qn 
; = oe e(ye i dx | g(ljel% de. (15.79) 
IU 0 0 


For Fo we have from the Fourier transform the result 
Fo = (g()) = vo (B?(~)p2(¢)) (15.80) 
and we get for (15.78) with this and ignoring terms quadratic in dv 
Inn © $0” (F5, — 4 FG cos 275) (15.81) 
and for (9.57) 
Lyng % — 3H” (Fong — 4 Fo 008 278) . (15.82) 
At this point we may collect the results and get on the L.h.s. of (15.72) for vo = n+édv 
cos 2m (vo + 6v) — cos 2m v9 = cos 2m(vp + Sv) — 1 +207 bv?. 


This must be equated with the r.h.s. which is the sum of integrals 7; and J, and with 
F> cos2n6v = O(5*v) 


cos 2n(vo + bv) — 1 = —207Sv? + 207 Fy dv + am (F5, - 4F5) : (15.83) 
The boundaries of the stop band on either side of the integer resonance vo ~ 
n can be calculated from the condition that cos27(vp + dv) < 1 which has two 
solutions 6v;,2. From (15.83) we get therefore 
bv? — Fo dv = &(|Fon |? — 4F 5) 


and solving for dv 


Sui = $Fo + 4\Fan | (15.84) 
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the stop band width is finally 
1 
Av = $v —8v2 = 3 \Fon| = oa f B(2) pope 2 dz. (15.85) 
4 


The stop band width close to the integer tune v ~ n is determined by the second 
harmonic of the Fourier spectrum for the perturbation. The vicinity of the resonance 
for which no stable betatron oscillations exist increases with the value of the gradient 
field error and with the value of the betatron function at the location of the field error. 
For the half integer resonance vo © n + 5, the stop band width has a similar form 


2n 
Avi = 5|Fantil = es Pong ad, (15.86) 
20 0 

The lowest order Fourier harmonic n = 0 determines the static tune shift while 
the resonance width depends on higher harmonics. The existence of finite stop 
bands is not restricted to linear perturbation terms only. Nonlinear, higher order 
perturbation terms lead to higher order resonances and associated stop bands. In 
such cases one would replace in (15.60) the linear perturbation 6 2 p2(z) w by the 
nth order nonlinear perturbation B”/?p,,(z) w"! and basically go through the same 
derivation. Later in this chapter, we will use a different way to describe resonance 
characteristics caused by higher order perturbations. At this point we note only that 
perturbations of order n are weighted by the n/2 power of the betatron function at the 
location of the perturbation and increased care must be exercised, where large values 
of the betatron functions cannot be avoided. Undesired fields at such locations must 

be minimized. 


15.3.4 Perturbation of Betatron Function 


The existence of linear perturbation terms causes not only the tunes but also betatron 
functions to vary around the ring or along a beam line. This variation, also called 
beta-beat can be derived by observing the perturbation of a particular trajectory like 
for example the sine-like solution given by 


So(zolz) = VB(2) Vv Bo sin volp(z) — go] - (15.87) 


The sine-like function or trajectory in the presence of linear perturbation terms 
is by the principle of linear superposition the combination of the unperturbed 
solution (8.74) and perturbation (5.75) 


S(zolz) = VB] v Bo sin voy(2) 
+ VB@ i * pal) VBOSolzolé) sin volo) — o(C)] de. 
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Following the sinusoidal trajectory for the whole ring circumference or length 
of a superperiod L,, we have with z = zo + Lp, B(zo + Lp) = B(Zo) = fo and 
pz + Ly) = 2% + Go 


i 
S(zolzo + £) = fo sit 2nvo + Bo f A(t) pill) (15.88) 


20 


x sin vo[e(S) — Go] sin [vo (2% + Yo — G(C))] de. 


The difference due to the perturbation from the unperturbed trajectory (15.87) at 
Z= 2+ Lp is 


AS = S(Zo|Zo0 + Lp) — So(Zo|z0 + Lp) (15.89) 


zotLp 
= i (OpO aie. oh airs ale: 


<0 


where we abbreviated ~(zo) = @o etc. 
The variation of the sine like function can be derived also from the variation of 
the M2 element of the transformation matrix for the whole ring 


AS = A(f sin2zv) = AB sin2rv9 + Bo 27Avcos27V9. (15.90) 


We use (15.64) for the tune shift dv = -1F, equate (15.90) with (15.89) 
and solve for AB/B. After some manipulations, where we replace temporarily 
the trigonometric functions by their exponential expressions, the variation of the 
betatron function becomes at ¢(z) 


AB(z) _ 1 
B(z) 2 sin2av, 


f BOE) Prl6) cos [29 (le) — g(t) + mM) dé. (15.91) 


The perturbation of the betatron function shows clearly resonance character 
and a half integer tune must be avoided. We observe a close similarity with the 
solution (10.91) of the dispersion function or the closed orbit (15.28). Setting 
dé = voB(f)dgy, we find by comparison that the solution for the perturbed 
betatron function can be derived from a differential equation similar to a modified 
equation (10.88) 


cae 


ee +) + (2v9)? 


AB 


a (2v9)?5B?(z) pr(z) - (15.92) 


Expanding the periodic function vpb?p. = >> gra e'? we try the periodic ansatz 


A . 
= = > BF, eg? (15.93) 
q 
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and get from (15.92) 
\- [-@ + Qv0)?] ByFy ei” = 219) Fy e' . 
q q 


This can be true for all values of the phase g only if the coefficients of the 
exponential functions vanish separately for each value of q or if 


2Vv0 


Be = SS SS 15.94 
4 QnyP-@ meen 


Inserting into the periodic ansatz (15.93) the perturbation of the betatron function 
in another form is 


A v F,e4? 
raw >=. (15.95) 
vq — (q/2) 
Again we recognize the half inter resonance leading to an infinitely large 
perturbation of the betatron function. In the vicinity of the half integer resonance 
Vo n+ 5 = q/2 the betatron function can be expressed by the resonant term only 


Ap Lp ees + lg 
a) n+1|—_7,.._qv_ 
B pee Vo— (n + 7) 


and with |Fo,41| = 2Av 1 from (15.86) we get again the perturbation of the betatron 
function (15.91). The beat factor for the variation of the betatron function is define 
by 


of) Avon+1 
BF=1+{— = 1+ ————_.,, 15.96 
( Bo max 2Vv9 — (2n F 1) ‘ 


where Av>,4; is the half integer stop band width. The beating of the betatron 
function is proportional to the stop band width and therefore depends greatly on the 
value of the betatron function at the location of the perturbation. Even if the tune is 
chosen safely away from the next resonance, a linear perturbation at a large betatron 
function may still cause an unacceptable beat factor. It is generally prudent to design 
lattices in such a way as to avoid large values of the betatron functions. As a practical 
note, any value of the betatron function which is significantly larger than the 
quadrupole distances should be considered large. For many beam transport problems 
this is easier said than done. Therefore, where large betatron functions cannot be 
avoided or must be included to meet our design goals, results of perturbation theory 
warn us to apply special care for beam line component design, alignment and to 
minimize undesirable stray fields. 
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15.4 Chromatic Effects in a Circular Accelerator 


Energy independent perturbations as discussed in previous sections can have a 
profound impact on the stability of particle beams in the form of perturbations of the 
betatron function or through resonances. Any beam transport line must be designed 
and optimized with these effects in mind since it is impossible to fabricate ideal 
magnets and align them perfectly. Although such field and alignment errors can 
have a destructive effect on a beam, it is the detailed understanding of these effects 
that allow us to minimize or even avoid such problems by a careful design within 
proven technology. 

To complete the study of perturbations, we note that a realistic particle beam is 
never quite mono-energetic and includes a finite distribution of particle energies. 
Bending as well as focusing is altered if the particle momentum is not the 
ideal momentum. We already derived the momentum dependent reference path in 
transport lines involving bending magnets. Beyond this basic momentum dependent 
effect we observe other chromatic aberrations which contribute in a significant way 
to the perturbations of lattice functions. The effect of chromatic aberrations due to 
a momentum error is the same as that of a corresponding magnet field error and for 
beam stability we must include chromatic aberrations. 


15.4.1 Chromaticity 


Perturbations of beam dynamics can occur in beam transport systems even in the 
absence of magnet field and alignment errors. Deviations of particle energies from 
the ideal design energy cause perturbations in the solutions of the equations of 
motion. We have already derived the variation of the equilibrium orbit for different 
energies. Energy related or chromatic effects can be derived also for other lattice 
functions. Focusing errors due to an energy error cause such particles to be imaged 
at different focal points causing a blur of the beam spot. In a beam transport system, 
where the final beam spot size is of great importance as, for example, at the collision 
point of linear colliders, such a blur causes a severe degradation of the attainable 
luminosity. In circular accelerators we have no such direct imaging task but note 
that the tune of the accelerator is determined by the overall focusing and tune errors 
occur when the focusing system is in error. 

In this chapter we will specifically discuss effects of energy errors on tunes of a 
circular accelerator and means to compensate for such chromatic aberrations . The 
basic means of correction are applicable to either circular or open beam transport 
systems if, for the latter case, we only replace the tune by the phase advance of 
the transport line in units of 27. The control of these chromatic effects in circular 
accelerators is important for two reasons, to avoid loss of particles due to tune shifts 
into resonances and to prevent beam loss due to an instability, which we call the 
head tail instability to be discussed in more detail in Sect. 22.5. 
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The lowest order chromatic perturbation is caused by the variation of the focal 
length of the quadrupoles with energy (Fig. 15.8). This kind of error is well known 
from light optics, where a correction of this chromatic aberration can at least 
partially be obtained by the use of different kinds of glasses for the lenses in a 
composite focusing system. 

In particle beam optics no equivalent approach is possible. To still correct for 
the chromatic perturbations we remember that particles with different energies 
can be separated by the introduction of a dispersion function. Once the particles 
are separated by energy we apply different focusing corrections depending on the 
energy of the particles. Higher energy particles are focused less than ideal energy 
particles and lower energy particles are over focused. For a correction of these 
focusing errors we need a magnet which is focusing for higher energy particles and 
defocusing for lower energy particles (Fig. 15.9). A sextupole has just that property. 

The variation of tunes with energy is called the chromaticity and is defined by 


Av 


= Ap/po 


(15.97) 


The chromaticity derives from second and higher order perturbations in (x, y, 4) 
and the relevant equations of motion are from 


X"+kx= kxd—Sm(x? —y’), 


LS: 
y’ —kx = —kyé + mxy. Men 
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Setting x = xg + 7,6 and y = yg, assuming that 7, = 0, we retain only betatron 
oscillation terms involving xg or yg to derive chromatic tune shifts. In doing so we 
note three types of chromatic perturbation terms, those depending on the betatron 
motion only, those depending on the momentum error only, and terms depending on 
both. With these expansions (15.98) becomes 


xB +kxp = — kxgd — mnxxpd — mx; — yp) + O(3), 


(15.99) 
Y¥g — kyp = —kypd + mnsygd + mxpyp + O(3). 


We ignore for the time being non chromatic terms of second order which will be 
discussed later as geometric aberrations and get 


xB +kxgp= (k—mny) xpd, 


(15.100) 
yg —kyp = —(k—mny) yp6. 

The perturbation terms now are linear in the betatron amplitude and therefore 
have the character of a gradient error. From Sect. 15.3 we know that these types of 
errors lead to a tune shift which by comparison with (15.64) becomes in terms of a 
phase shift 


Avs 
Ay 


II 


—35 f By(k — mny) dz, 
2 
4186 By(k— my) dz. (15.101) 


II 


Equations (15.101) are applicable for both circular and open beam lines. Using 
the definition of the chromaticity for circular accelerators we have finally 


&, = -i f Bx(k = myx) dz, 
fy = 4 Ug By(k—mn,) dz. ane 


Similar to the definition of tunes the chromaticities are also an integral property 
of the circular accelerator lattice. Setting the sextupole strength m to zero one gets 
the natural chromaticities determined by focusing terms only 


Exo = -z~ f Byk dz ; 


(15.103) 
50 — +i f Byk dz . 


The natural chromaticities are always negative which is to be expected since 
focusing is less effective for higher energy particles (6 > 0) and therefore the 
number of betatron oscillations is reduced. 

For a thin lens symmetric FODO lattice the calculation of the chromaticity 
becomes very simple. With the betatron function B* at the center of a focusing 
quadrupole of strength kt = k and B~ at the defocusing quadrupole of strength 
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k~ =k, the chromaticity of one FODO half cell is 


fo = -z (6° [tase fi az) =F fra. (15.104) 


With B* (10.3) and B~ (10.5) and f kdz = 1/f = 1/(kL), where x is the FODO 
strength parameter and L the length of a FODO half cell, we get the chromaticity 
per FODO half-cell in a more practical formulation 


1 1 1 1 
&o0 = — = ——tan| ~-v,)., (15.105) 
= 14 a 2 


20 K2 


where wy, is the horizontal betatron phase for the full FODO cell. The same result 
can be obtained for the vertical plane. 

The natural chromaticity for each 90° FODO cell is therefore equal to 1/z. 
Although this value is rather small, the total chromaticity for the complete lattice 
of a storage ring or synchrotron, made up of many FODO cells, can become quite 
large. For the stability of a particle beam and the integrity of the imaging process by 
quadrupole focusing it is important that the natural chromaticity be corrected. 

It is interesting at this point to discuss for a moment the chromatic effect if, for 
example, all bending magnets have a systematic field error with respect to other 
magnets. In an open beam transport line the beam would follow an off momentum 
path as determined by the difference of the beam energy and the bending magnet 
“energy”. Any chromatic aberration from quadrupoles as well as sextupoles would 
occur just as discussed. 

In a circular accelerator the effect of systematic field errors might be different. 
We consider, for example, the case where we systematically change the strength 
of all bending magnets. In an electron storage ring, the particle beam would 
automatically stay at the ideal design orbit with the particle energy being defined by 
the strength of the bending magnets. The strength of the quadrupoles and sextupole 
magnets, however, would now be systematically too high or too low with respect 
to the bending magnet field and particle energy. Quadrupoles introduce therefore a 
chromatic tune shift proportional to the natural chromaticity while the sextupoles 
are ineffective because the beam orbit leads through magnet centers. Changing the 
strength of the bending magnets by a fraction A in an electron circular accelerator 
and measuring the tune change Av one can determine experimentally the natural 
chromaticity (5 = —Av/A) of the ring. In Fig. 15.10 the measurement of the tunes 
as a function of the bending magnet current is shown for the storage ring SPEAR. 
From the slope of the graphs we derive the natural chromaticities of the SPEAR 
storage ring as € = —11.4 and &, = —11.7. 

In a proton accelerator the beam energy must be changed through acceleration 
or deceleration together with a change of the bending magnet strength to keep the 
beam on the reference orbit before this measurement can be performed. 
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15.4.2 Chromaticity Correction 


Equations (15.102) clearly suggest the usefulness of sextupole magnets for chro- 
matic correction. Sextupoles must be placed along the orbit of a circular accelerator 
or along a beam transport line at locations, where the dispersion function does not 
vanish, 7, # 0. A single sextupole is sufficient, in principle, to correct the chro- 
maticity for the whole ring or transport line but its strength may exceed technical 
limits or cause problems of geometric aberrations to the beam stability. This is 
due to the nonlinear nature of sextupole fields which causes dynamic instability for 
large amplitudes for which the sextupole field is no more a perturbation. The largest 
betatron oscillation amplitude which is still stable in the presence of nonlinear fields 
is called the dynamic aperture. To maximize the dynamic aperture it is prudent to 
distribute many chromaticity correcting sextupoles along the beam line or circular 
accelerator. 

To correct both the horizontal and the vertical chromaticity two different groups 
of sextupoles are required. For a moment we assume that there be only two 
sextupoles. To calculate the required strength of these sextupoles for chromaticity 
correction we use thin lens approximation and replacing integrals in (15.102) by a 
sum the corrected chromaticities are 


&. = Eo + (Mm Nx Bxt + m212Bx2) es =0 ; 


7 a. 7 (15.106) 
§ = 0 + an (mi Nx1 By + mo Nx2 By2)es =0. 


Here we assume that two different sextupoles, each of length ¢,, are available at 
locations z; and z2. Solving for the sextupole strengths we get from (15.106) 


4m Exo By2 — §y0 Bx2 
4x1 By Byo — By2By1 , 
Am &yo By1 — &o Bx 
x2 Bu By2 _ B2Byi , 


mt .= — 


(15.107a) 


ml.= — 


(15.107b) 
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It is obvious that the dispersion function at sextupoles should be large to 
minimize sextupoles strength. It is also clear that the betatron functions must be 
different preferably with 6B, >> fA, at the m; sextupole and B, < fy at the 
my sextupole to avoid “fighting” between sextupoles leading to excessive strength 
requirements. 

In general this approach based on only two sextupoles in a ring to correct chro- 
maticities leads to very strong sextupoles causing both magnetic design problems 
and strong higher order aberrations. A more gentle correction uses two groups 
or families of sextupoles with individual magnets distributed more evenly around 
the circular accelerator and the total required sextupole strength is spread over 
all sextupoles. In cases of severe aberrations, as discussed later, we will need 
to subdivide all sextupoles into more than two families for a more sophisticated 
correction of chromaticities. Instead of (15.106) we write for the general case of 
chromaticity correction 


&. = £0 + aa Yo, MiNi BPrilsi 


(15.108) 
é, = 0 + aa ; mMiNxiByi Cs ; 
where the sum is taken over all sextupoles. In the case of a two family correction 
scheme we still can solve for m; and m2 by grouping the terms into two sums. 

The chromaticity of a circular accelerator as defined in this section obviously 
does not take care of all chromatic perturbations. Since the function (k — mn,) 
in (15.100) is periodic, we can Fourier analyze it and note that the chromaticity only 
describes the effect of the non-oscillating lowest order Fourier component (15.103). 
All higher order components are treated as chromatic aberrations. In Sect. 17.2 we 
will discuss in more detail such higher order chromatic and geometric aberrations. 


15.4.3 Chromaticity in Higher Approximation 


So far we have used only quadrupole and sextupole fields to define and calculate 
the chromaticity. From the general equations of motion we know, however, that 
many more perturbation terms act just like sextupoles and therefore cannot be 
omitted without further discussion. To derive the relevant equations of motion 
from (6.95), (6.96) we set x = xg + yd and y = yg + yd where we keep 
for generality the symmetry between vertical and horizontal plane. Neglecting, 
however, coupling terms we get with perturbations quadratic in (x, y, 5) but at most 
linear in 6 and after separating the dispersion function a differential equation of the 
form (u = xg or yg) 


wy + Kug = —AK upd — ALus 6, (15.109) 
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where 

K, = +k, (15.110) 
Ky =«—k, (15.111) 
—AK, = 2x? +k — (m+ 2K? + 4k,k) ny (15.112) 

—(m + 2kk + 2kyk)ny + KM — Ky, 
—AKy = 2ky —k + (m — 2kyk + 2k,k) nx (15.113) 

+(m — 2k} + 4kyk)ny — Kin, + KN, 
—AL, = —ALy = +n + my + Kal, + kyr, (15.114) 


d 
+ gy hate + KyNy) . 
Z 


The perturbation terms (15.109) depend on the betatron oscillation amplitude as 
well as on the slope of the betatron motion. If by some manipulation we succeed in 
transforming (15.109) into an equation with terms proportional only to u we obtain 
immediately the chromaticity. We try a transformation of the form u = vu f(z) where 
f(z) is a still to be determined function of z. With uv’ = v'f + vf’ andu” = v"f + 
2u'f’ + uf” (15.109) becomes 


vf + 2u'f' + uf” + Kuf + AKuf 5 + ALv'f 5 + ALuf'5 = 0. (15.115) 


Now we introduce a condition defining the function f such that in (15.115) the 
coefficients of v’ vanish. This occurs if 


2f' = —ALfs. (15.116) 
To first order in 6 this equation can be solved by 
f = 14 48(kxne + Kyny) (15.117) 
and (15.115) becomes 
vo” +[K+(f" + 6AK)|v =0. (15.118) 


The chromaticity in this case is § = x g(e + AK)6dz, which becomes with 


2° = © (kms + Kyty) and (15.112) 


1 sel 
oe (5 is Ak; pss (15.119) 
1 d2 
~ An 2 gg (kath + KyMy) Bx dz 
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1 
— 5 P Br [x2 + &) + een — Hn, 
—(m + 2K} + 4k pk) nx—(m + 2k ck + 2kyk) ny] dz. 


The first integral can be integrated twice by parts to give 6 S (keene + Kyny) B''dz. 
Using 5 B” = yy — KB, and (15.119) the horizontal chromaticity is finally 


1 
& = ae p [—(kK+2«2) — Kin, — KN, (15.120) 
+(m + «2 + 3k) ny + (m + 2kk + Kyk) ny | B, dz 


1 
a ees Ps Nx + KyNy) Vx dz . 
IT 


A similar expression can be derived for the vertical chromaticity 


1 
= p [(-2K2 + k) + Kin, — Kh, (15.121) 


—(m + 2kyk + Kyk) Nx — (m-K; a 3kyk) ny] By dz 
1 
-o— Pim + KyNy)Vy dz. 
4n 


In deriving the chromaticity we used the usual curvilinear coordinate system for 
which the sector magnet is the natural bending magnet. For rectangular or wedge 
magnets the chromaticity must be determined from (15.121) by taking the edge 
focusing into account. Generally, this is done by applying a delta function focusing 
at the edges of dipole magnets with a focal length of 


1 1 
= “ane f 5(sae) dz. (15.122) 
frp 


Similarly, we proceed with all other terms which include focusing. 

The chromaticity can be determined experimentally simply by measuring the 
tunes for a beam circulating with a momentum slightly different from the lattice 
reference momentum. In an electron ring, this is generally not possible since any 
momentum deviation of the beam is automatically corrected by radiation damping 
within a short time. To sustain an electron beam at a momentum different from the 
reference energy, we must change the frequency of the accelerating cavity. Due 
to the mechanics of phase focusing, a particle beam follows such an orbit that 
the particle revolution time in the ring is an integer multiple of the rf-oscillation 
period in the accelerating cavity. By proper adjustment of the rf-frequency the beam 
orbit is centered along the ideal orbit and the beam momentum is equal to the ideal 
momentum as determined by the actual magnetic fields. 
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If the rf-frequency is raised, for example, the oscillation period becomes shorter 
and the revolution time for the beam must become shorter too. This is accomplished 
only if the beam momentum is changed in such a way that the particles now follow a 
new orbit that is shorter than the ideal reference orbit. Such orbits exist for particles 
with momenta less than the reference momentum. The relation between revolution 
time and momentum deviation is a lattice property expressed by the momentum 
compaction which we write now in the form 


Afet 
St * Co 


A 
a. (15.123) 


Through the knowledge of the lattice and momentum compaction we can relate a 
relative change in the rf-frequency to a change in the beam momentum. Measure- 
ment of the tune change due to the momentum change determines immediately the 
chromaticity. 


15.4.4 Non-linear Chromaticity* 


The chromaticity of a circular accelerator is defined as the linear rate of change 
of the tunes with the relative energy deviation 5. With the increased amount of 
focusing that is being applied in modern circular accelerators, especially in storage 
rings, to obtain specific particle beam properties like very high energies in large 
rings or a small emittance, the linear chromaticity term is no longer sufficient to 
describe the chromatic dynamics of particle motion. Quadratic and cubic terms in 6 
must be considered to avoid severe stability problems for particles with energy error. 
Correcting the chromaticity with only two families of sextupoles we could indeed 
correct the linear chromaticity but the nonlinear chromaticity may be too severe to 
permit stable beam operation. 

We derive the nonlinear chromaticity from the equation of motion expressed in 
normalized coordinates and including up to third-order chromatic focusing terms 


Wy + voow = voob7p2(~) w = (a5 + BS? + c53)w, (15.124) 


where the coefficients a,b,c are perturbation functions up to third order in 6 and 
linear in the amplitude w, and where voo is the unperturbed tune. From (6.95) 
and (6.96) these perturbations are 


a = Vo? [(k + 2K2) + mn +...], (15.125) 
b = vio? [—(k + 2«2) — mn + ...], (15.126) 
C= VooB* [(k + 2«7) + mn +...]. (15.127) 
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This equation defines nonlinear terms for the chromaticity which have been 
solved for the quadratic term [7] and for the cubic term [8, 9]. While second 
and third-order terms become significant in modern circular accelerators, higher- 
order terms can be recognized by numerical particle tracking but are generally 
insignificant. 

Since the perturbations on the rh.s. of (15.124) are periodic for a circular 
accelerator we may Fourier expand the coefficients 


a(¢) = ag + > an ei”? , 


n#0 
b(g) =bo+ on ace (15.128) 
c(g) =cot DY cnei”?. 

n#0 


From the lowest-order harmonics of the perturbations we get immediately the first 
approximation of nonlinear chromaticities 


v2 = v2, — 8 (ao + bod + c08?) (15.129) 


or 


20 
vp = Yoo (1 - Bf Pilg) aw) : (15.130) 


With this definition we reduce the equation of motion (15.124) to 


W + Voow = ve p76 (= 2a, cosngy + 6 om 2b, cosng + 8° Ly 2cn cosng | w 
n>0 n>0 n>0 


(15.131) 


The remaining perturbation terms on the r.h.s. look oscillatory and therefore seem 
not to contribute to an energy dependent tune shift. In higher-order approximation, 
however, we find indeed resonant terms which do not vanish but contribute to 
a systematic tune shift. Such higher-order tune shifts cannot be ignored in all 
cases and therefore an analytical expression for this chromatic tune shift will be 
derived. To solve the differential equation (15.131), we consider the r.h.s. as a small 
perturbation with 5 serving as the smallness parameter. Mathematical methods for 
a solution have been developed and are based on a power series in 6. We apply this 
method to both the cosine and sine like principal solution and try the ansatz 


Cg) =>) Cry) 8" and S(y) = 9S, (y) 8” (15.132) 


n>0 n=0 


Concentrating first on the cosine like solution we insert (15.132) into (15.131) and 
sort for same powers in 6 noting that each term must vanish separately to make the 
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ansatz valid for all values of 5. The result is a set of differential equations for the 
individual solution terms 


Co+ veCy = 0, 
Ci + v9C1 = voB°(y)p2(Y) Co, 

esas (15.133) 
Cr + v6Cn = v6B? (Y)p2(Y) Cnt, 


where derivatives C; are taken with respect to the phase ¢, e.g. C; = 0?C;/d9?. 
These are defining equations for the functions Co, C),---C, with C; = C;(y) and 
each function depending on a lower-order solution. The lowest-order solutions are 
the principal solutions of the unperturbed motion 


1 
Co(y) = cos vop and So(g) = SiN Vog . (15.134) 
0 


The differential equations (15.133) can be solved with the Green’s Function method 
which we have applied earlier to deal with perturbation terms. All successive 
solutions can now be derived from the unperturbed solutions through 


Citi) — oT i B(S)p2(S) sin [vo (¢ — ¢)| Ci(6) dg ’ 


‘ (15.135) 
Snt (9) = 55 fo B(S)p2(S) sin [vo (6 — )] Sn(G) dé. 
With the unperturbed solution Cp we get for C; 
1 £? 
Cu(p) =< f° BlE)palE) sin vn (6 — g)] €08 (08) 4, (15.136) 
and utilizing this solution Cz becomes 
1 £? 
C2(y) = = | B(6)p2(E) sin [vo (€ — ~)] cos (v96) (15.137) 
0 


4 
x / B(©)px(E) sin [vo (€ — £)] cos (vk) dé db, 


Further solutions are derived following this procedure although the formulas get 
quickly rather elaborate. With the cosine and sine like solutions we can formulate 
the transformation matrix for the whole ring 


_ ( CQnx) Sx) 
a (E05) (15.138) 
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and applying Floquet’s theorem, the tune of the circular accelerator can be deter- 
mined from the trace of the transformation matrix 


2cos2nv = C(2m) + S(2z), (15.139) 
where S$ =d5/dg. With the ansatz (15.132) this becomes 


2cos2rv = ) °C, (2) 5" + D> S, (2x) 8", (15.140) 


n>=0 n>0 


Retaining only up to third-order terms in 6, we finally get after some manipulations 
with (15.135) 


1 2n 
cos 2mv = cos 2V9 — oan sin2avp f p2() dé (15.141) 
Vo 0 


1 20 ¢ 
+ 2v2 Jo | Pi(S) p2(B) sin [vo (§ — B — 2)] 
x sin [Vo (B — o)] dB da 
1 2a c E 
ar / / i po(£) p2(&) p(y) sin [v9 (¢ — y — 27) 


x sin [vo (§ — ¢)] sin [vo (y — §)] dy dé dé. 


These integrals can be evaluated analytically and (15.141) becomes after some 
fairly lengthy but straightforward manipulations 


cos 27v = cos2r v9 — 8” (: ee 2 2 2) (15.142) 
_ 33 (: oe y oe) 
n>0 0 
a Ee) 
apes Lae 7 wy + 0084). 


Sein 
G+ —42 4A? — 402 


This expression defines the chromatic tune shift up to third order. Note that 
the tune vo is not the unperturbed tune but already includes the lowest-order 
approximation of the chromaticity (15.129). The relevant perturbations here are 
linear in the betatron amplitude and drive therefore half-integer resonances as is 
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Fig. 15.11 Variation of the 7 
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obvious from (15.142). The main contribution to the perturbation observed here are 
from the quadrupole and sextupole terms 


po(p) = vob? (k — mn,) (6 — 8? + 83...) . (15.143) 


In large storage rings the nonlinear chromaticity becomes quite significant as 
demonstrated in Fig. 15.11. Here the tune variation with energy in the storage 
ring PEP is shown both for the case where only two families of sextupoles 
are used to compensate for the natural chromaticities [8]. Since in this ring an 
energy acceptance of at least +1 % is required, we conclude from Fig. 15.11 that 
insufficient stability is available because of the nonlinear chromaticity terms shifting 
the tunes for off-momentum particles to an integer resonance within the desired 
energy acceptance. 

For circular accelerators or rings with a large natural chromaticity it is important 
to include in the calculation of the nonlinear chromaticity higher-order terms of the 
dispersion function 7,. Following the discussion in Sect. 9.4.1 we set in (15.143) 


nx(Q) = No +16 + m5? +... (15.144) 


and find the Fourier components a, and b, in (15.142) defined by 


vgB°(k — mno) = )- 2a, cosng , (15.145) 
n>0 
—v5B°(k — mno + mm)=)> | 2b, cosng . (15.146) 
n>0 


Nonlinear energy terms in the 7-function can sometimes become quite significant 
and must be included to improve the accuracy of analytical expressions for the 
nonlinear chromaticity. In such cases more sophisticated methods of chromaticity 
correction are required to control nonlinear chromaticities as well. One procedure is 
to distribute sextupoles in more than two families while keeping their total strength 
to retain the desired chromaticity. Using more than two families of sextupoles 
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allows us to manipulate the strength of specific harmonics a, such as to minimize 
the nonlinear chromaticities. Specifically, we note in (15.142) that the quadratic 
a 
n2 =i . 
be minimized by a proper distribution of sextupoles suppressing the n'"-harmonic of 
the chromaticity function v?B?(k—my). Special computer programs like PATRICIA 
[8] calculate the contribution of each sextupole to the Fourier coefficients a, and 
provide thereby the information required to select optimum sextupole locations and 
field strength to minimize quadratic and cubic chromaticities. 


This term can 


chromaticity term originates mainly from the resonant term 


15.5 Kinematic Perturbation Terms* 


The rules of linear beam dynamics allow the design of beam transport systems with 
virtually any desired beam characteristics. Whether such characteristics actually can 
be achieved depends greatly on our ability or lack thereof to control the source 
and magnitude of perturbations. Only the lowest-order perturbation terms were 
discussed in the realm of linear, paraxial beam dynamics. With the continued 
sophistication of accelerator design and increased demand on beam quality it 
becomes more and more important to also consider higher-order magnetic field 
perturbations as well as kinematic perturbation terms. 

The effects of such terms in beam-transport lines, for example, may compromise 
the integrity of a carefully prepared very low emittance beam for linear colliders or 
may contribute to nonlinear distortion of the chromaticity in circular accelerators 
and associated reduced beam stability. Studying nonlinear effects we will not only 
consider nonlinear fields but also the effects of linear fields errors in higher order, 
whether it be higher-order perturbation terms or higher-order approximations for 
the equations of motion. The sources and physical nature of perturbative effects 
must be understood to determine limits to beam parameters and to design correcting 
measures. 

Perturbations of beam dynamics not only occur when there are magnetic field and 
alignment errors present. During the derivation of the general equation of motion 
in Chap.5 we encountered in addition to general multipole fields a large number 
of kinematic perturbation terms or higher-order field perturbations which appear 
even for ideal magnets and alignment. Generally, such terms become significant for 
small circular accelerators or wherever beams are deflected in high fields generating 
bending radii of order unity or less. If, in addition, the beam sizes are large the 
importance of such perturbations is further aggravated. In many cases well-known 
aberration phenomena from light optics can be recognized. 

Of the general equations of motion, we consider terms up to third order for 
ideal linear upright magnets and get the equation of motion in the horizontal and 
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deflecting plane 
x + (kp +R) 02a? + kek? + (Gieck + ZK)? (15.147) 

+ 3h (x? — yy?) + «Ox! + yy) 
+ (-10K EK + RY + kk! + Kl?) x — (2K? + 3k) xx” 
+ E(HOK TK +R! + Ske! + Kl?) xy’ 
— Kyl x! + Kxyy! — Skxy? + Skea? 
+k 6 — K,5? + K8* + (27 +k) x8 — Klyy’d 
+ Le? + y)s + (—Skyk — 5K) yd 
+ (Qkxk + «3275 — (k + 2K?) x8? + O(4). 


In the nondeflecting or vertical plane the equation of motion is 


y" — ky = +2kgk xy — Oy — ay") + kay’ (15.148) 
= C+ K+! + Kl + KZ)? 
F(R + kk = KK + KP) xy 
+ tkyy? = KyKlay’ 
—K xx'y + skx?y — (2k? + k) xx'y’ 
—ky6 + «1 x'y — 2n,kxy5 + ky6? + O(4). 


It is quite clear from these equations that most perturbations become significant 
only for large amplitudes and oblique particle trajectories or for very strong mag- 
nets. The lowest-order quadrupole perturbations are of third order in the oscillation 
amplitudes and therefore become significant only for large beam sizes. Second- 
order perturbations occur only in combined-function magnets and for particle 
trajectories traversing a quadrupole at large amplitudes or offsets from the axis. 
Associated with the main fields and perturbation terms are also chromatic variations 
thereof and wherever beams with a large energy spread must be transported such 
perturbation terms become important. Specifically, the quadrupole terms kx5 and 
kyé determine the chromatic aberration of the focusing system and play a significant 
role in the transport of beams with large momentum spread. In most cases of beam 
dynamics, all except the linear chromatic terms can be neglected. 

Evaluating the effect of perturbations on a particle beam, we must carefully select 
the proper boundary conditions for bending magnets. Only for sector magnets is the 
field boundary normal to the reference path and occurs therefore at the same location 
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z independent of the amplitude. Generally, this is not true and we must adjust the 
integration limits according to the particle oscillation amplitudes x and y and actual 
magnet boundary just as we did in the derivation of linear transformation matrices 
for rectangular or wedge magnets. 


15.6 Perturbation Methods in Beam Dynamics 


In this chapter, mathematical procedures have been developed to evaluate the 
effect of specific perturbations on beam dynamics parameters. It is the nature of 
perturbations that they are unknown and certain assumptions as to their magnitude 
and distribution have to be made. Perturbations can be systematic, statistical 
but periodic or just statistical and all can have a systematic or statistical time 
dependence. 

Systematic perturbations in most cases become known through careful magnetic 
measurements and evaluation of the environment of the beam line. By construction 
magnet parameters may be all within statistical tolerances but systematically off 
the design values. This is commonly the case for the actual magnet length. Such 
deviations generally are of no consequences since the assumed magnet length in 
the design of a beam-transport line is arbitrary within limits. After the effective 
length of any magnet type to be included in a beam line is determined by magnetic 
measurements, beam optics calculations need to be repeated to reflect the variation 
in length. Similarly, deviations of the field due to systematic errors in the magnet 
gap or bore radius can be cancelled by experimental calibration of the fields with 
respect to the excitation current. Left are then only small statistical errors in the 
strength and length within each magnet type. 

One of the most prominent systematic perturbation is an energy error a particle 
may have with respect to the ideal energy. We have treated this perturbation in much 
detail leading to dispersion or 7-functions and chromaticities. 

Other sources of systematic field errors come from the magnetic field of ion 
pumps or rf-klystrons, from earth magnetic field, and current carrying cables along 
the beam line. The latter source can be substantial and requires some care in the 
choice of the direction the electrical current flows such that the sum of currents in all 
cables is mostly if not completely compensated. Further sources of systematic field 
perturbations originate from the vacuum chamber if the permeability of the chamber 
or welding material is too high, if eddy currents exist in cycling accelerators or 
due to persistent currents in superconducting materials which are generated just like 
eddy currents during the turn on procedure. All these effects are basically accessible 
to measurements and compensatory measures in contrast to statistical perturbations 
as a result of fabrication tolerances. 
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15.6.1 Periodic Distribution of Statistical Perturbations 


Whatever statistical perturbations exist in circular accelerators, we know that these 
perturbations are periodic, with the ring circumference being the period length. The 
perturbation can therefore always be expressed by a Fourier series. The equation 
of motion in the presence of, for example, dipole field errors is in normalized 
coordinates 


+ vaw = —ve BF Ac. (15.149) 


The dipole perturbation B*/? Ax is periodic and can be expressed by the Fourier 
series 


Praga Ss Fue, (15.150) 


where vo¢ is the betatron phase and the Fourier harmonics F,, are given by 


F,= = | Var wana] eine) de, (15.151) 


The location of the errors is not known and we may therefore only calculate 
the expectation value for the perturbation by multiplying (15.151) with its complex 
conjugate. In doing so, we note that each localized dipole perturbation deflects the 
beam by an angle @ and replace therefore the integral in (15.151) by a sum over all 
perturbations. With f Axd¢ ~ 6 we get for F,F* = |F,,|? 


1 i : 
Fal? = gy | DE Bebe + DY VPuBi GeO | ee) 
k kA#j 


where fx is the betatron function at the location of the dipole perturbation. The 
second sum in (15.152) vanishes in general, since the phases for the perturbations 
are randomly distributed. 

For large circular accelerators composed of a regular lattice unit like FODO cells 
we may proceed further in the derivation of the effects of perturbations letting us 
determine the field and alignment tolerances of magnets. For simplicity, we assume 
that the lattice magnets are the source of dipole perturbations and that the betatron 
functions are the same at all magnets. Equation (15.152) then becomes 


1 
[Fl = agp NmBmo6 , (15.153) 


where og is the expectation value for the statistical deflection angle due to dipole 
perturbations. In a little more sophisticated approach, we would separate all magnets 
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into groups with the same strength and betatron function and (15.153) becomes 
1 
2 2 
|Fnly = 452 NinBinOG m ; (15.154) 


where the sum is taken over all groups of similar perturbations and N,,, is the number 
of perturbations within the group m. In a pure FODO lattice, for example, obvious 
groups would be all QF’s, all QD’s and all bending magnets. From now on we will, 
however, not distinguish between such groups anymore to simplify the discussion. 

Periodic dipole perturbations cause a periodic orbit distortion which is 
from (15.149) 


vg Fn in 
wy) =o wo a (15.155) 


The expectation value for the orbit distortion is obtained by multiplying (15.155) 
with it’s complex conjugate and we get with w(g) = u(z)/4/B (z) 


+00 eilp +00 ening 
* Aye 2 
wu* = B(2)v"|Ful? >> eae > er (15.156) 
n=—0o m=—oo 


The sums can be replaced by — reas vino) and we get finally for the expectation 


value of the orbit distortion o,, at locations with a betatron function 6 

NBo2 
o = p—NPoe : (15.157) 
8 sin’ wv 


where B is the average betatron function at the locations of perturbations. This 
result is in full agreement with the result (15.39) for misaligned quadrupoles, where 
09 = Ou/f, Ou the statistical quadrupole misalignment and f the focal length of the 
quadrupole. 

This procedure is not restricted to dipole errors only but can be applied to 
most any perturbation occurring in a circular accelerator. For this we determine 
which quantity we would like to investigate, be it the tunes, the chromaticity, 
perturbation of the dispersion functions, or any other beam parameter. Variation of 
expressions for such quantities due to variations of magnet parameters and squaring 
such variation we get the perturbation of the quantity under investigation. Generally, 
perturbation terms of order n in normalized coordinates are expressed by 


Pj) = 1,0 Bp ey (15.158) 


Because the perturbations are assumed to be small, we may replace the oscillation 
amplitudes w” in the perturbation term by their principle unperturbed solutions. 
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Considering that the beam position w is a composite of, for example, betatron 
oscillation wg, orbit distortion w,, and energy error w, we set 


Ww = Wp + We + Wy (15.159) 


and note that any higher-order perturbation contributes to the orbit, the eta-function, 
the tunes, betatron functions, and other beam parameters. Orbit distortions in 
sextupoles of strength m, for example, produce the perturbations 


Po(z) = 5 v6 8°? mw? (15.160) 
which for w, = 0 can be decomposed into three components 


P(z) = 1y5 8°? mw, 
P21 (2) = voB°?mwews , (15.161) 


P(z) = Sy B°?mws, 


The perturbation Py) causes an orbit distortion and since the perturbations are 
randomly distributed the contribution to the orbit according to (15.157) is 


Ns Bus Oo; 


2 
6” = 6,——>— 3 15.162 
w= P 8 sin? rv, ( ) 


where N, is the number of sextupoles, 6,; the value of the betatron function and 
o, the rms orbit distortion at the sextupoles, og = 5moz ls and £, is the effective 
sextupole deflection and length, respectively. In cases of very strong sextupoles 
iteration methods must be applied since the orbit perturbation depends on the orbit. 
Similarly, we could have set w, = 0 to calculate the perturbation of the 7-function 
due to sextupole magnets. 

The linear perturbation P2; in (15.161) causes a statistical tune shift and a 
perturbation of the betatron function. Following the derivation of tune shifts in 
Sect. 15.3, we find the expectation value for the tune shift to be 


(52v) = ae =o Crane (15.163) 


where (uv) is the random misalignment of the sextupole magnets or random orbit 
distortions in the sextupoles. 

We find the interesting result, that sextupoles contribute to a tune error only if 
there is a finite orbit distortion or misalignment uo, while a finite betatron oscillation 
amplitude of a particle in the same sextupoles does not contribute to a tune shift. 
Similarly, we may use the effects of systematic errors to get expressions for the 
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probable variation of the betatron function (15.91) due to gradient errors from 
misaligned sextupoles. 

In the approximation of small perturbations, we are able to determine the 
expectation value for the effect of statistical errors on a particular beam parameter 
or lattice function. This formalism is used particularly when we are interested to 
define tolerances for magnetic field quality and magnet alignment by calculating 
backwards from the allowable perturbation of beam parameters to the magnitude of 
the errors. Some specific statistical effects will be discussed in subsequent sections. 


15.6.2. Periodic Perturbations in Circular Accelerators 


Alignment and field errors in circular accelerators not only cause a distortion of 
the orbit but also a perturbation of the 7-functions. Although these perturbations 
occur in both the horizontal and vertical plane, we will discuss only the effect in 
the vertical plane. While the derivations are the same for both planes the errors 
contribute only to a small perturbation of the already existing horizontal 7-function 
while the ideal vertical n-function vanishes, and therefore the perturbation can 
contribute a large variation of beam parameters. This is specifically true for electron 
storage ring where the vertical beam emittance is very small and a finite vertical 
n-function may increase this emittance considerably. 

Similar to (15.11) we use the equation of motion 


y" — ky = +Aky — Akyd — kyS + mxy (15.164) 
with the decomposition 
y=yet vyd (15.165) 


and get in normalized coordinates ) = y/,/By, while ignoring the betatron motion, 
the differential equations for the orbit distortion y, 


Yo + VyVo = +0, By? (Ak, + mxcye) (15.166) 

and for the perturbation of the 7-function 0, = v,/ </ By 
Dy + vp dy = ve BS? Any + veBS(k — mnx)Ve - (15.167) 
First, we note in a linear lattice where m = 0 that the differential equations for 


both the closed orbit distortion and the 7-function perturbation are the same except 
for a sign in the perturbation. Therefore, in analogy to (15.157) 


2 _ B(2)Bo 2 
(v,(2)) = es 278. (15.168) 
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The perturbation of the 7-function becomes more complicated in strong focusing 
lattices, where the chromaticity is corrected by sextupole fields. In this case, we note 
that all perturbation terms on the r.h.s. are periodic and we express them in Fourier 
series 


n=-+00 
vp An = >) Re (15.169) 
n=—oo 
with 
2 2n 
r= = | 3! Axy ei dt (15.170) 
20 0 : 
and 
n=+o0o0 
v*Bs(k— mn) = D> Ane”? (15.171) 
n=—oo 
with 
2 20 
A,=—— | 6'(k—mn,e*" dr. (15.172) 
Qn 0 


We also make use of the periodicity of the perturbation of the n-function and set 


n=+00 


i=) Ee. (15.173) 


n=—oo 


Inserting (15.169)—(15.173) into (15.167) we get with the periodic solution of the 
closed orbit 


~ Fn in 
Jel) = >> aad (15.174) 
. Als pce 
Ye? —w)E, + F,|e" — a lim? = 0. (15.175) 
y= 


Noting that this equation must be true for all phases ¢ all terms with the same 
exponential factor must vanish separately and we may solve for the harmonics of 
the 7-function 


F,, An—+ Fy 
E, = -—=>=—> ; 15.176 
Pen 2 game ( ) 
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The perturbation of the 7-function is therefore 


An-r F, 


a Ga e”, (15.177) 


dy(y) = —Fe() + >> we 


ns 


We extract from the double sum on the r.h.s. of (15.177) all terms with n = r and 
get from those terms the expression Ay )—,, wy el” The coefficient Ap, however, 
is just the natural chromaticity Ag = 2&y/vy and the perturbation of the -function 


is from (15.177) 


F, eing An—r F, einy 
a (15.178) 


(v2 = n?)? (v2 _ n?)(v2 = r) . 


pact 
5M) = Fey) + =D 


yon nZ#r 

By correcting the orbit distortion and compensating the chromaticity, we are able 
to greatly reduce the perturbation of the vertical 7-function. All terms with r = 0 
vanish for a truly random distribution of misalignment errors since Fo = 0. Taking 
the quadrupole lattice as fixed we find the remaining terms to depend mainly on 
the distribution of the orbit correction F,, and sextupole positions A;. For any given 
sextupole distribution the orbit correction must be done such as to eliminate as much 
as possible all harmonics of the orbit in the vicinity of the tunes r % v, and to center 
the corrected orbit such that Fo = 0. 

Furthermore, we note that some care in the distribution of the sextupoles must 
be exercised. While this distribution is irrelevant for the mere correction of the 
natural chromaticities, higher harmonics of the chromaticity function must be held 
under control as well. The remaining double sum is generally rather small since the 
resonance terms have been eliminated and either v — n or v — r is large. However, 
in very large rings or very strong focusing rings this contribution to the perturbation 
of the 7-function may still be significant. 


15.6.3 Statistical Methods to Evaluate Perturbations 


In an open beam-transport line the perturbation effect at a particular point depends 
only on the upstream perturbations. Since perturbations cannot change the position 
but only the slope of particle trajectories, we merely transform the random kick 
angle 6, from the location of the perturbation to the observation point. Adding all 
perturbations upstream of the observation point we get with w = w(z) 


u(z)= BZ) »e V Bx sin(y — Wr) 9. 


vv) 15.179 
WO= zig LD VBLcosty — Ya) oe 


Wk<w(z) 
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The expectation value for the position of the beam center at the observation point 
becomes from the first equation (15.179) noting the complete independence of the 
perturbations 


ou(z) = VBZ)INY (B)oo « (15.180) 


Random variations of the beam position are customarily corrected by special 
steering magnets if such correction is required at all. In long beam-transport systems 
like those required in linear colliders a mere correction of the beam position at 
the collision point, for example, may not be acceptable. Specifically, nonlinear 
perturbations lead to an incoherent increase of the beam size which can greatly 
reduce the usefulness of the colliding-beam system. In the next subsection we will 
therefore discuss general perturbations in beam-transport lines and their effect on 
the beam cross section. 


15.7 Control of Beam Size in Transport Lines 


For the transport of beams with a very small beam size or beam emittance like in a 
linear collider facilities we are specially concerned about the impact of any kind of 
perturbation on the integrity of a small beam emittance. Errors can disturb the beam 
size in many ways. We have discussed already the effect of dipole errors on the 
dispersion. The distortion of the dispersion causes an increase in the beam size due 
to the energy spread in the beam. Quadrupole field errors affect the value of betatron 
functions and therefore the beam size. Vertical orbit distortions in sextupoles give 
rise to vertical—horizontal coupling. In this section we will try to evaluate these 
effects on the beam size. 

We use the equations of motion (6.95), (6.96) up to second order in x, y and 
6, and assume the curvature to be small of the order or less than (x, y, 5). This is 
a proper assumption for high-energy beam transport lines like in linear colliders. 
For lower-energy beam lines very often this assumption is still correct and where a 
better approximation is needed more perturbation terms must be considered. For the 
horizontal plane we get 


x" + (kK, +A) x = KS — k,5? — ky — 4m? — y*)(1— 8) (15.181) 
— Ak,(1 — 6) + kxé + Akx(1 — 5) + O(3) 


and for the vertical plane 


y” —ky = K,6 — Ky6 — kx + mxy(1 — 8) — kyd (15.182) 
—Ax,(1 — 8) — Aky(1 — 6) + OG) 
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In these equations rotated magnets (k,,k,m) are included as small quantities 
because rotational alignment errors of upright magnets cause rotated error fields 
although no rotated magnets per se are used in the beam line. For the solution 
of (15.181) and (15.182) we try the ansatz 


x =xg + Xe + 5 + x5 + w, 8", 
(15.183) 
Y=yp t+ ye + md + vyd + wyd?. 


Here we define (xg, yg) as the betatron oscillations, (x;, yc) the orbit distortions, 
(nx, Ny) the dispersion function, (v,, vy) the perturbations of the dispersion functions 
due to magnetic field errors, and (w,, wy) the first-order chromatic perturbation of 
the dispersion functions (Hor = 7-+v-+w6-+...). This ansatz leads to the following 
differential equations in the horizontal plane where we assume the bending radii to 
be large and k,, xy are therefore treated as small quantities 


xg + kxgp = —kyp — 5x; - yp) — m(xpXe — ypYc) + Akxg , a) 
xt + ke = —Aky + Ake — Kye — 5m(x2 — y?), b) 
netkn, = +ky, c) 
uy thy. = —kvy —m (xg + Xc)(Nx + Vy) +m (vp 7 Ye) (Ny - vy) d) 


+ Ak (Xe + Xp) + Aky + AK (Ny + Vx) 
+kxg + ke + $m(x2 — y2) + kye. 
Wh + kW = —Ky — $m(n2 + 2xv, — 2Znyvy — v2) + K(x + vz) e) 
= +m (XeNx + XcVy — VeNy — Vey) + (Ny + Vye)xXB— VyVp. 


(15.184) 
Similarly, we get for the vertical plane 
yg —kyp = —kxp + mxpyp — Akyp + m(xeyp + xpyc) a) 
yo ky, = —Aky — Akye — kx, + MXeYe 5 b) 
ny —kny = +ky, c) 
vy —kvy = +Aky — K(ny + vy) + m(xg + Xe) (Ny + Vy) d) 


+m (ny + vx)(yp + Ye) + Ak(yg + Ye) + kX — MXcVe 

—K(yp + ye) — Ak(ny + vy), 
wy es kwy = —Ky + K(ny — Uy) e) 
+m (nny + xVy + UxNy + UxVy) : 


(15.185) 


The solution of all these differential equations is, if not already known, straight- 
forward. We consider every perturbation to be localized as a thin element causing 
just a kick which propagates along the beam line. If {; is the betatron function at 
the observation point and f; that at the point of the perturbation p,,; the solutions 
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of (15.184), (15.185) have the form 
—_ Vaid VBisin wy | pide. (15.186) 


The kick due to the perturbation is 6; = f pni dz, where the integral is taken along 
the perturbation assumed to be short. To simplify the equations to follow we define 
the length £; = 6;/(p,;). Since most errors derive from real magnets, this length is 
identical with that of the magnet causing the perturbation and yj; = yj — yj; is the 
betatron phase between perturbation and observation point. A closer look at (15.184) 
and (15.185) shows that many perturbations depend on the solution itself requiring 
an iterative solution process. Here we will, however, concentrate only an the first 
iteration. 

Ignoring coupling terms we have in (15.184) two types of perturbations, sta- 
tistically distributed focusing errors Ak and geometric aberration effects due to 
sextupoles. We assume here that the beam line is chromatically corrected by the use 
of self-correcting achromats in which case the term s(x; — yp) is self-canceling. 


2 
The expectation value for the betatron oscillation amplitude due to errors is then 


x4(2) = Br) >, Bri ((Pnits)?) sin? Wij (15.187) 
or 
(x9(2)) = Br@By(k’yg + Alxg + m(xpxe + yey) 3NuE? (15.188) 


where B, is the average value of the betatron functions at the errors, Nyy the number 
of perturbed magnets and ¢ the magnet length. With k = ka, where a is the 
rotational error, we get 


(x4 (2))= 3Bx(OB Nur e? (15.189) 
2 m 
x | oto? + ofo? + T (e203 - of03)]. 
We have assumed the errors to have a Gaussian distribution with standard width 
o. Therefore, o2 = (a7), 02 = ((Ak/k)*), Ore = (x2), etc., and oy, 0; the standard 


beam size for the Gaussian particle distribution. Since (x4 (z))/ (2) = Ae, is the 


increase in beam emittance and o? = €,,, 0, = €,B, we get for a round beam 
for which €, = ¢€, and the average values for the betatron functions are the same 


(B. = By) 


Dey 2 2 
Se SB Nuke? le: +o, + Tr (Ox + 03) (15.190) 


x 
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To keep the perturbation of the beam small the alignment o, and magnet field 
quality o, must be good and the focusing weak which, however, for other reasons 
is not desirable. For a chromatically corrected beam line we have k/m = 7,, which 
can be used in (15.190). The perturbation of the vertical beam emittance follows 
exactly the same results because we used a round beam. 

The expectation value for the shift of the beam path is derived from (15.184a), 
(15.185b) with (15.186) in a similar way as for the betatron oscillations 


(x2(2)) = $Bx(2)B.Nul? [(Akz) + Pog (xz) + Poglye)] - (15.191) 


This expression for the path distortion, however, is not to be used to calculate the 
perturbation of the dispersion. In any properly operated beam line one expects this 
path distortion to be corrected leading to a smaller residual value depending on the 
correction scheme applied and the resolution of the monitors. With some effort the 
path can be corrected to better than 1mm rms which should be used to evaluate path 
dependent perturbation terms in (15.184), (15.185). In the vertical plane we get 


(ye(2)) = 5By(2)ByNue? [(An?) + og (y2) + Pog (xe) ] . (15.192) 


The perturbation of the dispersion is with (15.184d) and (15.186) 


Ue(2) = —Xe(z) + VB D> VBuprili sin yj (15.193) 


In (15.184d) we note the appearance of the same perturbation terms as for the path 
distortion apart from the sign and we therefore separate that solution in (15.193). 
The perturbations left are then 


Pxi = (k — mnx) (xB a Xe) + m(yg oF Ye) Ny + Aknx Serer (15.194) 


In this derivation the betatron phase wy; does not depend on the energy since 
the chromaticity is corrected. Without this assumption, we would get another 
contribution to v, from the beam-path distortion. We also note that the chromaticity 
factor (k — mn,) can to first order be set to zero for chromatically corrected beam 
lines. The expectation value for the distortion of the dispersion is finally given by 


(VQ) = 2] + ZBOB Nal? | (AMI? + mH, (99) + mH, 202) | 
(15.195) 


or with some manipulation 


2 
(uz (z)) = (x¢(2)) + 5Br(@B Nur? [oi My + (2) (Byey + “| (15.196) 
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The perturbation of the dispersion function is mainly caused by quadrupole field 
errors while the second term vanishes for a plane beam line where 7, = 0. 
In principle, the perturbation can be corrected if a specific path distortion is 
introduced which would compensate the perturbation at the point z as can be seen 
from (15.193). 

In the vertical plane we proceed just the same and get instead of (15.193) 


vy(2) = ~ve() + Bo) > Bsr (Pviéi) sin Yay (15.197) 
with 


Pyi = —(k— mnx)(yp + Yo) + mux(yp + Ye) (15.198) 
+ m(Ny + vy)(xp + Xe) — Ak(ny + vy) — K(ny + vy). 
Again due to chromaticity correction we have (k — mn,) ~ 0 and get for the 


expectation value of (v2) in first approximation with v, = 0 in (15.198) and the 
average values 77, and v; 


(v5(2)) = (2@) + 5By@ByNukP? 2? (15.199) 


x IG +oz+ 


(xe) \ oo | Be 
7 ) if + a (Bye, “te v2) | : 


x 


For a plane beam line where 7, = 0, we clearly need to go through a further 
iteration to include the perturbation of the dispersion which is large compared to 
ny = 0. In this approximation, we also set y,(z) = 0 and 


Si 
RMR be 


(v5(2) = <5 (Byey + 02) - (15.200) 


S| 


Using this in a second iteration gives finally for the variation of the vertical 
dispersion function due to field and alignment errors 


2 
(v,(2)) = (2(2)) +5 By(@)ByNMke e? G + oy + a) (15.201) 


2 =2 
z = vy Uy 
x G 7 by) a abuts ie qe Pe + | . 


This second-order dispersion due to dipole field errors is generally small 
but becomes significant in linear-collider facilities where extremely small beam 
emittances must be preserved along beam lines leading up to the collision point. 
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Problems 


15.1 (S). Use the perturbation terms P, (z) in (15.161) and show that pure betatron 
oscillations in sextupoles do not cause a tune shift in first approximation. Why is 
there a finite tune shift for the P2, (z)-term? 


15.2 (S). Show analytically that the dispersion function for a single bending magnet 
with a bending angle 0 seems approximately to emerge from the middle of the 
magnet with a slope D’ = @. 


15.3 (S). Use the lattice of example #3 in Table 10.1 and introduce vertical rms 
misalignments of all quadrupoles by (5x)... = 0.1mm. Calculate the vertical 


rms dispersion function. Then, add also rotational alignment errors of the bending 
magnets by (da) _. = 0.17 mrad and calculate again the vertical rms dispersion. 


rms 


15.4 (S). Use two bending magnets separated by a drift space of length ¢. Both 
bending magnets are of equal but opposite strength. Such a deflection arrangement 
causes a parallel displacement d of the beam path. Show that in this case the 
contribution to the dispersion at the end of the second bending magnet is D = —d 
and D’ = 0. 


15.5 (S). For the rings in Problems 15.6 or 15.7 calculate the rms tolerance on 
the quadrupole strength to avoid the integer or half integer resonance. What is 
the corresponding tolerance on the quadrupole length? To avoid gradient fields 
in bending magnets the pole profiles must be aligned parallel with respect to the 
horizontal midplane. What is the angular tolerance for parallelism of the poles? 


15.6. Use parameters of example #4 in Table 10.1 for a FODO lattice and construct 
a full ring. Adjust the quadrupole strength such that both tunes are an integer plus 
a quarter. Calculate the rms alignment tolerance on the quadrupoles required to 
keep the beam within o, = 0.1mm and o, = 0.1mm of the ideal orbit. What 
is the amplification factor? Determine the rms deflection tolerance of the bending 
magnets to keep the beam within 0.1 mm of the ideal orbit. A rotation of the bending 
magnets about its axis creates vertical orbit distortions. If the magnets are aligned 
to a rotational tolerance of o, = 0.17 mrad (this is about the limit of conventional 
alignment techniques) what is the expectation value for the vertical orbit distortion? 


15.7. Repeat the calculation of Problem 15.6 with the lattice example #1 in 
Table 10.1. The alignment tolerances are much relaxed with respect to the ring 
in Problem 15.6. What are the main three contributions influencing the tolerance 
requirements? Make general recommendations to relax tolerances. 


15.8. Design an electrostatic quadrupole with an aperture radius of 3cm which is 
strong enough to produce a tune split of dv = 0.01 between a counter rotating 
particle and antiparticle beam at an energy of 3GeV. Assume the quadrupole to 
be placed at a location with a betatron function of B = 10m. How long must the 
quadrupole be if the electric field strength is to be limited to no more than 15 kV/cm? 
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15.9. Consider a long straight beam-transport line for a beam with an emittance of 
€ = 10~!’ rad-m from the end of a 500 GeV linear collider linac toward the collision 
point. Use a FODO channel with Bmax = 5 m and determine statistical tolerances 
for transverse and rotational alignment and strength tolerances for the FODO cell 
quadrupoles to prevent the beam emittance from dilution by more than 10%. 


15.10. Use parameters of example #4 in Table 10.1 for a FODO lattice and 
construct a full ring. Adjust the quadrupole strength such that both tunes are an 
integer plus a quarter. Calculate the rms alignment tolerance on the quadrupoles 
required to keep the beam within o, = 0.1mm and o, = 0.1mm of the ideal 
orbit. What is the amplification factor? Determine the rms deflection tolerance of 
the bending magnets to keep the beam within 0.1 mm of the ideal orbit. A rotation of 
the bending magnets about its axis creates vertical orbit distortions. If the magnets 
are aligned to a rotational tolerance of og = 0.17 mrad (this is about the limit of 
conventional alignment techniques) what is the expectation value for the vertical 
orbit distortion? 


15.11. Consider statistical transverse alignment errors of the quadrupoles in the 
large hadron collider lattice example #4 in Table 10.1 of (5x),.,, = 0.1 mm. What 
is the rms path distortion at the end of one turn? Determine the allowable rotational 
alignment error of the bending magnets to produce a vertical path distortion of no 
more than that due to quadrupole misalignments. How precise must the bending 
magnet fields be aligned to not contribute more path distortion than the quadrupole 
misalignments. 


15.12. Repeat the calculation of Problem 15.10 with the lattice example #1 in 
Table 10.1. The alignment tolerances are much relaxed with respect to the ring 
in Problem 15.10. What are the main three contributions influencing the tolerance 
requirements? Make general recommendations to relax tolerances. 


15.13. Calculate the expectation value for the integer and half integer stop band 
width of the ring in Problem 15.5. Gradient errors introduce a perturbation of the 
betatron functions. What is the probable perturbation of the betatron function for 
the case in Problem 15.5? 


15.14. Consider a FODO cell equal to examples #1, #2, and #4 in Table 10.1, adjust 
the phase advance per cell to equal values and calculate the natural chromaticities. 
Insert thin sextupoles into the center of the quadrupoles and adjust to zero 
chromaticities. How strong are the sextupoles? Why must the sextupoles for lattice 
#2 be so much stronger compared with lattice #4 even though the chromaticity per 
cell is about the same? 


15.15. Consider the transformation of phase ellipses through one full FODO cell 
of the examples in Problem 15.14. Let the emittance for the phase ellipses be € = 
10 mm mrad. First transform the equation for the phase ellipse into a circle by setting 
u = x and v = a, + Bx’. Transform the phase circle from the center of the QF 
through one full FODO cell to the center of the next QF ignoring any sextupole 
terms. Repeat this transformation but include now the sextupole in the first QF only 
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as calculated in Problem 15.14. Discuss the distortions of the phase circle for the 
three different FODO lattices. 
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Chapter 16 
Resonances 


Particle resonances in circular accelerators occur as a result of perturbation terms 
involving particular Fourier harmonics. That approach is based on the common 
knowledge that periodic perturbations of a harmonic oscillator can cause a res- 
onance when the perturbation frequency is equal to an eigenfrequency of the 
oscillator. 


16.1 Lattice Resonances 


Perturbation terms in the equation of motion can lead to a special class of beam 
instabilities called resonances, which occur if perturbations act on a particle in 
synchronism with its oscillatory motion. While such a situation is conceivable in a 
very long beam transport line composed of many periodic sections, the appearance 
of resonances is generally restricted to circular accelerators. There, perturbations 
occur periodically at every turn and we may Fourier analyze the perturbation with 
respect to the revolution frequency. If any of the harmonics of the perturbation 
terms coincides with the eigenfrequency of the particles a resonance can occur and 
particles may get lost. Such resonances caused by field imperfections of the magnet 
lattice are also called structural resonances or lattice resonances. We have already 
come across two such resonances, the integer and the half-integer resonances. 

The characteristics of these two resonances is that the equilibrium orbit and 
the overall focusing is not defined. Any small dipole or quadrupole error would 
therefore lead to particle loss as would any deviation of the particle position and 
energy from ideal values. Since these resonances are caused by linear field errors, 
we call them also linear resonances. Higher order resonances are caused in a similar 
way by nonlinear fields which are considered to be field errors with respect to the 
ideal linear lattice even though we may choose to specifically include such magnets, 
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like sextupoles or octupoles, into the lattice to compensate for particular beam 
dynamics problems. 


16.1.1 Resonance Conditions 


In this section the characteristics of resonances in circular accelerators will be 
derived starting from the equation of motion in normalized coordinates with only 
the nth order multipole perturbation term. This is no restriction of generality since in 
linear approximation each multipole perturbation having its own resonance structure 
will be superimposed to that of other multipole perturbations. On the other hand, the 
treatment of only a single multipole perturbation will reveal much clearer the nature 
of the resonance. The equation of motion in normalized horizontal coordinates for 
an nth-order perturbation is from (15.1) 


ty + v9.W = Vo, B2/? BE BS? ,s(p)w'v', (16.1) 


where vo, is the unperturbed horizontal tune and7,s = 0,1,2...integers with r + 
s = n—1.A similar equation holds for vertical oscillations by replacing w (@) , Vox, x 
with v(g), etc. and vice versa. The perturbations can be any appropriate term in 
the equations of motion (6.95) and (6.96), however, we will consider primarily 
perturbation terms which occur most frequently in circular accelerators and are 
due to rotated quadrupole or nonlinear multipole fields. The general treatment of 
resonances for other perturbations is not fundamentally different and is left to the 
interested reader. From Chap. 9 we extract the dominant perturbation terms of order 
n = r-+s-+ 1 in normalized coordinates and compile them in Table 16.1 ordered 
by perturbations of order r and s. To keep track of all types of perturbations requires 
the use of many indices. We will keep the discussion more simple by ignoring 
coupling, which will be treated separately, and concentrate only on a single order 


Table 16.1 Lowest order 
perturbation terms 
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of perturbation. For simplicity we drop from here on the index , and set B, = B. 
Equation (16.1) becomes then 


n/2+1 


+ vow = v9 B"7*"p, (pw |, (16.2) 


All perturbations are periodic in g and can be expanded into a Fourier series 


dn (¢) = vp B"/?+ 1p, (9) = Sane” (16.3) 


Since the perturbation is supposed to be small, we will insert the unperturbed 
oscillation wo on the right-hand side of (16.2). The general form of the unperturbed 
betatron oscillation can be written like 


wo(y) = ae”? + be, (16.4) 


where a and b are arbitrary constants and we may now express the amplitude factor 
in the perturbation term w"~!(g) by a sum of exponential terms which we use on 
the right hand side of (16.2) as a first approximation 


w'l(p) ~wh'(~) = D> We ie, (16.5) 


\|<n-1 


Inserting both (16.3) and (16.5) into (16.2) we get for the equation of motion 


+ yaw =) Widam ert nor, (16.6) 


lm 


The solution of this equation includes resonant terms whenever there is a 
perturbation term with a frequency equal to the eigenfrequency vo of the oscillator. 
The resonance condition is therefore (m,n, / are integers) 


m+ly=v with = |i] <n—1. (16.7) 


From earlier discussions we expect to find here the integer resonance caused by 
dipole errors n = 1. In this case the index / can only be / = 0 and we get from (16.7) 


im (16.8) 


which is indeed the condition for an integer resonance. 

Magnetic gradient field errors (n = 2) can cause both a half integer-resonance as 
well as an integer resonance. The index / can have the values / = 0 and/ = +1. 
Note however that not all coefficients W; necessarily are nonzero. In this particular 
case, the coefficient for / = O is indeed zero as becomes obvious by inspection 
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of (16.4). The resonance conditions for these second order resonances are 


m+V9 =Vo ~ m=O — tune shift at any tune, 
M—Vo = Vo > m= 2vq — integer and half integer resonance, (16.9) 
m= vo — no resonance because Wo = 0. 


Among the resonance conditions (16.9) we notice that for m = 0 the effect of the 
perturbation on the particle motion exists independent of the particular choice of the 
tune vo. The perturbation includes a nonvanishing average value g29 which in this 
particular case represents the average gradient error of the perturbation. Like any 
other gradient field in the lattice, this gradient error also contributes to the tune and 
therefore causes a tune shift. From (16.2) we find the new tune to be determined by 
waa [1 - (6p), and the tune shift is bv ~ —5v9 (B’p2), in agreement with 
our earlier result in Sect. 15.3.1. 

Third order resonances (n = 3) can be driven by sextupole fields and the index / 
can have values 


l= —-—2, -1, 0, +1, +2. (16.10) 


Here we note that W; = W_, = 0 and therefore no resonances occur for / = +1. 
The resonance for sextupole field perturbations are then 


m—2v9 = Vo > m= 3v9 — third order resonance, 
m=vV0o > M= Vo — integer resonance, (16.11) 
m+2v9 = vo > m= —vVvo — integer resonance. 


Sextupole fields can drive third order resonances at tunes of 


(16.12) 


vo=r+h or vo =r- 


wre 


where r is an integer. They also can drive integer resonances. 
Finally we derive resonance conditions for octupole fields (n = 4) where 


ja=3, 9, =1, 0, 41, 43, 43 (16.13) 


and again some values of / do not lead to a resonance since the amplitude coefficient 
W, is zero. For octupole terms this is the case for / = 0 and / = +2. The remaining 
resonance terms are then 


m—3v9 = v9 > m=4v9 — quarter integer resonance, 
mM—Vo=Vo > m=2v9 — half integer resonance, 
m+v=vo > m=0 — tune spread at any tune, 
m+ 3v9 = V9 > m=—2Vv9 — half integer resonance. 


(16.14) 


16.1 Lattice Resonances 543 


The resonance condition form = 0 leads to a shift in the oscillation frequency. 
Different from gradient errors, however, we find the tune shift generated by 


octupole fields to be amplitude dependent v = v5 [! — (B *paw’),, . The amplitude 


dependence of the tune shift causes an asymmetric tune spread to higher or lower 
values depending on the sign of the perturbation term p4 while the magnitude of the 
shift is determined by the oscillation amplitude of the particle. 

The general resonance condition for betatron oscillations in one plane can be 
expressed by 


|m| = (|J| + 1)vo, (16.15) 


where |/| < n— 1 and the value |/| + 1 is the order of resonance. The index m is 
equal to the order of the Fourier harmonic of the perturbation and we call therefore 
these resonances structural or lattice resonances to distinguish them from resonances 
caused, for example, by externally driven oscillating fields. 

The maximum order of resonances in this approximation depends on the order 
of nonlinear fields present. An nth-order multipole field can drive all resonances 
up to n'-order with driving amplitudes that depend on the actual multipole field 
strength and locations within the lattice. Generally, the higher the order n the weaker 
is the resonance. In electron circular accelerators radiation damping makes higher 
order resonances ineffective. This is not the case for proton or ion beams which 
accumulate any effect leading, if not to beam loss, then to beam dilution or emittance 
blow-up. 

The term resonance is used very generally to include also effects which do not 
necessarily lead to a loss of the beam. Such “resonances” are characterized by 
m = 0 and are independent of the tune. In the case of gradient errors this condition 
was shown to lead to a stable shift in tune for the whole beam. Unless this tune 
shift moves the beam onto another resonance the beam stability is not affected. 
Similarly, octupole fields introduce a spread of tunes in the beam proportional to 
the square of the oscillation amplitude. Again no loss of particles occurs unless the 
tune spread reaches into the stop band of a resonance. By induction we conclude 
that all even perturbation terms, where n is an even integer, lead to some form 
of tune shift or spread. No such tune shifts occur for odd order perturbations 
in the approximation used here. Specifically we note that dipoles, sextupoles or 
decapoles etc. do not lead to a tune shift for weak perturbations. Later, however, 
we will discuss the Hamiltonian resonance theory and find, for example, that strong 
sextupole perturbations can indeed cause a tune spread in higher order. 

In this derivation of resonance parameters we have expanded the perturbations 
into Fourier series and have assumed the full circular accelerator lattice as the 
expansion period. In general, however, a circular accelerator is composed of one 
or more equal superperiods. For a circular lattice composed of N superperiods the 
Fourier expansion has nonzero coefficients only every Nth-harmonic and therefore 
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the modified resonance conditions are 
I/IN = (= Devo, (16.16) 


where j is an integer. A high super-periodicity actually eliminates many resonances 
and is therefore a desirable design feature for circular accelerator lattices. The 
integer and half-integer resonances, however, will always be present independent 
of the super-periodicity because the equilibrium orbits and the betatron functions 
respectively are not defined. On the other hand, integer and half-integer resonances 
driven by multipole perturbations may be eliminated in a high periodicity lattice 
with the overall effect of a reduced stop band width. It should be noted here, that the 
reduction of the number of resonances works only within the applied approximation. 
“Forbidden” resonances may be driven through field and alignment errors which 
compromise the high lattice periodicity or by strong non-linearities and coupling 
creating resonant driving terms in higher order approximation. Nevertheless, the 
forbidden resonances are weaker in a lattice of high periodicity compared to a low 
periodicity lattice. 


16.1.2 Coupling Resonances 


Betatron motion in a circular accelerator occurs in both the horizontal and vertical 
plane. Perturbations can be present which depend on the betatron oscillation 
amplitude in both planes. Such terms are called coupling terms. The lowest order 
coupling term is caused by a rotated quadrupole or by the rotational misalignment 
of regular quadrupoles. In general we have in the horizontal plane the equation of 
motion from (16.1) 


Wy + vow = v6.8? Br Bp, .(g)w'v*, (16.17) 


where r,s are integers and w,v describe betatron oscillations in the horizontal 
and vertical plane, respectively. Again we use the unperturbed solutions wo(¢) 
and vo,(@) of the equations of motion in the form (16.4) and express the higher 
order amplitude terms in the perturbation by the appropriate sums of trigonometric 
expressions: 


ars(9) = Vorb Br BS*Dis(~) = D> drsme”®, (16.18) 


and similar to (16.5) 


w= wel, (16.198 
\i|<r-1 
vg) = » Veet rn? , (16.19b) 


|e) <s-1 
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Insertion into (16.17) gives after some sorting 
+ Vow = D~ GrsmW)Veellrt Mort Ore, (16.20) 
where m, / and £ are integers. The resonance condition is 
m + lvox + €voy = Vox » (16.21) 
and the quantity 
[J+ |é)+1<n (16.22) 
designates the order of the coupling resonances. Again, for a super-periodicity N 
we replace m by jN, where j is an integer. As an example, we discuss a perturbation 
term caused by a rotated quadrupole for which the equation of motion is 
iv + vow = qoi(y)v. (16.23) 


In this case we have n = 2 andr = 0 and the lowest order resonance condition with 
1 = Oand £ = +1 is from (16.21) 


m+ Lvoy = Vor. (16.24) 
Resonance occurs for 
|m| = vox + Voy and |m| = Vox — Voy. (16.25) 


There is no coupling resonance for £ = 0 since Vo = 0. The resonances identified 
in (16.25) are called linear coupling resonances or linear sum resonance (left) and 
linear difference resonance (right) , respectively. 

Delaying proof for a later discussion we note at this point that the sum resonance 
can lead to a loss of beam while the difference does not cause a loss of beam 
but rather leads to an exchange of horizontal and vertical betatron oscillations. In 
circular accelerator design we therefore adjust the tunes such that a sum resonance 
is avoided. 


16.1.3 Resonance Diagram 


The resonance condition (16.15) has been derived for horizontal motion only, but a 
similar equation can be derived for the vertical motion. Both resonance conditions 
can be written in a more symmetric way 


lox + LVoy = JN, (16.26) 
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Fig. 16.1 Resonance n+l 
diagram for a ring with 
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where /,¢,j are integers and |/| + |€| is the order of the resonance. Plotting all 
lines (16.26) for different values of /, €,j in a (vy, vy) diagram produces what is 
called a resonance diagram. In Fig. 16.1 an example of a resonance diagram for 
N = 1 is shown displaying all resonances up to third order with |/| + |€| < 3. 

The operating points for a circular accelerator are chosen to be clear of any of 
these resonances. It should be noted here that the resonance lines are not mathemat- 
ically thin lines in the resonance diagram but rather exhibit some “thickness” which 
is called the stop band width. This stop band width depends on the strength of the 
resonance as was discussed earlier. 

Not all resonances are of the same strength and generally get weaker with 
increasing order. While a particle beam would not survive on an integer or a half- 
integer resonance all other resonances are basically survivable, at least for electron 
beams. For proton or ion beams higher order resonance must be avoided to prevent 
beam dilution. Only in particular cases, where strong multipole field perturbations 
cause a higher order resonance, may we observe beam loss. This is very likely to 
be the case for third order resonances in rings, where strong sextupole magnets are 
employed to correct for chromatic aberrations. 

The beneficial effect of a high super-periodicity or symmetry N in a circular 
accelerator becomes apparent in such a resonance diagram because the density 
of resonance lines is reduced by the factor N and the area of stability between 
resonances to operate the accelerator becomes proportionately larger. In Fig. 16.2, 
the resonance diagram for a ring with super-periodicity four (V = 4) is shown and 
the reduced number of resonances is obvious. Wherever possible a high symmetry in 
the design of a circular accelerator should be attempted. Conversely, breaking a high 
order of symmetry can lead to a reduction in stability if not otherwise compensated. 
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Fig. 16.2 Resonance 4n+4 
diagram for a ring with 
superperiodicity four, V = 4 1/2 
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16.2 Hamiltonian Resonance Theory* 


In the realm of Hamiltonian resonance theory we will be able to derive not only 
obvious resonant behavior but also resonant dynamics which does not necessarily 
lead to a loss of the beam but to a significant change of beam parameters. We 
also will be able to determine the strength of resonances, effectiveness, escape 
mechanisms and more. 


16.2.1 Non-linear Hamiltonian 


While simple Fourier expansions of perturbations around a circular accelerator 
allow us to derive the locations of lattice resonances in the tune diagram, we 
can obtain much deeper insight into the characteristics of resonances through the 
application of the Hamiltonian theory of linear and nonlinear oscillators. Soon after 
the discovery of strong focusing, particle dynamicists noticed the importance of 
perturbations with respect to beam stability and the possibility of beam instability 
even in the presence of very strong focusing. 

Extensive simulations and development of theories were pursued in an effort to 
understand beam stability in high-energy proton synchrotrons then being designed 
at the Brookhaven National Laboratory and CERN. The first Hamiltonian theory 
of linear and non linear perturbations has been published by Schoch [1] which 
includes also references to early attempts to solve perturbation problems. A modern, 
consistent and complete theory of all resonances has been developed, for example, 


548 16 Resonances 


by Guignard [2]. In this text, we will concentrate on main features of resonance 
theory and point the interested reader for more details to these references. 

Multipole perturbations have been discussed as the source of resonances and 
we will discuss in this chapter the Hamiltonian resonance theory. The equation of 
motion under the influence of an nth-order perturbation is in normalized coordinates 
and in the horizontal plane without coupling (see Table 16.1) 


iv + vow = qn(y) Ww," (16.27) 


which can be also derived from the nonlinear Hamiltonian 


w= 192 i Sos = AN 5, 
wa gw + svow +Ge)(>) Ww’. (16.28) 


Here we introduced 


1 —n/2 
An(Y) = —n(Y) - (2) (16.29) 
n 


2 
for future convenience. 

To discuss resonance phenomena it is useful to perform a canonical transfor- 
mation from the coordinates (w, w) to action-angle variables (J, Ww) which can be 
derived from the generating function (5.52) and the new Hamiltonian expressed in 
action-angle variables is 


H = vol +9, (y)J"” cos” ( — 2). (16.30) 


The action-angle variables take on the role of an “energy” and frequency of the 
oscillatory system. Due to the phase dependent perturbation p, (¢y) the oscillation 
amplitude J is no more a constant of motion and the circular motion in phase space 
becomes distorted as shown in Fig. 16.3 for a sextupolar perturbation. The oscillator 
frequency yy = dW/dgy = v is similarly perturbed and can be derived from the 
second Hamiltonian equation of motion 


dH. 
a” W = vo + 4G,(g)J"/?7! cos" (Ww — 8). (16.31) 


Perturbation terms seem to modify the oscillator frequency vo but because of the 
oscillatory trigonometric factor it is not obvious if there is a net shift or spread in 
the tune. We therefore expand the perturbation g,,(¢) as well as the trigonometric 
factor cos” y to determine its spectral content. The distribution of the multipole 
perturbations in a circular accelerator is periodic with a periodicity equal to the 
length of a superperiod or of the whole ring circumference and we are therefore 
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Fig. 16.3, Nonlinear 
perturbation of phase-space “| 
motion 
J 
WV 

able to expand the perturbation g,, (gy) into a Fourier series 

= TN 

ane) => ane, (16.32) 

l 


where WN is the super-periodicity of the circular accelerator. We also expand the 
trigonometric factor in (16.31) into exponential functions, while dropping the 
arbitrary phase ? 


cosy = \ > cae” (16.33) 


|m|<n 


and get 


Gn (g) cos” wy = > Qni ei Chm ei 
l 


|m|<n 
= > CnmQnl eee) (16.34) 
1 
|m|<n 
_ Cn0Gn0 + > 2Cnmnl cos(my _ IN@) : 
I>0 
O0<m<n 


In the last equation, the perturbation g,,(@) is expanded about a symmetry point 
merely to simplify the expressions of resonant terms. For asymmetric lattices the 
derivation is similar but includes extra terms. We have also separated the non- 
oscillatory term C,oPn0 from the oscillating terms to distinguish between systematic 
frequency shifts and mere periodic variations of the tune. The Hamiltonian (16.30) 
now becomes with (16.34) 


H = vod + Cn Guo”? + J"? SY Wm Gn Cos(my — INQ) . (16.35) 


I>0 
O0<m<n 
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The third term on the r.h.s. consists mostly of fast oscillating terms which in this 
approximation do not lead to any specific consequences. For the moment we will 
ignore these terms and remember to come back later in this chapter. The shift of the 
oscillator frequency due to the lowest-order perturbation becomes obvious and may 
be written as 


oH ee Moe n = n/2-1 : 
a Y= Vo + 5€n0 Gno J + oscillatory terms. (16.36) 

Since c,o 4 O for even values of n only, we find earlier results confirmed, where 
we observed the appearance of amplitude-dependent tune shifts and tune spreads for 
even-order perturbations. Specifically we notice, that there is a coherent amplitude 
independent tune shift for all particles within a beam in case of a gradient field 
perturbation with n = 2 and an amplitude dependent tune spread within a finite 
beam size for all other higher- and even-order multipole perturbations. 

To recapitulate the canonical transformation of the normalized variables to 
action-angle variables has indeed eliminated the angle coordinate as long as we 
neglect oscillatory terms. The angle variable therefore is in this approximation a 
cyclic variable and the Hamiltonian formalism tells us that the conjugate variable, in 
this case the amplitude J is a constant of motion or an invariant. This is an important 
result which we obtained by simple application of the Hamiltonian formalism 
confirming our earlier expectation to isolate constants of motion. 

This has not been possible in a rigorous way since we had to obtain approximate 
invariants by neglecting summarily all oscillatory terms. In certain circumstances 
this approximation can lead to totally wrong results. To isolate these circumstances 
we pursue further canonical transformations to truly separate from the oscillating 
terms all non-oscillating terms of order n/2 while the rest of the oscillating terms 
are transformed to a higher order in the amplitude J. 


16.2.2. Resonant Terms 


Neglecting oscillating terms is justified only in such cases where these terms 
oscillate rapidly. Upon closer inspection of the arguments in the trigonometric 
functions we notice however that for each value of m in (16.35) there exists a value 
1 which causes the phase 


mW, & my — INp (16.37) 


to vary only slowly possibly leading to a resonance. The condition for the occurrence 
of such a resonance is 7, ~ 0 or with W & vog 


m,Vo & rN, (16.38) 
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where we have set / = r to identify the index for which the resonance condi- 
tion (16.38) is met. The index m, is the order of the resonance and can be any 
integer | < m, <n. 

The effects of resonances do not only appear when the resonance condition 
is exactly fulfilled for wy, = 0. Significant changes in the particle motion can 
be observed when the particle oscillation frequency approaches the resonance 
condition. We therefore keep all terms which vary slowly compared to the betatron 
frequency WV. 

After isolating resonant terms we may now neglect all remaining fast oscillating 
terms with m # m,. Later we will show that these terms can be transformed to 
higher order and are therefore of no consequence to the order of approximation of 
interest. Keeping only resonant terms defined by (16.38) we get from (16.35) the 
nth-order Hamiltonian in normalized coordinates 


H = vol + Cn0Gnod”!? + J"? SY cum, Gar COS(M, Wr). (16.39) 


7 
O<m,<n 


The value of m, indicates the order of the resonance and we note that the 
maximum order of resonance driven by a multipole of order n is not higher than n. 
A dipole field therefore can drive only an integer resonance, a quadrupole field up to 
a half-integer resonance, a sextupole up to a third-order resonance, an octupole up 
to a quarter resonance and so forth although not all allowed resonances become real. 
We know for example already that a sextupole does not drive a tune shift or a quarter 
integer resonance in the approximation used so far. As we have noticed before, 
whenever we derive mathematical results we should keep in mind that such results 
are valid only within the approximation under consideration. It is, for example, 
known [3] that sextupoles can also drive quarter integer resonances through higher- 
order terms. In nonlinear particle beam dynamics any statement about stability or 
instability must be accompanied by a statement defining the order of approximation 
made to allow independent judgement for the validity of a result to a particular 
problem. 

The interpretation of the Hamiltonian (16.39) becomes greatly simplified after 
another canonical transformation to eliminate the appearance of the independent 
variable ». We thereby transform to a coordinate system that moves with the 
reference particle, thus eliminating the linear motion that we already know. The 
new coordinates rotate once per revolution and thereby eliminate the linear rotation 
in phase space that we know already. This can be achieved by a canonical similarity 
transformation from the coordinates (J, Ww) to (Jj, W) which we derive from the 
generating function 


mM, 


G=J (v = mt) j (16.40) 
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From this we get the relations between the old and new coordinates 


0G, rN 
J My, 


and 


dG, 
—=J=jJ,. 16.42 
Dw J=J (16.42) 


The quantity w% now describes the phase deviation of a particle from that of 
the reference particle. Since the generating function depends on the independent 
variable y we get for the new Hamiltonian H; = H + 0G, /d0@ or 


rN A — rn 
= ( - *) Ji + Cn0 Gyo Ft? + Came Gn cos(mW, + rN), (16.43) 


r 


where we have retained for simplicity only the highest-order resonant term 
(m, =n). With w = (dw/dg) = v and (16.38) a resonance condition occurs 
whenever 


vo % — = pv, (16.44) 


Setting Av, = vo — v; for the distance of the tune vp from the resonance tune v, the 
Hamiltonian becomes with all perturbation terms 


H= Av. Si+ YC Gn0J{ + YF? SY 2cum Fy CO8(™, Wr). (16.45) 
“ ae o<m,<n 

The coefficients c, 9 are defined by (16.33) and the harmonic amplitudes of 
the perturbations are defined by the Fourier expansion (16.32). The resonance 
order r and integer m, depend on the ring tune and are selected such that (16.44) 
is approximately met. A selection of most common multipole perturbations are 
compiled in Table 16.1 and picking an nth-order term we get from (16.29) the 
expression for Gn,. 

In the course of the mathematical derivation we started out in (16.28) with only 
one multipole perturbation of order n. For reasons of generality, however, all orders 
of perturbation n have been included again in (16.45). We will, however, not deal 
with the complexity of this multi-resonance Hamiltonian nor do we need to in 
order to investigate the character of individual resonances. Whatever the number 
of actual resonances may be in a real system the effects are superpositions of 
individual resonances. We will therefore investigate in more detail single resonances 
and discuss superpositions of more than one resonance later in this chapter. 
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16.2.3 Resonance Patterns and Stop-Band Width 


Equation (16.45) can be used to calculate the stop band width of resonances and 
to explore resonance patterns which are a superposition of particle trajectories 
H =const in (W,J1) phase-space. Depending on the nature of the problem 
under study, we may use selective terms from both sums in (16.45). Specifically 
to illustrate characteristic properties of resonances, we will use from the first sum 
the term c49G49, which originates from an octupole field. This is the lowest order 
term that provides some beam stability as we will see. From the second sum we 
choose a single nth-order term driving the rth-order resonance and get the simplified 
Hamiltonian 


Hy = AvyJi + c49 Gag Ji + 2Cam,Inr Jr? cos(m,W1) = const. (16.46) 


To further simplify the writing of equations and the discussion of results we 
divide (16.46) by 2¢nm,Qnr re where the amplitude Jj is an arbitrary reference 
amplitude of a particle at the starting point. Defining an amplitude ratio or beat 
factor 


J 
R=, (16.47) 
Jio 
and considering only resonances of order m, ~ n, (16.46) becomes 
RA+R?2 +R" cos nw, = const (16.48) 
where the detuning from the resonance is 
Ap, 
2Cam, nr J, 0 
and the tune-spread parameter 
Q= Sto _ (16.50) 


~~ = n/[2—2 
2Cnm, nr J19 


This expression has been derived first by Schoch [1] for particle beam dynamics. 
Because the ratio R describes the variation of the oscillation amplitude in units of 
the starting amplitude Jp we call the quantity R the beat factor of the oscillation. 

Before we discuss stop-bands and resonance patterns we make some general 
observations concerning particle stability. The stability of particle motion in the 
vicinity of resonances depends strongly on the distance of the tune from the nearest 
nth-order resonance and on the tune-spread parameter S2. When both parameters A 
and 2 vanish we have no stability for any finite oscillation amplitude, since (16.48) 
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can be solved for all values of w; only if R = 0. For a finite tune-spread parameter 
2 # 0 while A = 0 (16.48) becomes R? (2 +R”? cosny1) = const and 
resonances of order n > 4 exhibit some range of stability for amplitudes R’/?~? < 
|§2|. Oscillations in the vicinity of, for example, a quarter resonance are all stable for 
|§2| > 1 and all unstable for smaller values of the tune-spread parameter |{2| < 1. 
A finite tune-spread parameter §2 appears in this case due to an octupolar field and 
has a stabilizing effect at least for small amplitudes. 

For very small oscillation amplitudes (R — 0) the oscillating term in (16.48) 
becomes negligible for n > 4 compared to the detuning term and the particle 
trajectory approaches the form of a circle with radius R. This well behaved character 
of particle motion at small amplitudes becomes distorted for resonances of order 
n = 2 andn = 3 in case of small detuning and a finite tune spread parameter. We 
consider A = 0 and have 


R22 + R"” cos ny = const, (16.51) 


where n = 2 or n = 3. For very small amplitudes the quadratic term is 
negligible and the dominant oscillating term alone is unstable. The amplitude for 
a particle starting at R ~ 0 and w, = O grows to large amplitudes as yw 
increases, reaching values which make the quadratic tune-spread term dominant 
before the trigonometric term becomes negative. The resulting trajectory in phase 
space becomes a figure of eight for the half-integer resonance as shown in Fig. 16.4. 

In the case of a third-order resonance small-amplitude oscillations behave 
similarly and follow the outline of a clover leave as shown in Fig. 16.5. 
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Fig. 16.4 (R, W) phase-space motion for a half-integer resonance. Top row from left to right: 
Q = Oand A = (—5,—2,—1.1, 0, 2) ; bottom row: 2 = 1 and A = (—5,—2, —1.1, 0, 2) 
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A= 0, Q=1 A= 3, Q=1 


Fig. 16.5 (R, wW) phase-space motion for a third-order resonance. Top row from left to right: 
Q =0, A = (—6, —2, 0, 3); bottom row: 2 = 1, A = (—6, —2, 0, 3) 


16.2.4 Half-Integer Stop-Band 


A more detailed discussion of (16.45) will reveal that instability due to resonances 
does not only happen exactly at resonant tunes. Particle oscillations become unstable 
within a finite vicinity of resonance lines in the resonance diagram and such areas 
of instability are known as stop-bands. The most simple case occurs for 2 = 0 and 
a half-integer resonance, where n = 2 and 


R(A + cos2w)) = const. (16.52) 


For this equation to be true for all values of the angle variable y% we require that 
the quantity in the brackets does not change sign while y varies from 0 to 2 1. This 
condition cannot be met if |A| < 1. To quantify this we observe a particle starting 
with an amplitude J = Jo at yw, = 0 and (16.52) becomes 


RA+Rcos2w, =A+1. (16.53) 
Now we calculate the variation of the oscillation amplitude R as the angle 


variable yy, increases. The beat factor R reaches its maximum value at 2y,; = a 
and is 


> 0. (16.54) 
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The variation of the amplitude R is finite as long as A > 1. If A < 0, we geta 
similar stability condition 


|A| -1 


Rnax = 
|A| +1 


>0 (16.55) 


and stability occurs for A < —1. The complete resonance stability criterion for the 
half-integer resonance is therefore 


|A| > 1. (16.56) 


Beam instability due to a half-integer resonance (n = 2) occurs within a finite 
vicinity Av, = +2c2,q>, as defined by (16.49) and the total stop-band width for 
a half-integer resonance becomes 


Avi), = £2com, Gar (16.57) 


The width of the stop-band increases with the strength of the perturbation 
but does not depend on the oscillation amplitude Jo. However, for higher-order 
resonances the stop band width does depend on the oscillation amplitudes as will 
be discussed later. 

To observe the particle trajectories in phase space, we calculate the contour lines 
for (16.48) setting n = 2 and obtain patterns as shown in Fig. 16.4. Here the particle 
trajectories are plotted in the (y, /) phase space for a variety of detuning parameters 
A and tune-spread parameters (2. Such diagrams are called resonance patterns. 
The first row of Fig. 16.4 shows particle trajectories for the case of a half-integer 
resonance with a vanishing tune-spread parameter {2 = 0. As the detuning A is 
increased we observe a deformation of particle trajectories but no appearance of 
a stable island as long as |A| < 1. Although we show mostly resonance patterns 
for negative values of the detuning A < 0 the patterns look exactly the same for 
A > 0 except that they are rotated by 90°. For |A| > 1 the unstable trajectories 
part vertically from the origin and allow the appearance of a stable island that grows 
as the detuning grows. In the second row of resonance patterns, we have included 
a finite tune-spread parameter of £2 = 1 which leads to a stabilization of all large 
amplitude particle trajectories. Only for small amplitudes do we still recognize the 
irregularity of a figure of eight trajectory as mentioned above. 


16.2.5  Separatrices 


The appearance of island structures as noticeable from the resonance patterns is a 
common phenomenon and is due to tune-spread terms of even order like that of an 
octupole field. In Fig. 16.6 common features of resonance patterns are shown and 
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Fig. 16.6 Common features of resonance patterns 


we note specifically the existence of a central stable part and islands surrounding 
the central part. 

The boundaries of the areas of stable motion towards the islands are called 
separatrices. These separatrices also separate the area of stable motion from that 
for unstable motion. The crossing points of these separatrices, as well as the center 
of the islands, are called fixed points of the dynamic system and are defined by the 
conditions 


OH, _ 0 oH, 
av Od) 
Application of these conditions to (16.46) defines the location of the fixed points 


and we find from the first equation (16.58) the azimuthal positions 7, = wr of the 
fixed points from 


=0. (16.58) 


sin (m,Wir) = 0 (16.59) 
or 
mrWiek = kr, (16.60) 


where k is an integer number in the range 0 < k < 2m,. From the second 
equation (16.58) we get an expression for the radial location of the fixed points Jr, 


Av, + 2esoqaoie + 22Com Fn \cos(ak) = 0. (16.61) 


There are in principle 2m, separate fixed points in each resonance diagram. 
Closer inspections shows that alternately every second fixed point is a stable fixed 
point or an unstable fixed point, respectively. The unstable fixed points coincide with 
the crossing points of separatrices and exist even in the absence of octupole terms. 
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Fig. 16.7 Fourth-order resonance patterns. From left to right: 2 = 0,(A < 0,A =0,A > 0) 


Stable fixed points define the center of stable islands and, except for the primary 
stable fixed point at the origin of the phase diagram, exist only in the presence of a 
tune spread caused by octupole like terms Cy Gyo J"? in (16.43), which contribute 
to beam stability. Trajectories that were unstable without the octupole term become 
closed trajectories within an island area centered at stable fixed points. This island 
structure is characteristic for resonances since the degree of symmetry is equal to 
the order of the resonance (see Fig. 16.7). 


16.2.6 General Stop-Band Width 


From the discussion of the half-integer resonance, it became apparent that certain 
conditions must be met to obtain stability for particle motion. Specifically we expect 
instability in the vicinity of resonances and we will try to determine quantitatively 
the area of instability or stop-band width for general resonances. Similar to (16.53) 
we look for stable solutions from 


RA+R"’ cosny, = A+1, (16.62) 


which describes a particle starting with an amplitude R = 1. Equation (16.62) must 
be true along all points of the trajectory and for reasons of symmetry the particle 
oscillation amplitude approaches again the starting amplitude for wy; = O as yw > 
2x/n. Solving for A we get real solutions for R only if 


i RY2—] ; 
AT >- Rol = —5n for Rw, (16.63) 
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where the index + indicates the sign to be used on the r.h.s. of (16.62). Similarly, 
following a particle starting with R = 1 at Ww; = az/n to Ww, = 3x/n we get the 
condition 


A” < 5n. (16.64) 


NIA 


The total nth-order stop-band width is therefore with (16.49) 
Ave =tlena| se (16.65) 


indicating that stable particle motion is possible only for tunes outside this stop- 
band. The stop-band width of nonlinear resonances (n > 2) is strongly amplitude 
dependent and special effort must be exercised to minimize higher-order pertur- 
bations. Where higher-order magnetic fields cannot be eliminated it is prudent to 
minimize the value of the betatron functions at those locations. 

Where higher-order magnetic fields cannot be eliminated it is prudent to min- 
imize the value of the betatron functions at those locations. Such a case occurs, 
for example, in colliding-beam storage rings, where the strongly nonlinear field of 
one beam perturbs the trajectories of particles in the other beam. This effect is well 
known as the beam-beam effect. 

Through a series of canonical transformations and redefinitions of parameters 
we seem to have parted significantly from convenient laboratory variables and 
parameters. We will therefore convert (16.65) back to variables we are accustomed 
to use. We set / = r and m, = n where r © “vo and tacitly ignored lower-order 


N 
resonances m, <n. From (16.32) we find the Fourier components 


i 20 7 
Inr = =| a (pye "dg , (16.66) 
0 


and from (16.33) we have Cyn = x The amplitude factor - nel replaced 
by (8.95) which becomes with (5.54a), (5.54b) and w; = 0 


2 
Xo 


B° 


Jo = 4v0ws = $¥0 (16.67) 


—n/2 


Finally, we recall the definition (16.29) gn(g) = —49, (v) (2) 
nth-order stop-band width 


and get for the 


wi 2, 
(n) 
Avvo = Ftp (16.68) 


2n 
| i Gn(ye’do| , 


where g,, is the nth-order perturbation from Table 16.1. This result is general and 
includes our earlier finding for the half-integer resonance. For resonances of order 
n > 2 the stop-band width increases with amplitude limiting the stability of particle 
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Fig. 16.8 Stop-band width as a function of the amplitude Jo for resonances of order n = 2,3,4,5 
and detuning parameter 22 = 0 
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Fig. 16.9 Fifth-order resonance patterns. From left to right: 2 = 0, (A <0, A = 0, A > 0) 


beams to the vicinity of the axis (Fig. 16.8). The introduction of sufficiently strong 
octupole terms can lead to a stabilization of resonances and we found, for example, 
that the quarter resonance is completely stabilized if 2 > 1. For resonances of 
order n > 4, however, the term R”/? cos ny; becomes dominant for large values of 
the amplitude and resonance therefore cannot be avoided. 

Figure 16.9 shows, for example, a stable area for small amplitudes at the fifth- 
order resonance, as we would expect, but at larger amplitudes the motion becomes 
unstable. 


16.3. Third-Order Resonance 


The third-order resonance plays a special role in accelerator physics and we will 
therefore discuss this resonance in more detail. The special role is generated by 
the need to use sextupoles for chromaticity correction. While such magnets are 
beneficial in one respect, they may introduce third-order resonances that need to 
be avoided or at least kept under control. Sometimes the properties of a third-order 
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resonance are also used constructively to eject particles at the end of a synchrotron 
acceleration cycle slowly over many turns. 

In the absence of octupole fields the Hamiltonian for the third-order resonance is 
from (16.46) form = 3 and gay = 0 


Hy = Avip3J + Ga,J;! cos 3. (16.69) 


We expand cos 3w,=cos* 1 — 3cosWi sin? y and return to normalized coordi- 
nates 


fos 
w=,/—coswi, and w= /2voJ; sin yn. (16.70) 
Vo 


In these coordinates the Hamiltonian reveals the boundaries of the stable 
region from the unstable resonant region. Introducing the normalized coordinates 
into (16.69), we get the Hamiltonian 


“2 3/2 “9 
= MO oe eee Se a 
Hy = Avy3 5 (» + =) + G3, 53/2 (» sw") : (16.71) 


— _ 3/2 ; 
Dividing by q;, (2) ”? and subtracting a constant term 4W,’, where 


wa (16.72) 
0o=rz- ‘5 e 

3 d3yr-N 2Vo 
the Hamiltonian assumes a convenient form to exhibit the boundaries between the 
stable and unstable area 


~ w? Ww 
H, = 3Wo (w + =) + G - 30) — tw 
0 


II 


(w—4W) (w- v3= + wo) (w+ Vi~ +). (16.73) 
0 0 


This Hamiltonian has three linear solutions for H; = 0 defining the separatrices. 
The resonance plot for (16.73) is shown in Fig. 16.10 where we have assumed that 
Wo is positive. For a given distribution of the sextupoles q3, the resonance pattern 
rotates by 180° while moving the tune from one side of the resonance to the other. 
Clearly, there is a stable central part bounded by separatrices. The area of the central 
part depends on the strength and distribution of the sextupole fields summarized by 
43, and the distance Av,/3 of the tune from the third-order resonance. 


562 16 Resonances 


dw/dt 


Se SS 


——__] stable area 


ST 
BE 


——— 


—| separat 
—— —_—_—- 


Fig. 16.10 Third-order resonance 


We 


The higher-order field perturbation qg3, depends on the distribution of the 
sextupoles around the circular accelerator. In the horizontal plane 


1 
Ga(y) = — vB? 5m. (16.74) 
or with (16.29) 
q3(~) = 5 V 2v0B°?m. (16.75) 


The Fourier components of this perturbation are given by 


20 


ar qx(y)e""? dp (16.76) 


oe 


and the perturbation term becomes finally with m, = 3 and c33 = 7 from (16.33) 


2. 20 . 
ae i B52 mei? dy (16.77) 
247 0 


where g = i Ss m = m(@) is the sextupole distribution and 6B = 8 (g) 
the horizontal betatron function. From this expression, it becomes clear that 
the perturbation and with it the stable area in phase space depends greatly on 
the distribution of the sextupoles around the ring. Minimizing the rth Fourier 
component obviously benefits beam stability. 
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16.3.1 Particle Motion in Phase Space 


It is interesting to study particle motion close to a resonance in some more detail 
by deriving the equations of motion from the Hamiltonian (16.69). The phase 
variation is 


~ = am = Av3 + 3%, J{/7cos 3, « (16.78) 

Now, we follow a particle as it orbits the ring and observe its coordinates every 
time it passes by the point with phase @p or Wo, which we assume for convenience 
to be zero. Actually, we observe the particle only every third turn, since we are not 
interested in the rotation of the resonance pattern in phase space by 120° every turn. 

For small amplitudes the first term is dominant and we note that the particles 
move in phase space clockwise or counter clockwise depending on Av1/3 being 
negative or positive, respectively. The motion becomes more complicated in the 
vicinity and outside the separatrices, where the second term is dominant. For a 
particle starting at w, = O the phase yincreases or decreases from turn to turn 
and asymptotically approaches y, = +30° depending on the perturbation 3, being 
positive or negative, respectively. The particles therefore move clockwise or counter 
clockwise and the direction of this motion is reversed, whenever we move into an 
adjacent area separated by separatrices because the trigonometric term has changed 
sign. 

To determine exactly the position of a particle after 3g turns we have with (q) = 
3q +2 V9 


Wi(q) = 2m (39 — rN) q (16.79) 


With this phase expression we derive the associated amplitude J,, from the 
Hamiltonian (16.69) and may plot the particle positions for successive triple turns 
3q = 0,3,6,9,... in a figure similar to Fig. 16.10. The change in the oscillation 
amplitude is from the second Hamiltonian equation of motion 


oH, OJ 3/9. 

— = —-— = —393,J,'" sin3 16.80 

Be ag ei sin3vy, (16.80) 
and is very small in the vicinity of w; ~ O or even multiples of 30° (for w > 
separatrix in Fig. 16.10). For y being equal to odd multiples of 30°, on the other 
hand, the oscillation amplitude changes rapidly as shown in Fig. 16.10 on the left 
side beyond the crossing point of the separatrixes or beyond the unstable point. 
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Problems 


16.1 (S). Consider a simple optimized FODO lattice forming a circular ring. 
Calculate the natural chromaticity (ignore focusing in bending magnets) and correct 
the chromaticities to zero by placing thin sextupoles in the center of the quadrupoles. 
Calculate and plot the horizontal third-order stop-band width as a function of the 
horizontal tune. 


16.2 (S). Show that in (16.33) the coefficients c,9 are non-zero only for even values 
of n. 


16.3 (S). Show that in (16.33) the coefficients Cy, = = 


16.4. Plot a resonance diagram up to fourth order for the PEP lattice with tunes 
vy = 21.28 and v, = 18.16 and a super-periodicity of N = 6 or any other 
circular accelerator lattice with multiple super-periodicity. Choose the parameters 
of the diagram such that a resonance environment for the above tunes of at least +3 
(+ half the number of superperiods) integers is covered. 


16.5. Choose numerical values for parameters of a single multipole in the Hamilto- 
nian (16.45) and plot a resonance diagram H (J, w) =const. Determine the stability 
limit for your choice of parameters. What would the tolerance on the multipole field 
perturbation be if you require a stability for an emittance as large as € = 100 mm- 
mrad? 


16.6. Take the lattice of Problem 16.1 and adjust its tune to the third-order 
resonance so that the unstable fixed point on the symmetry axis are 5cm from 
the beam center. Determine the equations for the separatrices. Choose a point P 
just outside the stable area and close to the crossing of two separatrices along the 
symmetry axis. Where in the diagram would a particle starting at P be after 3, 6, and 
9 turns? At what amplitude could you place a 5 mm thin septum magnet to eject the 
beam from the accelerator? 
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Chapter 17 
Hamiltonian Nonlinear Beam Dynamics* 


Deviations from linear beam dynamics in the form of perturbations and aberrations 
play an important role in accelerator physics. Beam parameters, quality and stability 
are determined by our ability to correct and control such perturbations. Hamiltonian 
formulation of nonlinear beam dynamics allows us to study, understand and quantify 
the effects of geometric and chromatic aberrations in higher order than discussed so 
far. Based on this understanding we may develop correction mechanisms to achieve 
more and more sophisticated beam performance. We will first discuss higher-order 
beam dynamics as an extension to the linear matrix formulation followed by specific 
discussions on aberrations. Finally, we develop the Hamiltonian perturbation theory 
for particle beam dynamics in accelerator systems. 


17.1. Higher-Order Beam Dynamics 


Chromatic and geometric aberrations appear specifically in strong focusing trans- 
port systems designed to preserve carefully prepared beam characteristics. As a 
consequence of correcting chromatic aberrations by sextupole magnets, nonlinear 
geometric aberrations are introduced. The effects of both types of aberrations on 
beam stability must be discussed in some detail. Based on quantitative expressions 
for aberrations, we will be able to determine criteria for stability of a particle beam. 


17.1.1 Mutltipole Errors 


The general equations of motion (6.95), (6.96) exhibit an abundance of driving 
terms which depend on second or higher-order transverse particle coordinates 
(x,x’, y,’) or linear and higher-order momentum errors 5. Magnet alignment and 
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field errors add another multiplicity to these perturbation terms. Although the 
designers of accelerator lattices and beam guidance magnets take great care to 
minimize undesired field components and avoid focusing systems that can lead to 
large transverse particle deviations from the reference orbit, we cannot completely 
ignore such perturbation terms. 

In previous sections we have discussed the effect of some of these terms and have 
derived basic beam dynamics features as the dispersion function, orbit distortions, 
chromaticity and tune shifts as a consequence of particle momentum errors or 
magnet alignment and field errors. More general tools are required to determine the 
effect of any arbitrary driving term on the particle trajectories. In developing such 
tools we will assume a careful design of the accelerator under study in layout and 
components so that the driving terms on the r.h.s. of (6.95), (6.96) can be treated 
truly as perturbations. This may not be appropriate in all circumstances in which 
cases numerical methods need to be applied. For the vast majority of accelerator 
physics applications it is, however, appropriate to treat these higher-order terms as 
perturbations. 

This assumption simplifies greatly the mathematical complexity. Foremost, we 
can still assume that the general equations of motion are linear differential equations. 
We may therefore continue to treat every perturbation term separately as we have 
done so before and use the unperturbed solutions for the amplitude factors in the 
perturbation terms. The perturbations are reduced to functions of the location z along 
the beam line and the relative momentum error 6 only and such differential equations 
can be solved analytically as we will see. Summing all solutions for the individual 
perturbations finally leads to the composite solution of the equation of motion in the 
approximation of small errors. 

The differential equations of motion (6.95), (6.96) can be expressed in a short 
form by 


w+ K(2) Pe > Pise® xl xl? yr y/P 6 ; (17.1) 
L,v,0,p,T=0 


where u = x or u = y and the quantities pyyopr(z) represent the coefficients of 
perturbation terms. The same form of equation can be used for the vertical plane but 
we will restrict the discussion to only one plane neglecting coupling effects. 

Some of the perturbation terms pj pr can be related to aberrations known from 
geometrical light optics. Linear particle beam dynamics and Gaussian geometric 
light optics works only for paraxial beams where the light rays or particle trajectories 
are close to the optical axis or reference path. Large deviations in amplitude, as well 
as fast variations of amplitudes or large slopes, create aberrations in the imaging 
process leading to distortions of the image known as spherical aberrations, coma, 
distortions, curvature and astigmatism. While corrections of such aberrations are 
desired, the means to achieve corrections in particle beam dynamics are different 
from those used in light optics. Much of the theory of particle beam dynamics is 
devoted to diagnose the effects of aberrations on particle beams and to develop and 
apply such corrections. 
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The transverse amplitude x can be separated into its components which under the 
assumptions made are independent from each other 


X= Xp + Xo + X35 + toon F (17.2) 


The first three components of solution (17.2) have been derived earlier and are 
associated with specific lowest order perturbation terms: 

Xg(z) is the betatron oscillation amplitude and general solution of the homoge- 
neous differential equation of motion with vanishing perturbations pyyopr = O for 
all indices. 

X¢(z) is the orbit distortion and is a special solution caused by amplitude and 
momentum independent perturbation terms like dipole field errors or displacements 
of quadrupoles or higher multipoles causing a dipole-field error. The relevant 
perturbations are characterized by u = v = 0 = p = tT = O but otherwise 
arbitrary values for the perturbation pooo99. Note that in the limit poooo9 —> 0 we get 
the ideal reference path or reference orbit x,.(z) = 0. 

Xs(z) is the chromatic equilibrium orbit for particles with an energy different 
from the ideal reference energy, 6 4 0, and differs from the reference orbit with or 
without distortion x,(z) by the amount xs(z) which is proportional to the dispersion 
function n(z) and the relative momentum deviation 6, .xs(z) = 7 (z) 6. In this case 
fb=v=o=p=Oandrt=1. 

All other solutions x,¢9r are related to remaining higher-order perturbations. 
The perturbation term pjo990, for example, acts just like a quadrupole and may be 
nothing else but a quadrupole field error causing a tune shift and a variation in the 
betatron oscillations. Other terms, like pooj99 can be correlated with linear coupling 
or with chromaticity if pio901 4 0. Sextupole terms p2oo00 are used to compensate 
chromaticities, in which case the amplitude factor x” is expressed by the betatron 
motion and chromatic displacement 


Paar (xg + x3)? = (xp + 6)? => 2nxg8. (17.3) 


The xq-term, which we neglected while compensating the chromaticity, is the 
source for geometric aberrations due to sextupolar fields becoming strong for large 
oscillation amplitudes and the 7°5?-term contributes to higher-order solution of 
the n-function. We seem to make arbitrary choices about which perturbations to 
include in the analysis. Generally therefore only such perturbations are included in 
the discussion which are most appropriate to the problem to be investigated and 
solved. If, for example, we are only interested in the orbit distortion x., we ignore 
in lowest order of approximation the betatron oscillation xg and all chromatic and 
higher-order terms. Should, however, chromatic variations of the orbit be of interest 
one would evaluate the corresponding component separately. On the other hand, if 
we want to calculate the chromatic variation of betatron oscillations, we need to 
include the betatron oscillation amplitudes as well as the off momentum orbit xs. 
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In treating higher-order perturbations we make an effort to include all perturba- 
tions that contribute to a specific aberration to be studied or to define the order of 
approximation used if higher-order terms are to be ignored. A careful inspection 
of all perturbation terms close to the order of approximation desired is prudent 
to ensure that no significant term is missed. Conversely such an inspection might 
very well reveal correction possibilities. An example is the effect of chromaticity 
which is generated by quadrupole field errors for off momentum particles but can be 
compensated by sextupole fields at locations where the dispersion function is finite. 
Here the problem is corrected by fields of a different order from those causing the 
chromaticity. 

To become more quantitative we discuss the analytical solution of (17.1). Since in 
our approximation this solution is the sum of all partial solutions for each individual 
perturbation term, the problem is solved if we find a general solution for an arbitrary 
perturbation. The solution of, for example, the horizontal equation of motion 


x" + K(2)x = Pyvopr xxl? y? y/P Bt (17.4) 


can proceed in two steps. First we replace the oscillation amplitudes on the r.h.s. by 
their most significant components 


xH—> (xg + x9 + x5)¥, x? > (Xp 4X Ee)” 


17.5 

y—> (yp t+ yo + ¥8)", VP Og + +95)? - a 

As discussed before, in a particular situation only those components are even- 

tually retained that are significant to the problem. Since most accelerators are 

constructed in the horizontal plane we may set the vertical dispersion ys = 0. 

The decomposition (17.5) is inserted into the r.h.s of (17.4) and again only terms 

significant for the particular problem and to the approximation desired are retained. 

The solution X,,,¢)r can be further broken down into components each relating to 

only one individual perturbation term. Whatever number of perturbation terms we 
decide to keep, the basic differential equation for the perturbation is of the form 


P! + K(2)P = p&p, Xp, Xe Xe X8sX$, VBr Vp» Vor Vor V5. V5x 5s) » (17.6) 


for which we have discussed the solution in Sect. 5.5.4. Following these steps we 
may calculate, at least in principle, the perturbations P(z) for any arbitrary higher- 
order driving term p(z). In praxis, however, even principal solutions of particle 
trajectories in composite beam transport systems can be expressed only in terms 
of the betatron functions. Since the betatron functions cannot be expressed in a 
convenient analytical form, we are unable to obtain an analytical solution and must 
therefore employ numerical methods. 
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17.1.2. Non-linear Matrix Formalism 


In linear beam dynamics this difficulty has been circumvented by the introduction of 
transformation matrices, a principle which can be used also for beam transport sys- 
tems including higher-order perturbation terms. This non-linear matrix formalism 
was developed by Karl Brown [1-3] and we follow his reasoning in the discussion 
here. The solution to (17.1) can be expressed in terms of initial conditions. Similar 
to discussions in the context of linear beam dynamics we solve (17.6) for individual 
lattice elements only where K(z) = const. In this case (5.75) can be solved for any 
piecewise constant perturbation along a beam line. Each solution depends on initial 
conditions at the beginning of the magnetic element and the total solution can be 
expressed in the form 


/ 2 if 
X(Z) = C110 X0 + €120 Xp + €130 60 + C111 X + C112 X0X +..--s an 
y / 2 / : 
X'(Z) = €210 X0 + €220 Xq + €230 89 + C211 XO + C212 XOX) +. 


where the coefficients cj, are functions of z. The nomenclature of the indices 
becomes obvious if we set x} = x,x2 = x’, and x3 = 6. The coefficient cijx then 
determines the effect of the perturbation term x;x, on the variable x;. In operator 
notation we may write 


Cijk = (xilXj0Xx0)- (17.8) 


The first-order coefficients are the principal solutions 


€110(2) = C(z), cao(z) = C@), 
€120(z) = S(z),  €220(z) = S’(2), (17.9) 
€130(z) = D(z), €230(z) = D'(z). 


Before continuing with the solution process, we note that the variation of the 
oscillation amplitudes (x’,y’) are expressed in a curvilinear coordinate system 
generally used in beam dynamics. This definition, however, is not identical to the 
intuitive assumption that the slope x’ of the particle trajectory is equal to the angle 
© between the trajectory and reference orbit. In a curvilinear coordinate system the 
slope x = dx/dz is a function of the amplitude x. To clarify the transformation, we 
define angles between the trajectory and the reference orbit by 

dx dy 


—=0 and 


Q, 17.10 
ds ds ( ) 


where 


ds = (1+ xx) dz. (17.11) 


570 17 Hamiltonian Nonlinear Beam Dynamics* 


In linear beam dynamics there is no numerical difference between x’ and © which 
is a second-order effect nor is there a difference in straight parts of a beam transport 
line where k = 0. The relation between both definitions is from (17.10), (17.11) 

/ / 
. and @= y , 
1+ «x 1+ «x 


(17.12) 


where x’ = dx/dz and y’ = dy/dz. We will use these definitions and formulate 
second-order transformation matrices in a Cartesian coordinate system (x, y, z). 
Following Brown’s notation [1], we may express the nonlinear solutions of (17.4) 
in the general form 


3 3 
ui = Y | eyouyo + D) Tic) uo mio 5 (17.13) 
j=l j=] 
k=1 
with 
(uj, U2, u3) = (x, O, 4), (17.14) 


where z is the position along the reference particle trajectory. Nonlinear transforma- 
tion coefficients Tj, are defined similar to coefficients cj, in (17.8) by 


Tix = (ui\Ujouxo) » (17.15) 


where the coordinates are defined by (17.14). In linear approximation both 
coefficients are numerically the same and we have 


C110 C120 C130 C(z) S(z) D(z) 
C210 €220 C230 | = | C’(z) S'(z) D(z) | - (17.16) 
C310 €320 C330 0 0 1 


Earlier in this section we decided to ignore coupling effects which could be 
included easily in (17.13) if we set for example x4 = y and x5 = y’ and expand 
the summation in (17.13) to five indices. For simplicity, however, we will continue 
to ignore coupling. 

The equations of motion (6.95), (6.96) are expressed in curvilinear coordinates 
and solving (5.75) results in coefficients cj, which are different from the coefficients 
Tix tf one or more variables are derivatives with respect to z. In the equations of 
motion all derivatives are transformed like (17.12) generating a @-term as well as 
an x @-term. If, for example, we were interested in the perturbations to the particle 
amplitude x caused by perturbations proportional to x9 @o, we are looking for the 
coefficient T)12 = (x |xo Oo). Collecting from (6.95) only second-order perturbation 
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terms proportional to xx’, we find 
c= C112 X0. X = €112 X90 Op + O(3). (17.17) 


An additional second-order contribution appears as a spill over from the linear 
transformation 


X= C120 Xo = C120 ad + KyX0) Oo 7 (17.18) 
Collecting all x) Oo—terms, we get finally 
Ty12 = C112 + €120 Kx = C112 + KyS(Z) . (17.19) 


To derive a coefficient like Tr. = (O|xo @o) we also have to transform the 
derivative of the particle trajectory at the end of the magnetic element. First, we 
look for all contributions to x’ from XoXy-terms which originate from x’ = C29 eA + 
C212X0Xq. Setting in the first term x) = Oo (1 + ky xo) and in the second term xoxp ~ 
Xp Oo , we get with co29 = S’(z) and keeping again only second-order terms 


x = [ca + «,S'(z)] x0 - (17.20) 


On the Lh.s. we replace x’ by © (1 + «,x) and using the principal solutions we 
get 


xO & (Cyxo + Sx@p) (Ciro + 8.00) = (CS! + C1Sy) x00 (17.21) 


keeping only the x9@-terms. Collecting all results, the second-order coefficient for 
this perturbation becomes 


T12 = (O |xo Oo) = cain + Kx S"(z) — Kx (Cy) + C.S;) « (17.22) 


In a similar way we can derive all second-order coefficients Tj. Equa- 
tions (17.13) define the transformation of particle coordinates in second order 
through a particular magnetic element. For the transformation of quadratic terms 
we may ignore the third order difference between the coefficients cj and Tj and 
get 


x? = (Cyxo + Sexy + Dy8o)” ; 


x = (C,xo + Syxp + D,8o) (C.x0 + Sexy + D/.0) 


(17.23) 
x6 = (C.x0 + SxXp + D,60) 50 


etc: 
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All transformation equations can now be expressed in matrix form after correctly 
ordering equations and coefficients and a general second-order transformation 
matrix can be formulated in the form 


X Xo 
2) Oo 
t) 50 
x Dr 
xO = XO Ao ’ (17.24) 
xd Xo 50 
CE @? 
(OR) Oo do 
8 82 


where we have ignored the y-plane. The second-order transformation matrix is then 


M= (17.25) 

CSDM Ti Tie) =6°T122— T1206 Ss Te 
CSD Ta To2 Tr16 = Tx22S Tx6 ) To66 
001 0 0 0 0 0 0 

000 C 2CS 2CD S 2SD D? 
00 0 CC’ CS'+C'S CD'+C'D SS’ SD'+S’'D DD' 
000 O 0 C 0 S D 
000 C* 308 2CbD 8? 38D p 
000 O 0 Cc’ 0 S’ D' 
000 O 0 0 0 0 1 


with C = C,, S = S,,... etc. 

A similar equation can be derived for the vertical plane. If coupling effects are to 
be included the matrix could be further expanded to include also such terms. While 
the matrix elements must be determined individually for each magnetic element in 
the beam transport system, we may in analogy to linear beam dynamics multiply 
a series of such matrices to obtain the transformation matrix through the whole 
composite beam transport line. As a matter of fact the transformation matrix has 
the same appearance as (17.24) for a single magnet or a composite beam transport 
line and the magnitude of the nonlinear matrix elements will be representative of 
imaging errors like spherical and chromatic aberrations. 

To complete the derivation of second-order transformation matrices we derive, 
as an example, an expression of the matrix element 7 ;; from the equation of 
motion (6.95). To obtain all Xp-terms, we look in (6.95) for perturbation terms 
proportional to x”, xx’ and x”, replace these amplitude factors by principal solutions 
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and collect only terms quadratic in xp to get the relevant perturbation 
P(z) = [— (gm t+ 2kck + C2) CE A FKxC? + KCC] 2G - (17.26) 


First, we recollect that the theory of nonlinear transformation matrices is based 
on the constancy of magnet strength parameters and we set therefore x, = 0. Where 
this is an undue simplification like in magnet fringe fields one could approximate the 
smooth variation of «, by a step function. Inserting (17.26) into (5.75) the second- 
order matrix element 


Cnr = Tin (17.27) 
=. (tin dee) i C(t) Ge, 0) dt — 4a, C2 (0) Gz, t) at. 
0 0 


The integrands are powers of trigonometric functions and can be evaluated 
analytically. In a similar way we may now derive any second-order matrix element 
of interest. A complete list of all second order matrix elements can be found in [1]. 

This formalism is valuable whenever the effect of second-order perturbations 
must be evaluated for particular particle trajectories. Specifically, it is suitable for 
nonlinear beam simulation studies where a large number of particles representing 
the beam are to be traced through nonlinear focusing systems to determine, for 
example, the particle distribution and its deviation from linear beam dynamics at 
a focal point. This formalism is included in the program TRANSPORT [4] allowing 
the determination of the coefficients Tj, for any beam transport line and providing 
fitting routines to eliminate such coefficients by proper adjustment and placement 
of nonlinear elements like sextupoles. 


17.2. Aberrations 


From light optics we are familiar with the occurrence of aberrations which cause the 
distortion of optical images. We have repeatedly noticed the similarity of particle 
beam optics with geometric or paraxial light optics and it is therefore not surprising 
that there is also a similarity in imaging errors. Aberrations in particle beam optics 
can cause severe stability problems and must therefore be controlled. 

We distinguish two classes of aberrations, geometric aberrations and for off 
momentum particles chromatic aberrations. The geometric aberrations become 
significant as the amplitude of betatron oscillations increases while chromatic 
aberration results from the variation of the optical system parameters for different 
colors of the light rays or in our case for different particle energies. For the 
discussion of salient features of aberration in particle beam optics we study the 
equation of motion in the horizontal plane and include only bending magnets, 
quadrupoles and sextupole magnets. The equation of motion in this case becomes 


574 17 Hamiltonian Nonlinear Beam Dynamics* 


in normalized coordinates 
+ vow = vo B48 + vo B?kw6s — 406 8°? mw? , (17.28) 


where 6 = fy. The particle deviation w from the ideal orbit is composed of two 
contributions, the betatron oscillation amplitude wg and the shift in the equilibrium 
orbit for particles with a relative momentum error 6. This orbit shift ws is determined 


by the normalized dispersion function at the location of interest (ws =7n6= <8) 
and the particle position can be expressed by the composition 


w= weg +ws = we +70. (17.29) 


Inserting (17.29) into (17.28) and employing the principle of linear superposi- 
tion (17.28) can be separated into two differential equations, one for the betatron 
motion and one for the dispersion function neglecting quadratic or higher-order 
terms in 6. The differential equation for the dispersion function is then 


+ vat = vepl74 + vo p?kmS — v3 8°)? mis , (17.30) 


which has been solved earlier in Sect. 9.4.1. All other terms include the betatron 
oscillation wg and contribute therefore to aberrations of betatron oscillations 
expressed by the differential equation 


top + vows = voB?kw, 6 — voB>mnw, 6 — 518°? mws . (17.31) 


The third term in (17.31) is of geometric nature causing a perturbation of beam 
dynamics at large betatron oscillation amplitudes and, as will be discussed in 
Sect. 17.3, also gives rise to an amplitude dependent tune shift. This term appears 
as an isolated term in second order and no local compensation scheme is possible. 
Geometric aberrations must therefore be expected whenever sextupole magnets are 
used to compensate for chromatic aberrations. 

The first two terms in (17.31) represent the natural chromaticity from 
quadrupoles and the compensation by sextupole magnets, respectively. Whenever 
it is possible to compensate the chromaticity at the location where it occurs both 
terms would cancel for myn = k. Since the strength changes sign for both magnets 
going from one plane to the other the compensation is correct in both planes. This 
method of chromaticity correction is quite effective in long beam transport systems 
with many equal lattice cells. An example of such a correction scheme are the 
beam transport lines from the SLAC linear accelerator to the collision point of the 
Stanford Linear Collider, SLC, [5]. This transport line consists of a dense sequence 
of strong magnets forming a combined function FODO channel (for parameters 
see example #2 in Table 10.1). In these magnets dipole, quadrupole and sextupole 
components are combined in the pole profile and the chromaticity compensation 
occurs locally. 
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This method of compensation, however, does not work generally in circular 
accelerators because of special design criteria which often require some parts of the 
accelerator to be dispersion free and the chromaticity created by the quadrupoles in 
these sections must then be corrected elsewhere in the lattice. Consequently both 
chromaticity terms in (17.31) do not cancel anymore locally and can be adjusted to 
cancel only globally. 

The consequence of these less than perfect chromaticity correction schemes 
is the occurrence of aberrations through higher-order effects. We get a deeper 
insight for the effects of these aberrations in a circular accelerator by noting 
that the coefficients of the betatron oscillation amplitude wg for both chromatic 
perturbations are periodic functions in a circular accelerator and can therefore be 
expanded into a Fourier series. Only non-oscillatory terms of these expansions 
cancel if the chromaticity is corrected while all other higher harmonics still appear 
as chromatic aberrations. 


17.2.1 Geometric Aberrations 


Geometric perturbations from sextupole fields scale proportional to the square of the 
betatron oscillation amplitude leading to a loss of stability for particles oscillating 
at large amplitudes. From the third perturbation term in (17.31) we expect this limit 
to occur at smaller amplitudes in circular accelerators where either the betatron 
functions are generally large or where the focusing and therefore the chromaticity 
and required sextupole correction is strong or where the tunes are large. Most 
generally this occurs in large proton and electron colliding-beam storage rings or 
in electron storage rings with strong focusing. 


Compensation of Nonlinear Perturbations 


In most older circular accelerators the chromaticity is small and can be corrected 
by two families of sextupoles. Although in principle only two sextupole magnets 
for the whole ring are required for chromaticity compensation, this is in most 
cases impractical since the strength of the sextupoles becomes too large exceeding 
technical limits or leading to loss of beam stability because of intolerable geometric 
aberrations. For chromaticity compensation we generally choose a more even 
distribution of sextupoles around the ring and connect them into two families 
compensating the horizontal and vertical chromaticity, respectively. This scheme 
is adequate for most not too strong focusing circular accelerators. Where beam 
stability suffers from geometric aberrations more sophisticated sextupole correction 
schemes must be utilized. 

To analyze the geometric aberrations due to sextupoles and develop correction 
schemes we follow a particle along a beam line including sextupoles. Here we 
understand a beam line to be an open system from a starting point to an image point 


576 17 Hamiltonian Nonlinear Beam Dynamics* 


Fig. 17.1 Linear particle ' 
motion in phase space 


Fig. 17.2 Typical phase x! 
space motion in the presence ca|orteee - 
of nonlinear fields ‘ 


at the end or one full circumference of a circular accelerator. Following any particle 
through the beam line and ignoring for the moment nonlinear fields we expect the 
particle to move along an ellipse in phase space as shown in Fig. 17.1. Travelling 
through the length of a circular accelerator with phase advance -y = 27 vo a particle 
moves vo revolutions around the phase ellipse in Fig. 17.1. 

Including nonlinear perturbations due to, for example, sextupole magnets the 
phase space trajectory becomes distorted from the elliptical form as shown in 
Fig. 17.2. An arbitrary distribution of sextupoles along a beam line can cause large 
variations of the betatron oscillation amplitude leading to possible loss of particles 
on the vacuum chamber wall even if the motion is stable in principle. The PEP 
storage ring [6] was the first storage ring to require a more sophisticated sextupole 
correction [7] beyond the mere compensation of the two chromaticities because 
geometric aberrations were too strong to give sufficient beam stability. Chromaticity 
correction with only two families of sextupoles in PEP would have produced large 
amplitude dependent tune shifts leading to reduced beam stability. 

Such a situation can be greatly improved with additional sextupole families 
[7] to minimize the effect of these nonlinear perturbation. Although individual 
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perturbations may not be reduced much by this method the sum of all perturbations 
can be compensated to reduce the overall perturbation to a tolerable level. 

In this sextupole correction scheme the location and strength of the individual 
sextupoles are selected such as to minimize the perturbation of the particle motion 
in phase space at the end of the beam transport line. Although this correction scheme 
seems to work in not too extreme cases it is not sufficient to guarantee beam stability. 
This scheme works only for one amplitude due to the nonlinearity of the problem 
and in cases where sextupole fields are no longer small perturbations we must expect 
a degradation of this compensation scheme for larger amplitudes. As the example 
of PEP shows, however, an improvement of beam stability can be achieved beyond 
that obtained by a simple two family chromaticity correction. Clearly, a more formal 
analysis of the perturbation and derivation of appropriate correction schemes are 
desirable. 


Sextupoles Separated by a —Z-Transformation 


A chromaticity correction scheme that seeks to overcomes this amplitude dependent 
aberration has been proposed by Brown and Servranckx [8]. In this scheme possible 
sextupole locations are identified in pairs along the beam transport line such that 
each pair is separated by a negative unity transformation 


-10 0 0 
0-10 0 

=f[= : 17.32 
0 0-1 0 eee 
0 0 0-1 


Placing sextupoles of equal strength at these two locations we get an additive 
contribution to the chromaticity correction. The effect of geometric aberrations, 
however, is canceled for all particle oscillation amplitudes. This can be seen if we 
calculate the transformation matrix through the first sextupole, the —Z section, and 
then through the second sextupole. The sextupoles are assumed to be thin magnets 
inflicting kicks on particle trajectories by the amount 


Ax = —tinoes Gr - y) ; (17.33) 
and 


Ay’ = —molsxy, (17.34) 
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where £, is the sextupole length. We form a 4 x 4 transformation matrix through a 
thin sextupole and get 


x x0 
x x 
= M, (xo, yo) . 
y *) 
y Xo 

1 0 0 0 Xo 
1 1 

_ — mpl xo 1 5molsxo 0 X (17.35) 
0 0 1 0 Yo 
0 0 molsXo 1 Yo 


To evaluate the complete transformation we note that in the first sextupole the 
particle coordinates are (xo, yo) and become after the —Z-transformation in the 
second sextupole (—xo, —yo). The transformation matrix through the complete unit 
is therefore 


M;, = Ms (x0. yo) (-Z) Ms (—x0, —yo) - (17.36) 


Independent of the oscillation amplitude we observe a complete cancellation of 
geometric aberrations in both the horizontal and vertical plane. This correction 
scheme has been applied successfully to the final focus system of the Stanford 
Linear Collider [9], where chromatic as well as geometric aberrations must be 
controlled and compensated to high accuracy to allow the focusing of a beam to 
a spot size at the collision point of only a few micrometer. 

The effectiveness of this correction scheme and its limitations in circular 
accelerators has been analyzed in more detail by Emery [10] and we will discuss 
some of his findings. As an example, we use strong focusing FODO cells for 
an extremely low emittance electron storage ring [10] and investigate the beam 
parameters along this lattice. Any other lattice could be used as well since the 
characteristics of aberrations is not lattice dependent although the magnitude may 
be. The particular FODO lattice under discussion as shown in Fig. 17.3 is a thin 
lens lattice with 90° cells, a distance between quadrupoles of L, = 3.6m and an 
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Fig. 17.3, FODO lattice and chromaticity correction 
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a) b) c) 


| C 
Fig. 17.4 Phase ellipses along a FODO channel including nonlinear aberrations due to thin 
sextupole magnets separated by exactly 180° in betatron phase (consult text for details) 


d) 


CIN 


= 


integrated half quadrupole strength of (k€4)~' = J2 L,. The horizontal and vertical 
betatron functions at the symmetry points are 12.29 and 2.1088 m respectively. 
Three FODO cells are shown in Fig. 17.3 including one pair of sextupoles separated 
by 180° in betatron phase space. We choose a phase ellipse for an emittance of 
€ = 200mm-mrad which is an upright ellipse at the beginning of the FODO 
lattice, Fig. 17.4a. Due to quadrupole focusing the ellipse becomes tilted at the 
entrance to the first sextupole, Fig. 17.4b. The thin lens sextupole introduces a 
significant angular perturbation (Fig. 17.4c) leading to large lateral aberrations in 
the quadrupole QF (Fig. 17.4d). At the entrance to the second sextupole the distorted 
phase ellipse is rotated by 180° and all aberrations are compensated again by this 
sextupole, Fig. 17.4e. Finally, the phase ellipse at the end of the third FODO cell is 
again an upright ellipse with no distortions left, Fig. 17.4f. The range of stability 
therefore extends to infinitely large amplitudes ignoring any other detrimental 
effects. 

The compensation of aberrations works as long as the phase advance between 
sextupoles is exactly 180°. A shift of the second sextupole by a few degrees or a 
quadrupole error resulting in a similar phase error between the sextupole pair would 
greatly reduce the compensation. In Fig. 17.5 the evolution of the phase ellipse from 
Fig. 17.4 is repeated but now with a phase advance between the sextupole pair of 
only 175°. A distortion of the phase ellipse due to aberrations can be observed 
which may build up to instability as the particles pass through many similar cells. 
Emery has analyzed numerically this degradation of stability and finds empirically 
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71 iT » 
Fig. 17.5 Thin sextupole magnets separated by 175° in betatron phase space. The unperturbed 
phase ellipse (a) becomes slightly perturbed (b) at the end of the first triple FODO cell (Fig.17.3, 


and more so after passing through many such triplets (c) 
a) b) c) fo 
Fig. 17.6 Phase ellipses along a FODO channel including nonlinear aberrations due to finite 


length sextupole magnets placed exactly 180 degrees apart. Phase ellipse (a) transforms to (b) 
after one FODO triplet cell and to (c) after passage through many such cells 


fe) 
wm 


— 


the maximum stable betatron amplitude to scale with the phase error like Ay~°? 


[10]. The sensitivity to phase errors together with unavoidable quadrupole field 
errors and orbit errors in sextupoles can significantly reduce the effectiveness of 
this compensation scheme. 

The single most detrimental arrangement of sextupoles compared to the perfect 
compensation of aberrations is to interleave sextupoles which means to place other 
sextupoles between two pairs of compensating sextupoles [8]. Such interleaved 
sextupoles introduce amplitude dependent phase shifts leading to phase errors and 
reduced compensation of aberrations. This limitation to compensate aberrations is 
present even in a case without apparent interleaved sextupoles as shown in Fig. 17.6 
for the following reason. 

The assumption of thin magnets is sometimes convenient but, as Emery points 
out, can lead to erroneous results. For technically realistic solutions, we must 
allow the sextupoles to assume a finite length and find, as a consequence, a loss 
of complete compensation for geometric aberrations because sextupoles of finite 
length are only one particular case of interleaved sextupole arrangements. If we 
consider the sextupoles made up of thin slices we still find that each slice of the first 
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sextupole has a corresponding slice exactly 180° away in the second sextupoles. 
However, other slices are interleaved between such ideal pairs of thin slices. In 
Fig. 17.6 the sequence of phase ellipses from Fig. 17.4 is repeated with the only 
difference of using now a finite length of 0.3 m for the sextupoles. From the last 
phase ellipse it becomes clear that the aberrations are not perfectly compensated 
as was the case for thin sextupoles. Although the —Z-transformation scheme to 
eliminate geometric aberrations is not perfectly effective for real beam lines it is 
still prudent to arrange sextupoles in that way, if possible, to minimize aberrations 
and apply additional corrections. 


17.2.2 Filamentation of Phase Space 


Some distortion of the unperturbed trajectory in phase space due to aberrations 
is inconsequential to beam stability as long as this distortion does not build up 
and starts growing indefinitely. A finite or infinite growth of the beam emittance 
enclosed within a particular particle trajectory in phase space may at first seem 
impossible since we deal with macroscopic, non-dissipating magnetic fields where 
Liouville’s theorem must hold. Indeed numerical simulations indicate that the total 
phase space occupied by the beam does not seem to increase but an originally 
elliptical boundary in phase space can grow, for example, tentacles like a spiral 
galaxy leading to larger beam sizes without actually increasing the phase space 
density. This phenomenon is called filamentation of the phase space and can evolve 
like shown in Fig. 17.7. 

For particle beams this filamentation is as undesirable as an increase in beam 
emittance or beam loss. We will therefore try to derive the causes for beam 
filamentation in the presence of sextupole non-linearities which are the most 
important non-linearities in beam dynamics. In this discussion we will follow the 
ideas developed by Autin [11] which observes the particle motion in action-angle 
phase space under the influence of nonlinear fields. 


b) 


Fig. 17.7 Filamentation of phase space after passage through an increasing number of FODO cells 
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For simplicity of expression, we approximate the nonlinear sextupoles by thin 
magnets. This does not restrict our ability to study the effect of finite length 
sextupoles since we may always represent such sextupoles by a series of thin 
magnets. A particle in a linear lattice follows a circle in action-angle phase space 
with a radius equal to the action Jo. The appearance of a nonlinearity along the 
particle trajectory will introduce an amplitude variation AJ to the action which is 
from the Courant-Snyder invariant for both the horizontal and vertical plane 


AJ, = v,owAw + LwAw = twAw, 
a (17.37) 
AJ) = vyovAv + 55 UA = 5 VAD, 


since Aw = Av = 0 fora thin magnet. Integration of the equations of motion in 
normalized coordinates over the “length” ¢ of the thin magnet produces a variation 
of the slopes 


Aw = v,9VBr5ml(? —y’), 
Av = —Vy0 V Bye xy. 


We insert (17.38) into (17.37) and get after transformation into action-angle 
variables and linearization of the trigonometric functions the variation of the action 


(17.38) 


AJ, = me — | (cB _ 2,By%) sin Py + Jy Bx sin 3Wy 
—JyBy% [sin( + 20h) + sin(We — 2vy)]} (17.39) 
Ady = / 21, By[sin Ye + 2Wy) — sin(Yx — 2Wy)]. 


Since the action is proportional to the beam emittance, (17.39) allow us to study 
the evolution of beam filamentation over time. The increased action from (17.39) is 
due to the effect of one nonlinear sextupole magnet and we obtain the total growth 
of the action by summing over all turns and all sextupoles. To sum over all turns 
we note that the phases in the trigonometric functions increase by 27 vox.) every 
turn and we have for the case of a single sextupole after an infinite number of turns 
expressions of the form 


» sin[(Wyj + 27 von) + 2(Wy + 2 vy0n)] , (17.40) 


n=0 


where ,; and y,; are the phases at the location of the sextupole j. Such sums of 
trigonometric functions are best solved in the form of exponential functions. In this 
case the sine function terms are equivalent to the imaginary part of the exponential 
functions 


ela t2Wyyj) elt (v0 F205) (17.41) 
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The second factor forms an infinite geometric series and the imaginary part of the 
sum is therefore 


= cits v9) _ c0sl(Yxy = TVx0) + 2(Wyj — TVyo)] (17.42) 
1 — e227 (x0 +20) 2 sin[z(vx9 + 2vy0)] 


This solution has clearly resonant character leading to an indefinite increase of 
the action if v9 + 2vy9 is an integer. Similar results occur for the other three terms 
and Autin’s method to observe the evolution of the action coordinate over many 
turns allows us to identify four resonances driven by sextupolar fields which can 
lead to particle loss and loss of beam stability if not compensated. Resonant growth 
of the apparent beam emittance occurs according to (17.39) for 


ll a MAC ie (17.43) 

30.0 = GQ, or Vx9 — 2V50 =444, 
where the g; are integers. In addition to the expected integer and third integer 
resonance in the horizontal plane, we find also two third order coupling resonances 
in both planes where the sum resonance leads to beam loss while the difference 
resonance only initiates an exchange of the horizontal and vertical emittances. The 
asymmetry is not fundamental and is the result of our choice to use only upright 
sextupole fields. 

So far we have studied the effect of one sextupole on particle motion. Since no 
particular assumption was made as to the location and strength of this sextupole, we 
conclude that any other sextupole in the ring would drive the same resonances and 
we obtain the beam dynamics under the influence of all sextupoles by adding the 
individual contributions. In the expressions of this section we have tacitly assumed 
that the beam is observed at the phase 7,o,,0 = 0. If this is not the desired location 
of observation the phases 7,; need to be replaced by Wj; — Wo, etc., where the 
phases 7,;,; define the location of the sextupole j. Considering all sextupoles in a 
circular lattice we sum over all such sextupoles and get, as an example, for the sum 
resonance used in the derivation above from (17.39) 


2S,B 


Vx0 


mye; ig We. 
Mnvet2y = — Da *JyByj— sin Wy + 2). (17.44) 
, y 
; 


Similar expressions exist for other resonant terms. Equation (17.44) indicates a 
possibility to reduce the severity of driving terms for the four resonances. Sextupoles 
are primarily inserted into the lattice where the dispersion function is nonzero to 
compensate for chromaticities. Given sufficient flexibility these sextupoles can be 
arranged to avoid driving these resonances. Additional sextupoles may be located 
in dispersion free sections and adjusted to compensate or at least minimize the 
four resonance driving terms without affecting the chromaticity correction. The 
perturbation AJ is minimized by distributing the sextupoles such that the resonant 
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driving terms in (17.39) are as small as possible. This is accomplished by harmonic 
correction which is the process of minimization of expressions 


a ml; p>” evi +0, (17.45) 

De mt;B3!? BY > 0, (17.46) 

be ml; B1!? By ei > 0, (17.47) 

Di eiB”” By PHP — 0, (17.48) 
a ml Bi? By li) > 0, (17.49) 


The perturbations of the action variables in (17.39) cancel perfectly if we insert 
sextupoles in pairs at locations which are separated by a —Z transformation as 
discussed previously in this chapter. The distribution of sextupoles in pairs is 
therefore a particular solution to (17.45) for the elimination of beam filamentation 
and specially suited for highly periodic lattices while (17.45)—(17.49) provide more 
flexibility to achieve similar results in general lattices and sextupole magnets of 
finite length. 

Cancellation of resonant terms does not completely eliminate all aberrations 
caused by sextupole fields. Because of the existence of nonlinear sextupole fields 
the phases y; depend on the particle amplitude and resonant driving terms are 
therefore canceled only to first order. For large amplitudes we expect increasing 
deviation from the perfect cancellation leading eventually to beam filamentation 
and beam instability. Maximum stable oscillation amplitudes in (x, y)-space due to 
nonlinear fields form the dynamic aperture which is to be distinguished from the 
physical aperture of the vacuum chamber. This dynamic aperture is determined by 
numerical tracking of particles. Given sufficiently large physical apertures in terms 
of linear beam dynamics the goal of correcting nonlinear aberrations is to extend the 
dynamic aperture to or beyond the physical aperture. Methods discussed above to 
increase the dynamic aperture have been applied successfully to a variety of particle 
storage rings, especially by Autin [11] to the antiproton cooling ring ACOL, where 
a particularly large dynamic aperture is required. 


17.2.3, Chromatic Aberrations 


Correction of natural chromaticities is not a complete correction of all chromatic 
aberrations. For sensitive lattices nonlinear chromatic perturbation terms must 
be included. Both linear as well as nonlinear chromatic perturbations have been 
discussed in detail in Sect. 9.4.1. Such terms lead primarily to gradient errors and 
therefore the sextupole distribution must be chosen such that driving terms for half 
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integer resonances are minimized. Together with tune shifts due to gradient field 
errors we observe also a variation of the betatron function. Chromatic gradient errors 
in the presence of sextupole fields are 


Pilz) = (kK— mn) 5 (17.50) 


and the resulting variation of the betatron function has been derived in Sect. 15.3. 
For the perturbation (17.50) the linear variation of the betatron function with 
momentum is from (15.91) 


AB(z) a 5 z+L 
Bo ~ 2sin 27 v9 


B(k— mn) cos[2vo(y, — ge + 27)]de, (17.51) 


where L is the length of the superperiod, g, = g(z) and g; = g(¢). The same result 
can be expressed in the form of a Fourier expansion for N, superperiods in a ring 
lattice by 


as iNsqg 
(17.52) 
VG a gi 2)" 
where 
Vo 2n P i 
E> Br(k — mn)je*'4? do . (17.53) 
Qn 0 


Both expressions exhibit the presence of half integer resonances and we must 
expect the area of beam stability in phase space to be reduced for off momentum 
particles because of the increased strength of the resonances. Obviously, this 
perturbation does not appear in cases where the chromaticity is corrected locally 
so that (k — mn) = O but few such cases exist. To minimize the perturbation 
of the betatron function, we look for sextupole distributions such that the Fourier 
harmonics are as small as possible by eliminating excessive “fighting” between 
sextupoles and by minimizing the resonant harmonic gq = 2vo. Overall, however, 
it is not possible to eliminate this beta-beat completely. With a finite number of 
sextupoles the beta-beat can be adjusted to zero only at a finite number of points 
along the beam line. 

In colliding-beam storage rings, for example, we have specially sensitive sections 
just adjacent to the collision points. To maximize the luminosity the lattice is 
designed to produce small values of the betatron functions at the collision points 
and consequently large values in the adjacent quadrupoles. In order not to further 
increase the betatron functions there and make the lattice more sensitive to errors, 
one might choose to seek sextupole distributions such that the beta-beat vanishes at 
the collision point and its vicinity. 
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Having taken care of chromatic gradient errors we are left with the variation of 
geometric aberrations as a function of particle momentum. Specifically, resonance 
patterns vary and become distorted as the particle momentum is changed. Generally 
this should not cause a problem as long as the dynamic aperture can be optimized 
to exceed the physical aperture. A momentum error will introduce only a small 
variation to the dynamic aperture as determined by geometric aberrations for on 
momentum particles only. If, however, the dynamic aperture is limited by some 
higher-order resonances even a small momentum change can cause a big difference 
in the stable phase space area. 

Analytical methods are useful to minimize detrimental effects of geometric 
and chromatic aberrations due to nonlinear magnetic fields. We have seen how 
by careful distribution of the chromaticity correcting sextupoles, resonant beam 
emittance blow up and excessive beating of the betatron functions for off momentum 
particles can be avoided or at least minimized within the approximations used. In 
Sect. 17.3, we will also find that sextupolar fields can produce strong tune shifts 
for larger amplitudes leading eventually to instability at nearby resonances. Here 
again a correct distribution of sextupoles will have a significant stabilizing effect. 
Although there are a number of different destabilizing effects, we note that they 
are driven by only a few third order resonances. Specifically, in large circular 
lattices a sufficient number of sextupoles and locations for additional sextupoles 
are available for an optimized correction scheme. In small rings such flexibility 
often does not exist and therefore the sophistication of chromaticity correction 
is limited. Fortunately, in smaller rings the chromaticity is much smaller and 
some of the higher-order aberrations discussed above are very small and need 
not be compensated. Specifically, the amplitude dependent tune shift is generally 
negligible in small rings while it is this effect which limits the dynamic aperture in 
most cases of large circular accelerators. 

The optimization of sextupole distribution requires extensive analysis of the 
linear lattice and it is best to use a numerical program to do the well known but 
cumbersome work. At present the program OPA [12] is widely used. This program 
uses a linear lattice and adjusts the sextupoles such that chromaticities and some 
harmonics are corrected. With the new sextupole strengths the dynamic aperture 
can be obtained in the same program. 

In trying to solve aberration problems in beam dynamics we are, however, 
mindful of approximations made and terms neglected for lack of mathematical tools 
to solve analytically the complete nonlinear dynamics in realistic accelerators. The 
design goals for circular accelerators become more and more demanding on our 
ability to control nonlinear aberrations. On one hand the required cross sectional 
area in the vicinity of the ideal orbit for a stable beam remains generally constant 
for most designs but the degree of aberrations is increased in an attempt to reach very 
special beam characteristics. As a consequence, the nonlinear perturbations become 
stronger and the limits of dynamic aperture occur for smaller amplitudes compared 
to less demanding lattices and require more and more sophisticated methods of 
chromaticity correction and control of nonlinear perturbations. 
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17.2.4 Particle Tracking 


No mathematical methods are available yet to calculate analytically the limits of the 
dynamic aperture for any but the most simple lattices. High order approximations 
are required to treat strong aberrations in modern circular accelerator designs. The 
most efficient way to determine beam stability characteristics for a particular lattice 
design is to perform numerical particle tracking studies. 

Perturbations of localized nonlinear fields on a particle trajectory are easy to cal- 
culate and tracking programs follow single particles along their path incorporating 
any nonlinear perturbation encountered. Since most nonlinear fields are small, we 
may use thin lens approximation and passage of a particle through a nonlinear field 
of any order inflicts therefore only a deflection on the particle trajectory. During the 
course of tracking the deflections of all non-linearities encountered are accumulated 
for a large number of turns and beam stability or instability is judged by the particle 
surviving the tracking or not, respectively. The basic effects of nonlinear fields 
in numerical tracking programs are therefore reduced to what actually happens 
to particles travelling through such fields producing results in an efficient way. 
Of course from an intellectual point of view such programs are not completely 
satisfactory since they serve only as tools providing little direct insight into actual 
causes for limitations to the dynamic aperture and instability. 

The general approach to accelerator design is to develop first a lattice in linear 
approximation meeting the desired design goals followed by an analytical approach 
to include chromaticity correcting sextupoles in an optimized distribution. Further 
information about beam stability and dynamic aperture can at this point only 
be obtained from numerical tracking studies. Examples of widely used computer 
programs to perform such tracking studies are in historical order PATRICIA [7], 
RACETRACK [13], OPA [12] and more. 

Tracking programs generally require as input an optimized linear lattice and 
allow then particle tracking for single particles as well as for a large number of 
particles simulating a full beam. Nonlinear fields of any order can be included as thin 
lenses in the form of isolated multipole magnets like sextupoles or a multipole errors 
of regular lattice magnets. The multipole errors can be chosen to be systematic or 
statistical and the particle momentum may have a fixed offset or may be oscillating 
about the ideal momentum due to synchrotron oscillations. 

Results of such computer studies contribute information about particle dynamics 
which is not available otherwise. The motion of single particles in phase space can 
be observed together with an analysis of the frequency spectrum of the particle 
under the influence of all nonlinear fields included and at any desired momentum 
deviation. 

Further information for the dynamics of particle motion can be obtained from 
the frequency spectrum of the oscillation. An example of this is shown in Fig. 17.8 
as a function of oscillation amplitudes. For small amplitudes we notice only 
the fundamental horizontal betatron frequency v,. As the oscillation amplitude is 
increased this basic frequency is shifted toward lower values while more frequencies 
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Fig. 17.8 Frequency spectrum for betatron oscillations with increasing amplitudes (x) as deter- 
mined by particle tracking with PATRICIA 


appear. We note the appearance of higher harmonics of v, due to the nonlinear nature 
of motion. 

The motion of a particle in phase space and its frequency spectrum as a result of 
particle tracking can give significant insight into the dynamics of a single particle. 
For the proper operation of an accelerator, however, we also need to know the overall 
stability of the particle beam. To this purpose we define initial coordinates of a large 
number of particles distributed evenly over a cross section normal to the direction of 
particle propagation to be tested for stability. All particles are then tracked for many 
turns and the surviving particles are displayed over the original cross section at the 
beginning of the tracking thus defining the area of stability or dynamic aperture. 


17.3. Hamiltonian Perturbation Theory 


The Hamiltonian formalism has been applied to derive tune shifts and to discuss 
resonance phenomena. This was possible by a careful application of canonical 
transformation to eliminate, where possible, cyclic variables from the Hamiltonian 
and obtain thereby an invariant of the motion. We have also learned that this 
“elimination” process need not be perfect. During the discussion of resonance 
theory, we observed that slowly varying terms function almost like cyclic variables 
giving us important information about the stability of the motion. 

During the discussion of the resonance theory, we tried to transform perturbation 
terms to a higher order in oscillation amplitude than required by the approximation 
desired and where this was possible we could then ignore such higher-order fast- 
oscillating terms. This procedure was successful for all terms but resonant terms. 
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In this section we will ignore resonant terms and concentrate on higher-order 
terms which we have ignored so far [14]. By application of a canonical identity 
transformation we try to separate from fast oscillating terms those which vary only 
slowly. To that goal, we start from the nonlinear Hamiltonian (16.30) 


H=vwJ+p,(y) J" cos" v. (17.54) 


Fast-oscillating terms can be transformed to a higher order by a canonical transfor- 
mation which can be derived from the generating function 


ae 


HVN+eWe (17.55) 


where the function g(¥, g) is an arbitrary but periodic function in w and y which 
we will determine later. From (17.55) we get for the new angle variable y and the 
old action variable J 


i= Gt = Wt Soy.g rr, 


(17.56) 
jae ea! 
and the new Hamiltonian is 
dG, r) n 
mane og BO re , (17.57) 
dy dg 


We replace now the old variables (7, /) in the Hamiltonian by the new variables 
(J,, W) and expand 


a ni? a 
pir = (4 de ea”) ay 5a tien, (17.58) 


With (17.56), (17.58) the Hamiltonian (17.57) becomes 


ny 4 98 
dW + Pn(p) cos" py + ce (17.59) 


n—1 i n og pete 
+ J; | Seale) os w =| +O(s, ) ‘ 


Ah =vosi del? Ee 


All terms of order n + 1/2 or higher in the amplitude J as well as quadratic terms 
in g(¥, @) or derivations thereof have been neglected. We still must express all terms 
of the Hamiltonian in the new variables and define therefore the function 


3 3 
OW. ~) = vo + pn(y) cos" +. (17.60) 
V ag 
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In the approximation of small perturbations we have fj ~ wor Ww = w+ Aw and 
may expand (17.60) like 


Qi. 9) = OW.9) + eA (17.61) 
0 
= 2.9) + Sein. 9) Hie 


ay 
where we used the first equation of (17.56). The Hamiltonian can be greatly 


simplified if we make full use of the periodic but otherwise arbitrary function 
g(wW),@). With (17.62) we obtain from (17.59) 


(17.62) 


A, = WJ) + ee 


n— n dQ 
+55 pao cos nee — ev.) |: o (17.63) 
ay ow 
and we will derive the condition that 
O(y",~) =0. (17.64) 
First we set 
cos” Wy = ye Anne! (17.65) 


and try an expansion of g(wW,, g) in a similar way by setting 
(i. 9) = D> gm(yyei"—™, (17.66) 


where the function g obviously is still periodic in y and ¢ as long as g,,(g) is 
periodic. With 


= D> gm(y) im eimrve) (17.67) 


m>=—-n 


os 
aw 
and 


oem inomen() elni—voe) (17.68) 
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we get instead of (17.60) 


OW, 9) + OW, gv) = 


n n 
ivo 2 mgm e™VI-VP) 4, (g) > nme 


m>—-n m>=—-n 


n 
9 
+ > (= - ivomen} elm(Vi—v0v) — 9) 


m>=—-n 


noting from (17.62) that the difference AO = Q(wW,¢~) — Q(V,¢@) contributes 
nothing to the term of order ie * forn > 2. The imaginary terms cancel and we 
get 


” i 0 m j = 
O(W1.9) © Pal) Y~ imei” + S~ rd ro) — 0, (17.69) 


m>=—-n m=—-n 


This equation must be true for all values of g and therefore the individual terms of 
the sums must vanish independently 


0 ae | 
Pr(P)anm + - mo? = 0 (17.70) 


for all values of m. After integration we have 


ie . 
&m(Y) = &m0 — am f Pn(pye”™? dp (17.71) 
0 


and since the coefficients g,,(g) must be periodic [gm(~) = 8n(9 + *)| where NV 
is the super-periodicity, we are able to eventually determine the function g(W, @). 
With 


; 2 + 20 
Slee =z, (o + =) ei (Vi-v09— 5 ¥0) (17.72) 


and (17.71) we have 
gma? YP) _ ier e [ademas 
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Solving for gno we get 


sh We te 
&m0 (1 = ene) = am f Pnl(pyer? db ‘ (17.73) 
0 


A solution for g,,9 exists only if there are no perturbations and p(y) = 0 or if 


§ ag 2K. ‘ sys 
(1 = en) 0. In other words we require the condition 


mvo # qN, (17.74) 


where g is an integer number. The canonical transformation (17.55) leads to the 
condition (17.64) only if the particle oscillation frequency is off resonance. We have 
therefore the result that all nonresonant perturbation terms can be transformed to 
higher-order terms in the oscillation amplitudes while the resonant terms lead to 
phenomena discussed earlier. From (17.73) we derive g,,9, obtain the function g,,(¢) 
from (17.71) and finally the function g(¥, g) from (17.66). Since O(W1, gy) = 0, 
the Hamiltonian is from (17.63) 


Ay =v +I"? Oi.) (17.75) 


+ sit : |pa(o) cos” Vi se. ~~ ali, ”) e+ -* 


Nonresonant terms appear only in order Yee As long as such terms can be 
considered small we conclude that the particle dynamics is determined by the linear 
tune vo, a tune shift or tune spread caused by perturbations and resonances. Note that 
the Hamiltonian (17.75) is not the complete form but addresses only the nonresonant 
case of particle dynamics while the resonant case of the Hamiltonian has been 
derived earlier. 

We will now continue to evaluate (17.75) and note that the product 


d0(W1, 9) 
=0 177 
81.9) ah (17.76) 
in this approximation and get 
T(W. 9) = =Pn(@) 28. 17.77) 
we sbaly cos” ay’ (17. 


where we have dropped the index on y and set from now on Ww = W which is not 
to be confused with the variable y used before the transformation (17.55). Using 
the Fourier spectrum for the perturbations and summing over all but resonant terms 
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q # q we get from (17.73) 


8&m0 (1 _ eit x pny = dnm SS [ Png eimo- IN)? dep 


GF4: 
ein WYO 1 

= dim > Png. H... a 3. (17.78) 

—qN 

GF4: aoa 

or 
8m0 = 14pm » a . (17.79) 
mvo — GN 
TFd: 

Note that we have excluded in the sums the resonant terms gq = q, where 


m,Vo — gyN = 0. These resonant terms include also terms g = 0 which do not 
cause resonances of the ordinary type but lead to tune shifts and tune spreads. After 
insertion into (17.71) and some manipulations we find 


8m (QP) = idam > a 2, eye > a Pad eilnvo— IN) dd (17.80) 
mvo — GN 
GF 4a GF 4a 
el(mvo— gN)y 
= 1dnm ~~ aes 
GF 4K: 
and with (17.66) 
avg) =i s Le _SamPng_ gimy e-ig)e | (17.81) 
Emenee mvo — GN 


From (17.77) we get with (17.65) and (17.81) 
it = : im 
TW. 9) =i 5 DI Page? YT dune” may. 9) (17.82) 
Fa m=—n 


This function T(y, y) is periodic in w and gy and we may apply a Fourier expansion 
like 


T(W.9) = dX 2 Tig VO (17.83) 


#0. 
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where the coefficients T;,,; are determined by 


Qn 2n/N 
Ts = — | ce dy i eT, v) dg. (17.84) 
47? 
0 0 


To evaluate (17.84) it is most convenient to perform the integration with respect 
to the betatron phase w before we introduce the expansions with respect to @. 
Using (17.65), (17.66), (17.77), we get from (17.84) after some reordering 


T,; = =i> se mf ee = eiGtm— ryw dy 


m=—-n j=n2 
2n/N : 
7 | Pr(@) Bm(p)eemro "Ne dg. 
0 


The integral with respect to y is zero for all values j + m—r ¥ 0 and therefore 
equal to Gy.,—m 


Fi 2n/N 


Ty =i D> manem J prlp) Buoy eH de. (17.85) 


m=—n 
0 


Expressing the perturbation p,(y) by its Fourier expansion and replacing g,,(¢) 
by (17.80), (17.85) becomes 


Pn.s—q Pn, 
a 3 MQm,r—m Ann >» oe : (17.86) 
m=—n ga NO 4 


With this expression we have fully defined the function T(w, y) and obtain for the 
non-resonant Hamiltonian (17.75) 


Adar > yee (17.87) 


" s#xvO 


We note in this result a higher-order amplitude dependent tune spread which has 
a constant contribution To as well as oscillatory contributions. 

Successive application of appropriate canonical transformations has lead us 
to derive detailed insight into the dynamics of particle motion in the presence 
of perturbations. Of course, every time we applied a canonical transformation 
of variables it was in the hope of obtaining a cyclic variable. Except for the 
first transformation to action-angle variables, this was not completely successful. 
However, we were able to extract from perturbation terms depending on both 
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action-angle variables such elements that do not depend on the angle variable. As a 
result, we are now able to determine to a high order of approximation shifts in the 
betatron frequency caused by perturbations as well as the occurrence and nature of 
resonances. 

Careful approximations and simplifications had to be made to keep the math- 
ematical formulation manageable. Specifically we had to restrict the perturbation 
theory in this section to one order of multipole perturbation and we did not address 
effects of coupling between horizontal and vertical betatron oscillations. 

From a more practical view point one might ask to what extend this higher- 
order perturbation theory is relevant for the design of particle accelerators. Is the 
approximation sufficient or is it more detailed than needed? As it turns out so often 
in physics we find the development of accelerator design to go hand in hand with the 
theoretical understanding of particle dynamics. Accelerators constructed up to the 
late sixties were designed with moderate focusing and low chromaticities requiring 
no or only very weak sextupole magnets. In contrast more modern accelerators 
require much stronger sextupole fields to correct for the chromaticities and as a 
consequence, the effects of perturbations, in this case third-order perturbations, 
become more and more important. The ability to control the effects of such 
perturbations actually limits the performance of particle accelerators. For example, 
in colliding-beam storage rings the strongly nonlinear fields introduced by the beam- 
beam effect limit the attainable luminosity while a lower limit on the attainable beam 
emittance for synchrotron light sources or damping rings is determined by strong 
sextupole fields. 


17.3.1 Tune Shift in Higher Order 


In (16.36) we found the appearance of tune shifts due to even order multipole 
perturbations only. Third-order sextupole fields, therefore, would not affect the 
tunes. This was true within the degree of approximation used at that point. In this 
section, however, we have derived higher-order tune shifts and should therefore 
discuss again the effect of sextupolar fields on the tune. 

Before we evaluate the sextupole terms, however, we like to determine the 
contribution of a quadrupole perturbation to the higher-order tune shift. In lower 
order we have derived earlier a coherent tune shift for the whole beam. We 
use (17.86) and calculate Too for n = 2 


ma a — 
ges = S> prg Pr, - cP 2,m 42, ila Tee aia (17.88) 
mvo — GN 
Fa m=—2 
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With 4429 = do-2 = 2a29 = | and dz) = az; = 0 the term in the bracket 
becomes 


2 2 2qN 
—2v9-9qN © 2v—4qN (29)? —(qNY 


and (17.88) is simplified to 


2qN 
To = - P2,q P2,- (17.89) 
a eT Qu) = (GN) 


In this summation we note the appearance of the index gq in pairs as a positive 
and a negative value. Each such pair cancels and therefore 


Too2 = 0, (17.90) 


where the index » indicates that this coefficient was evaluated for a second- 
order quadrupole field. This result is not surprising since all quadrupole fields 
contribute directly to the tune and formally a quadrupole field perturbation cannot 
be distinguished from a “real” quadrupole field. 

In a similar way we derive the 7o9 coefficient for a third-order multipole or a 
sextupolar field. From (17.86) we get for n = 3 


maz ihG3,m 3,-m 43,—m 
Loi =, ey P3.q P3.-q 5 (17.91) 
mvVo — “mvo — qgN * 
Ps, m=—3 
Since cos? y is an even function we have a3, = d3,-m,43,1 = : and a33 = : 


The second sum in (17.91) becomes now 


1 3 qd qd 3 
ae a + 
64 \ 3v9 + gN Vo + gN Vo — gN 3v9 — gN 


1 oe & 18v9 
~ 64 \V2- (qn)? Gro)3— (GN) ) 


and after separating out the terms for g = 0, (17.91) becomes 


To0,3 = —=-—P30 (17.92) 


27v0 : 
— — i - 
a ) P3,q P3,-q EF (gN)? (v9)? — on | 
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This expression in general is nonzero and we found, therefore, that sextupole 
fields indeed, contribute to a tune shift although in a high order of approximation. 
This tune shift can actually become very significant for strong sextupoles and for 
tunes close to an integer or third integer resonances. Although we have excluded 
resonances (q = q,) , terms close to resonances become important. Obviously, the 
tunes should be chosen such as to minimize both terms in the bracket of (17.92). 
This can be achieved with v9 = gN + 5N and 3vg = rN + 5N where g and r are 
integers. Eliminating vo from both equations we get the condition 3g —r+ 1=0 
or r = 3q + 1. With this we finally get from the two tune conditions the relation 
2v9 =(2qg+1)Nor 


2q+1 
2 


N. (17.93) 


Vopt = 


Of course, an additional way to minimize the tune shift is to arrange the sextupole 
distribution in such a way as to reduce strong harmonics in (17.92). In summary, we 
find for the non-resonant Hamiltonian in the presence of sextupole fields. 


A = vod + Taal” + higher order terms (17.94) 
and the betatron oscillation frequency or tune is given by 
V = Vo + 2T993J. (17.95) 


In this higher-order approximation of beam dynamics we find that sextupole 
fields cause an amplitude dependent tune shift in contrast to our earlier first-order 
conclusion 


A = 
SS ae (ye +2uu' +B uw”) = Tye; (17.96) 
Vo Vo 


where we have used (5.59) with € the emittance of a single particle oscillating with 
a maximum amplitude a” = Be. We have shown through higher-order perturbation 
theory that odd order nonlinear fields like sextupole fields, can produce amplitude 
dependent tune shifts which in the case of sextupole fields are proportional to 
the square of the betatron oscillation amplitude and therefore similar to the tune 
shift caused by octupole fields. In a beam where particles have different betatron 
oscillation amplitudes this tune shift leads to a tune spread for the whole beam. 

In practical accelerator designs requiring strong sextupoles for chromaticity 
correction it is mostly this tune shift which moves large amplitude particles onto 
a resonance thus limiting the dynamic aperture. Since this tune shift is driven by 
the integer and third-order resonance, it is imperative in such cases to arrange the 
sextupoles such as to minimize this driving term for geometric aberration. 
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Problems 


17.1 (S). Derive the expression for the second-order matrix element T\6¢6 and give 
a physical interpretation for this term. 


17.2 (S). Show that the perturbation proportional to x9 is p(z|xj) = 


[(-4m — «> — 2k) C+ SK c?] x2 , where C = C (z) = cos /kzand C’ = C’ (2) 
and the second-order matrix element 


Tin = (—4m —«3 — 2k) 4 [kS? + 1 ©)] + de [2 — ©) — KS"). 


17.3 (S). Consider a large circular accelerator made of many FODO cells with 
a phase advance of 90° per cell. Locate chromaticity correcting sextupoles in 
the center of each quadrupole and calculate the magnitude for one of the five 
expressions (17.45)-(17.49). Now place non-interleaved sextupole in pairs 180° 
apart and calculate the same two expressions for the new sextupole distribution. 


17.4 (S). Use the lattice of Problem 17.3 and determine the tunes of the ring. Are 
the tunes the best choices for the super-periodicity of the ring to avoid resonance 
driven sextupole aberrations? How would you go about improving the situation? 


17.5. Expand the second-order transformation matrix to include path length terms 
relevant for the design of an isochronous beam transport system and derive expres- 
sions for the matrix elements. Which elements must be adjusted and how would you 
do this? Which parameters would you observe to control your adjustment? 


17.6. Sextupoles are used to compensate for chromatic aberrations at the expense 
of geometric aberrations. Derive a condition for which the geometric aberration 
has become as large as the original chromatic aberration. What is the average 
perturbation of geometric aberrations on the betatron motion? Try to formulate a 
“rule of thumb” stability criteria for the maximum sextupole strength. Is it better to 
place a chromaticity correcting sextupole at a high beta location (weak sextupole) 
or at a low beta location (weak aberration)? 


17.7. Consider both sextupole distributions of Problem 17.3 and form a phasor 
diagram of one of expressions (17.45)—(17.49) for the first four or more FODO cells. 
Discuss desirable features of the phasor diagram and explain why the —Z correction 
scheme works well. A phasor diagram is constructed by adding vectorially each 
term of an expression (17.45)—(17.49) going along a beam line. 


17.8. The higher-order chromaticity of a lattice may include a strong quadratic 
term. What dependence on energy would one expect in this case for the beta beat? 
Why? Can your findings be generalized to higher-order terms? 
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Part VI 
Acceleration 


Chapter 18 
Charged Particle Acceleration 


Particle acceleration by rf-fields has been discussed, for example, in considerable 
detail in [1, 2] where relationships between longitudinal phase oscillation and beam 
stability are derived and discussed. The accelerating fields were assumed to be 
available in resonant cavities, but we ignored conditions that must be met to generate 
such fields and ensure positive energy transfer to the particle beam. In this chapter, 
we will discuss relevant characteristics of rf-cavities and study the interaction of the 
rf-generator with accelerating cavity and beam. 

It is not the intention here to develop a general microwave theory but we 
will restrict ourselves rather to such aspects which are of importance for particle 
accelerator physics. Considerable performance limits occur in accelerators by 
technical limitations in various accelerator systems as, for example, the rf-system 
and it is therefore useful for the accelerator designer to have a basic knowledge of 
such limits. 


18.1 Rf-Waveguides and Cavities 


Commonly, high frequency rf-fields are used to accelerate charged particles and 
the interaction of such electromagnetic waves with charged particles has been 
discussed earlier together with the derivation of synchronization conditions to obtain 
continuous particle acceleration. In doing so plane rf-waves have been used ignoring 
the fact that such fields do not have electrical field components in the direction of 
particle and wave propagation. Although this assumption has not made the results 
obtained so far obsolete, a satisfactory description of the wave-particle interaction 
must include the establishment of appropriate field configurations. 
Electromagnetic waves useful for particle acceleration must exhibit field compo- 
nents in the direction of particle propagation which in our coordinate system is the 
z-direction. The synchronization condition can be achieved in two ways. First, an 
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electromagnetic wave travels along the direction of the desired particle acceleration 
with a phase velocity which is equal to the velocity of the particle. In this case, a 
particle starting, say, at the crest of the wave where the field strength is largest, would 
be continuously accelerated at the maximum rate as it moves along with the wave. 
Another way of particle acceleration occurs from electromagnetic fields created in 
rf-cavities placed at particular locations along the particle path. In this case, the 
phase velocity of the wave is irrelevant. For positive particle acceleration the phase 
of the electromagnetic field must be adjusted such that the integrated acceleration is 
positive, while the particle passes through the cavity. Obviously, if the velocity of 
the particle or the length of the cavity is such that it takes several oscillation periods 
for a particle to traverse the cavity no efficient acceleration is possible. 


18.1.1 Wave Equation 


To generate electromagnetic field components in the direction of wave propagation 
we cannot use free plane waves, but must apply specific boundary conditions 
by properly placing conducting metallic surfaces to modify the electromagnetic 
wave into the desired form. The theory of electromagnetic waves, waveguides and 
modes is well established and we repeat here only those aspects relevant to particle 
acceleration. For more detailed reading consult, for example, [3, 4]. Maxwell’s 
equations for our application in a charge free environment are 


= — — dB 
V(cE) = 0, Vee a (18.1) 
VB =0, cVxBe=euz. 


and we look for solutions in the form of rf-fields oscillating with frequency w and 
U = Upe ~where U = E or B. A uniform medium is assumed which need not 
be a vacuum but may have a dielectric constant € and a magnetic permeability ju. 
Maxwell’s curl equations become then 


V x E= —ioB 

; 18.2 
CV x B=ppok. “ee 

Eliminating the magnetic or electric field strength from both equations and using the 

vector relation V x (V x a) = V (Va) — Va we get the respective wave equations 


2 2 
VE + KE =0, ies 
VWB+KB=0, 
where 
2 
k=enS. (18.4) 
CG 
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In the case of a plane wave propagating along the z-axis the transverse partial 
derivatives vanish 2 a i = 0, since field parameters of a plane wave do not 
vary transverse to the direction of propagation. The differential equation (18.3) for 


the electrical field component then becomes (4 + i) E = 0 and the solution is 


E= Eee, (18.5) 


For real values of the wave number k the solutions of (18.3) describe waves 
propagating with the phase velocity 


z c 
Uph = - = 
a | 


<c. 18. 
Ja (a (18.6) 

An imaginary component of k, on the other hand, would lead to an exponential 
damping term for the fields, a situation that occurs, for example, in a conducting 
surface layer where the fields decay exponentially over a distance of the skin 
depth. Between conducting boundaries, the wave number is real and describes 
propagating waves of arbitrary frequencies. As has been noted before, however, 
such plane waves lack electrical field components in the direction of propagation. 
In the following section, we will therefore derive conditions to obtain from (18.3) 
waves with longitudinal field components. 


18.1.2 Rectangular Waveguide Modes 


Significant modification of wave patterns can be obtained from the proximity of 
metallic boundaries. To demonstrate this, we evaluate the electromagnetic field of 
a wave propagating along the axis of a rectangular metallic pipe or rectangular 
waveguide as shown in Fig. 18.1. Since we are interested in getting a finite value 
for the z-component of the electrical field we try the ansatz 


E, = VO WO) ¥-(2) (18.7) 


Fig. 18.1 Rectangular waveguide 
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and look for boundary conditions that are required to obtain nonvanishing longitu- 
dinal fields. Insertion into (18.3) gives 


vil) W/O) Ww" oe 
: So = ep =P, 18.8 
“LO wo Vo “2 ae) 


where the r.h.s. is a constant while the functions y,,(w) are functions of the variable 


u = x, y, or z. In order that this equation be true for all values of the coordinates, the 
Vi (uw) 


ratios Wal) 


must be constant and we may write (18.8) in the form 
+R +k =P. (18.9) 


Differentiating (18.7) twice with respect to z results in the differential equation 
for the z-component of the electrical field 


PE, 
7 -E;, (18.10) 


which can be solved readily. The wavenumber k, must be real for propagating waves 
and with the definition 


k=kK+h, (18.11) 
we get with (18.9) 
RP=P—-R. (18.12) 


The solution (18.7) of the wave equation for the z-component of the electrical field 
is then finally 


E, = Eq Wy) bO , (18.13) 


The nature of the parameters in this equation will determine if the wave fields are 
useful for acceleration of charged particles. The phase velocity is given by 


cS (18.14) 


kJ 


An electromagnetic wave in a rectangular metallic pipe is propagating only if 
the phase velocity is real or k > k, and the quantity k, is therefore called the 
cutoff wave number. For frequencies with a wave number less than the cutoff 
value the phase velocity becomes imaginary and the wave decays exponentially like 


exp (-/lP =k). 
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Conducting boundaries modify electromagnetic waves in such a way that finite 
longitudinal electric field components can be produced which, at least in principle, 
can be used for particle acceleration. Although we have found solutions seemingly 
suitable for particle acceleration, we cannot use such an electromagnetic wave 
propagating in a smooth rectangular pipe to accelerate particles. Inserting (18.9) 
into (18.14) the phase velocity of a traveling waveguide mode in a rectangular pipe 
becomes 


c 
JER 1 — (ke/k)? 


and is with k > k, in vacuum or air (€ + ju & 1) larger than the velocity of light. 
There can be no net acceleration since the wave rolls over the particles, which cannot 
move faster than the speed of light. This problem occurs in a smooth pipe of any 
cross section. We must therefore seek for modifications of a smooth pipe in such 
a way that the phase velocity is reduced or to allow a standing wave pattern, in 
which case the phase velocity does not matter anymore. The former situation occurs 
for traveling wave linac structures, while the latter is specially suited for single 
accelerating cavities. 

For a standing wave pattern k, = 0 or k = k, and with (18.9), (18.11) the cutoff- 
frequency is 


Uph = (18.15) 


We = ‘ (18.16) 


To complete the solution (18.13) for transverse dimensions, we apply boundary 
conditions to the amplitude functions y, and y,. The rectangular waveguide with 
a width a in the x-direction and a height b in the y-direction (Fig. 18.1) be aligned 
along the z-axis. Since the tangential component of the electrical field must vanish 
at conducting surfaces, the boundary conditions are 


Vx(x)=0 for x= tha, 

18.17 

Wwo)=0 for y= hb. ( ) 

The solutions must be cosine functions to meet the boundary conditions and the 
complete solution (18.13) for the longitudinal electric field can be expressed by 


max NITY «4. 
E, = Eo cos cos el (tke) 
a 


, (18.18) 


where m > 1 and n > 1 are integers defining transverse field modes. The 
trigonometric functions are eigenfunctions of the differential equation (18.10) with 
boundary conditions (18.17) and the integers m and n are eigenvalues. In a similar 
way we get an expression for the z-component of the magnetic field strength B,. 
The boundary conditions require that the tangential magnetic field component at a 
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conducting surface is the same inside and outside the conductor which is equivalent 
to the requirement that 


OB. OB. 
wee =0 and z =, (18.19) 
ox 


x=ttha dy y=t5b 


11 
ae) 
These boundary conditions can be met by sine functions and the z-component of 


the magnetic field strength is therefore in analogy to (18.18) given by 


TU nIt . a 
B, = Bosin — sin = el orks) (18.20) 
a 


The cutoff frequency is the same for both the electrical and magnetic field 
component and is closely related to the dimension of the wave guide. With the 
definition (18.11) the cutoff frequency can be determined from 


R=4e42 =(™)'+(2). (18.21) 


All information necessary to complete the determination of field components 
have been collected. Using (18.3), (18.18) the component equations are with 2 = 
—ik, 


2 OE. 2 % OB. % 
—iwB, = 7 or ik,Ey, ie wk, = cay + ik-cBy , 


—iwB, = ~ik,E, —  , iew® Ey = —ik,cB, — c% , (18.22) 
: _ dFy _ dE, - wp _ _,oBy dBy 
—i0@B, = = — ty leu TE, = cae Cc oan 


From the first four equations we may extract expressions for the transverse field 
components E,, E,, B,, By as functions of the known z-components 


_ : 1 dE, oO OB. 
E,= hg (kG 4 20% 5 
Ey = ig (ke + 2% J , 
’ ke z dy c Ox (18 23) 
om aa fetta w dE; : 
nie! < z™ ax Me dy }? 


= 2s 1 OB. (a) JE, 
cBy = —1 2 (Kec yy TEL ) ; 


c Ox 


where 
e =e -(")'_(@) (18.24) 


and k? = ew? /c?. 
By application of proper boundary conditions at the conducting surfaces of a 
rectangular waveguide we have derived expressions for the z-component of the 
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electromagnetic fields and are able to formulate the remaining field components 
in terms of the z-component. Two fundamentally different field configurations can 
be distinguished depending on whether we choose E-, or B, to vanish. All field 
configurations, for which EF, = 0, form the class of transverse electrical modes or 
short TE-modes. Similarly, all fields for which B, = 0 form the class of transverse 
magnetic modes or short TM-modes. Each class of modes consists of all modes 
obtained by varying the indices m and n. The particular choice of these mode 
integers is commonly included in the mode nomenclature and we speak therefore 
of TM, or TE ,-modes. For the remainder of this chapter we will concentrate 
only on the transverse magnetic or TM-modes, since TE-modes are useless for 
particle acceleration. The lowest order TM-mode is the TM;;-mode producing the 
z-component of the electrical field, which is maximum along the z-axis of the 
rectangular waveguide and falls off from there like a cosine function to reach zero 
at the metallic surfaces. Such a mode would be useful for particle acceleration if 
it were not for the phase velocity being larger than the speed of light. In the next 
subsection we will see how this mode may be used anyway. The next higher mode, 
the TM>;-mode would have a similar distribution in the vertical plane but exhibits a 
node along the x-axis. 

Before we continue the discussion on field configurations we note that electro- 
magnetic waves with frequencies above cutoff frequency (k > k.) propagate along 
the axis of the rectangular waveguide. A waveguide wavelength can be defined by 


2 
ee ro (18.25) 
JP -® 


which is always longer than the free space wavelength A = 27/k and 


ane ee (18.26) 


where A, = 21/ke. 
The frequency of this traveling electromagnetic wave is from (18.25) with (18.9), 
(18.16) 


® = @,/1+ 4. (18.27) 


Electromagnetic energy travels along the waveguide with a velocity known as the 
group velocity defined by 


daw c k2 Cc 
pe = j= 2 2¢ (18.28) 
“dk, — ./me kee 
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In contrast to the phase velocity, the group velocity is always less than the speed 
of light as it should be. Rectangular waveguides are mostly used to transport high 
frequency microwaves from the generator to the accelerating cavity. The bandwidth 
of waveguides is rather broad and the mechanical tolerances relaxed. Small variation 
in dimension due to pressurization or evacuation to eliminate field breakdown do 
generally not matter. 


18.1.3 Cylindrical Waveguide Modes 


For accelerating cavities we try to reach the highest fields possible at a well defined 
wavelength. Furthermore, accelerating cavities must be operated under vacuum. 
These requirements result in very tight mechanical tolerances which can be met 
much easier in round rf-cavities. Analogous to the rectangular case we derive 
therefore field configurations in cylindrical cavities (Fig. 18.2). The derivation of 
the field configuration is similar to that for rectangular waveguides although now 
the wave equation (18.3) is expressed in cylindrical coordinates (r, y, z) and we get 
for the z-component of the electrical field 


VE, 10E, 10°F, OE; 


Zz 2 = 
dr’ = r or ‘ r ao? m az? le (18.29) 
with 
2 
eae. (18.30) 
mo 


In a stationary configuration the field is expected to be periodic in g while the 
z-dependence is the same as for rectangular waveguides. Using the derivatives - = 


Fig. 18.2 Cylindrical 
resonant cavity (pill box 


cavity) f 
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—im, where m is an integer eigenvalue, and 2 = —ik,, we get from (18.29) for the 
z-component of the electric field 
OL fy ae oe (18.31) 
ar. Or eo gee , 


where k2 = k°—k? consistent with its previous definition. This differential equation 
can be solved with Bessel’s functions in the form [5] 


Ez, = EqJ\in(ker)eot ©, (18.32) 


which must meet the boundary condition E, = 0 for r = a, where a is the radius of 
the cylindrical waveguide. The location of the cylindrical boundaries are determined 
by the roots of Bessel’s functions of order m. For the lowest order m = 0 the first 
root a, is (see Fig. 18.3) 


: 2.405 
ka, = 2.405 or at a radius q= . (18.33) 


ke 
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To define a cylindrical cavity two counter propagating waves are created by adding 
end caps at z = thd and with ik, = =P and (18.12) 


2 2,2 
2. @ mp 
= = a 


(18.34) 


Solving for the resonance frequency w of the lowest order or the TMoj9-mode, we 
get with (18.33), m = 0 and p = 0 


c 2.405 (18.35) 
010 = . 
010 Jen a, 
and the z-component of the electrical field is 

E, = Ezo10Jo (2.405 “) enw, (18.36) 

a 

The waveguide wavenumber 

R=RP-# (18.37) 


must be positive in order to obtain a travelling wave rather than a wave decaying 
exponentially along the waveguide (k; < 0). Solving for k, we get with w@, = ck, 


2 
=r (1 = oP (18.38) 
wW 


The cutoff frequency is determined by the diameter of the waveguide and limits 
the propagation of electromagnetic waves in circular waveguides to wavelengths 
which are less than the diameter of the pipe. To determine the phase velocity of the 
wave we set = wt — k-z = const and get from the derivative y = w — k,Z = 0 
the phase velocity 


(a) 


: 18.39 
a ( ) 


Uph = 2 = 


Inserting (18.38) into (18.39) we get again a phase velocity which exceeds the 
velocity of light and therefore any velocity a material particle can reach. We were 
able to modify plane electromagnetic waves in such a way as to produce the 
desired longitudinal electric field component but note that these fields are not yet 
suitable for particle acceleration because the phase rolls over the particles and the 
net acceleration is zero. To make such electromagnetic waves useful for particle 
acceleration further modifications of the waveguide are necessary to slow down the 
phase velocity. 

To complete our discussion we determine also the group velocity which 
is the velocity of electromagnetic energy transport along the waveguide. The 
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group velocity v, is defined by 


dw 

= : 18.40 
Ug dk. ( ) 

Differentiating (18.38) with respect to k, we get 

@ dw 

=> 5— 18.41 
c? dk, ( ) 
or with (18.39) the group velocity 
d ah; 
ia = ae eg, (18.42) 
© dk, w Uph 


SINCE Uph > C. 


TM-Mode Field Components in Cylindrical Waveguides 


Similar to the case of rectangular waveguides we can derive all field components. 
Since we know E, and B, we may use (18.2), expressed in cylindrical coordinates, 
to determine all other field components. Furthermore, we note from (18.32) that 
ie = —imand 5. d — _ik,. Maxwell’s equations (18.2) in cylindrical coordinates are 
in component fon 


iF E,= ik, Ey = iwB,, i? cB, — ik, Kh = iewCE,, 
ik, - a = +iWBy, Sk as = = —ieu* "Ey, (18.43) 
Ey oe “ +i7E, = —iwB,, *cBy te sel + + eR, = 1p Bas 


These equations can be used to define individual field components. For example, 
from the first equation (left) and the second equation (right) we may eliminate B, 
and solve for Ey in terms of E, and B,.We are mostly interested in TM-mode fields 
where B.. Conversely, for TE-modes E, = 0. The field components applicable to 
both modes are given by (18.44). For the TM-modes By = 0 and for TE-modes 
Eo = 0. 


jib 
i= ve mick + io i) 
E, = = Edin (ker) eilor mop— kez) 
cBr = 7 (—ik.cH + en 24.) , 
cBy =—-b (kK. "cB, + iew 2 ) ; 


c or 


cB, = BoJm(ker) etme), 


(18.44) 
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18.2 Rf-Cavities 


18.2.1 Square Cavities 


The waveguide modes are not yet ready to be used for particle acceleration because 
of excessive phase velocities. This problem can be solved by considering two waves 
travelling in opposite directions on the same axis of the waveguide. Both fields have 
the form of (18.18) and the superposition of both waves gives 


E, = 2Eo cos Pag cose iat, (18.45) 
a b d 
where d is defined by 
a= - (18.46) 


and p is an integer. 

The superposition of two equal but opposite waves form a standing wave with 
nodes half a waveguide length apart. Closing off the waveguide at such nodes points 
with a metallic surface fulfills automatically all boundary conditions. The resulting 
rectangular box forms a resonant cavity enclosing a standing electro-magnetic wave 
which can be used for particle acceleration.In analogy to the waveguide mode 
terminology we extend the nomenclature to cavities by adding a third index for 
the eigenvalue p. The lowest cavity mode is the TMj19-mode. The indices m and n 
cannot be zero because of the boundary conditions for E,. For p = O we find E, 
to be constant along the axis of the cavity varying only with x and y.The boundary 
conditions are met automatically at the end caps since with p = 0 also k = 0 and the 
transverse field components vanish everywhere. The electrical field configuration 
for the TM;;9-mode consists therefore of a finite E,-component being constant 
only along z and falling off transversely from a maximum value to zero at the 
walls. In practical applications rectangular boxes are rarely used as accelerating 
cavities. There are, however, special applications like beam position monitors where 
rectangular cavities are preferred. 


18.2.2 Cylindrical Cavity 


Similarly, we may form a cylindrical cavity by two counter propagating waves. By 
adding endcaps at z = + 5 d standing waves are established and with k, = Pe we 
get from (18.37) 


2 Land 


w pu 
= So alae (18.47) 
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Solving for the resonance frequency w of the lowest order or the TMoj9-mode, we 
get with (18.33), m = 0 and p = 0 


Co=——— (18.48) 


and the z-component of the electrical field is 
E, = 2E- o1030 (2.405) cos (@o10f) . (18.49) 
a 


The resonance frequency is inversely proportional to the radius of the cavity and to 
keep the size of accelerating cavities manageable, short wave radio frequencies are 
chosen. For electron linear accelerators a wavelength of A = 10cm is often used 
corresponding to a frequency of 2997.93 MHz and a cavity radius of aj = 3.83cm. 
For storage rings a common frequency is 499.65 MHz or A = 60cm and the 
radius of the resonance cavity is a, = 22.97cm. The size of the cavities is in both 
cases quite reasonable. For much lower rf-frequencies the size of a resonant cavity 
becomes large. Where such low frequencies are desired the diameter of a cavity 
can be reduced at the expense of efficiency by loading it with magnetic material 
like ferrite with a permeability 44 > 1 as indicated by (18.48). This technique also 
allows the change of the resonant frequency during acceleration to synchronize 
with low energy protons, for example, which have not yet reached relativistic 
energies. To keep the rf-frequency synchronized with the revolution frequency, the 
permeability of the magnetic material in the cavity can be changed by an external 
electrical current. The drawback of using materials like ferrites is that they are lossy 
in electromagnetic fields, get hot and produce significant outgassing in vacuum 
environments. 

The nomenclature for different modes is similar to that for rectangular waveg- 
uides and cavities. The eigenvalues are equal to the number of field maxima in 9, r 
and z and are indicated as indices in this order. The TMo9-mode, therefore exhibits 
only a radial variation of field strength independent of g and z. Again, we distinguish 
TM-modes and TE-modes but continue to consider only TM-modes for particle 
acceleration. Electrical fields in such a cavity have all the necessary properties for 
particle acceleration. Small openings along the z-axis allow the beam to pass through 
the cavity and gain energy from the accelerating field. Cylindrical cavities can be 
excited in many different modes with different frequencies. For particle acceleration 
the dimensions of the cavity are chosen such that at least one resonant frequency 
satisfies the synchronicity condition of the circular accelerator. In general this is the 
frequency of the TMoi9-mode which is also called the fundamental cavity mode or 
frequency. 

From the expressions (18.44) we find that the lowest order TM-mode does not 
include transverse electrical field components since k, = 0 and m = 0. The only 
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transverse field is the azimuthal magnetic field which is with (18.49) 


r ‘ 
Bo = —iE, oo); ( 2.405— ) eo" | (18.50) 
JE * ( *) 


18.2.3 Energy Gain 


The kinetic energy gained in such a cavity can be obtained by integrating the time 
dependent field along the particle path. The cavity center be located at z = 0 anda 
particle entering the cavity at time @o,9f = —2/2 or at z = —d/2 may encounter 
the phase 6 of the microwave field. The electric field along the z-axis as seen by the 
particle travelling with velocity v has the form FE, = E,9 sin (we + 5) and we get 
for the kinetic energy gain of a particle passing through the cavity with velocity v 


d 


Nie 


AE kin = cE | 


cos (o= +38) dz. (18.51) 


1 
xd 


In general, the change in the particle velocity is small during passage of one 
rf-cavity and the integral is a maximum for 6 = 2/2 when the field reaches a 
maximum at the moment the particle is half way through the cavity. Defining an 
accelerating cavity voltage 


Vit = E,od = Eoiod (18.52) 


the kinetic energy gain is after integration 


sin 5, 
AExin = eVit = eVyy, (18.53) 


T-— . (18.54) 


The transit-time factor provides the correction on the particle acceleration due to 
the time variation of the field while the particles traverse the cavity. In a resonant pill 
box cavity (Fig. 18.4a) we have d = 4/2 and the transit-time factor for a particle 
traveling approximately at the speed of light is 


2 
Thithox = — < 1. (18.55) 
a 
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Fig. 18.4 Resonant cavities with drift tubes (schematic). (a) Pill box cavity. (b) Cavity with nose 
cones. (c) Cavity with drift tubes 


As the cavity length or the active accelerating gap in the cavity is reduced, the 
transient time factor can be increased. The simple pill box cavity may be modified 
by adding nose cones (Fig. 18.4b) or by adding drift tubes at the entrance and exit 
of the cavity as shown in Fig. 18.4c. In this case the parameter d in (18.54) is the 
active accelerating gap. 

For small velocities (v « c) the transit time factor and thereby the energy gain 
is small or maybe even negative. Maximum energy gain is obtained for particles 
travelling at or close to the speed of light. Externally driven accelerating cavity 


18.2.4 Rf-Cavity as an Oscillator 


Accelerator cavities can be described as damped oscillators with external excitation. 
Damping occurs due to energy losses in the walls of the cavity and transfer of energy 
to the particle beam while an external rf-power source is connected to the cavity 
to sustain the rf-fields. Many features of an accelerating cavity can be expressed 
in well-known terms of a damped, externally excited harmonic oscillator which is 
described in the form 


% + 2ak + wpx = De, (18.56) 


where a is the damping decrement, wp the unperturbed oscillator frequency and 
D the amplitude of the external driving force with frequency w. The equilibrium 
solution can be expressed in the form x = Ae’, where the complex amplitude A is 
determined after insertion of this ansatz into (18.56) 

D iw 


A= =ae (18.57) 


a, — w? + i2aw 
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Fig. 18.5 Resonance curve 
for a damped oscillator 


356 357 358 359 360 
frequency (MHz) 


The angle W is the phase shift between the external excitation and the oscillator 
and the amplitude a =Re(A) is from (18.57) 


D 
a= (18.58) 


V (2 — 2)’ + 402w? 


Plotting the oscillation amplitude a as a function of the excitation frequency w, 
we get the resonance curve for the oscillator as shown in Fig. 18.5. The resonance 
frequency at which the oscillator reaches the maximum amplitude depends on the 


damping and is 
@; = 1/4 — 2a?. (18.59) 


For an undamped oscillator the resonance amplitude becomes infinite but is finite 
whenever there is damping. The oscillator can be excited within a finite distance 
from the resonance frequency and the width of the resonance curve at half maximum 
amplitude is 


w~t2/3a for a<a,. (18.60) 


If there were no external excitation to sustain the oscillation, the amplitude would 
decay like a xe~*’. The energy of the oscillator scales like W « A” and the energy 
loss per unit time P = —dW/dt = 2aW , which can be used to determine the quality 
factor of this oscillator as defined in (18.80) 


Ow, 


Q= (18.61) 


=a 
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The quality factor is reduced as damping increases. For the case of an accel- 
erating cavity, we expect therefore a higher Q-value called the unloaded Qy when 
there is no beam, and a reduced quality factor called loaded Q when there is a beam 
extracting energy from the cavity. The time constant for the decay of oscillation 
amplitudes or the cavity damping time is 


1 20 
ga = 
a On 


(18.62) 


and the field amplitude decays to 1/e during Q/z oscillations. 
Coming back to the equation of motion (18.56) for this oscillator, we have the 
solution 


x) = ale) (18.63) 
noting that the oscillator assumes the same frequency as the external excitation but 


is out of synchronism by the phase YW. The magnitude and sign of this phase shift 
depends on the excitation frequency and can be derived from (18.57) in the form 


2 24; D ww 2 es 
oO, —@ a; = (cog sin) 3 


Both the real and imaginary parts must separately be equal and we get for the 
phase shift between excitation and oscillator 


d= 59 2 
wre"  “'§ ~o99" —™. (18.64) 
2aw Or 


where we have made use of (18.61) and the approximation w ~ @,. For excitation at 
the resonance frequency we find the oscillator to lag behind the driving force by 50 
and is almost in phase or totally out of phase for very low or very high frequencies, 
respectively. In rf-jargon this phase shift is called the tuning angle. 


18.2.5 Cavity Losses and Shunt Impedance 


Radio frequency fields can be enclosed within conducting surfaces only because 
electrical surface currents are induced by these fields which provide the shielding 
effect. For a perfect conductor with infinite surface conductivity these currents 
would be lossless and the excitation of such a cavity would persist indefinitely. This 
situation is achieved to a considerable degree, albeit not perfect, in superconducting 
cavities. In warm cavities constructed of copper or aluminum the finite resistance 
of the material causes surface currents to produce heating losses leading to a 
depletion of field energy. To sustain a steady field in the cavity, radio frequency 
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power must be supplied continuously. The surface currents in the conducting cavity 
boundaries can be derived from Maxwell’s curl equation or Ampere’s law (18.2). In 
cylindrical coordinates this vector equation becomes for the lowest order TM-mode 
in component form 


OBg : 
pe = Lol » 
va 
0= jg , (18.65) 
orB, B 0B € 
? ud + i = obj, + i ok, 
ror r or Cc 


Because we do not consider perfectly but only well conducting boundaries, 
we expect fields and surface currents to penetrate somewhat into the conducting 
material. The depth of penetration of fields and surface currents into the conductor 
is well-known as the skin depth [3] 


2 
bs = _—-_ ; (18.66) 
Lolw® Ow 


where o,, is the conductivity of the cavity wall and j2,, the permeability of the wall 
material. The azimuthal magnetic field component induces surface currents in the 
cylindrical walls as well as in the end caps. In both cases the magnetic field decays 
within a skin depth from the surface inside the conductor. The first Eq. (18.65) 
applies to the end caps and the integral through the skin depth is 


/ oO dz x By(r)|S*® x —By(r,S), (18.67) 
Ss 


since By(r, S+46,) ~ O just under the surface S of the wall. We integrate also the third 
Eq. (18.65) at the cylindrical walls and get for the first term [ B,/rdr ~ B,6,/a,, 
which is negligible small, while the second term has a form similar to (18.67). The 
electrical term E, vanishes because of the boundary condition and the surface current 
densities for the cylindrical wall and end caps, respectively, are therefore related to 
the magnetic fields by 


Hobbs = Bo(a, 2), 
: 18.68 
LoLjrds = Bo(r, + +d) ; ( ) 


The cavity losses per unit wall surface area are given by 


, (18.69) 
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where j, is the surface current density and 7, is the surface resistance given by 


= jo (18.70) 
2 Ow 


With js = j;z 5s, (18.50), (18.66) and the integration of (18.69) is performed over 
all inside surfaces of the cavity to give 


2 2 r 
Py = Feywd, €— F Bi | F (2.405) dS, (18.71) 
1 


where € and sz is the dielectric constant and permeability of the material inside 
the cavity, respectively and j1,, the wall permeability. Evaluating the integral over 
all surfaces, we get for the cylindrical wall the integral value 27a\dJ}(2.405). For 
each of the two end caps the integral 27r f % Ji(2.405 a r dr must be evaluated and 
is from integration tables [6] 


ay 
2n / J? (2.4057) rdr = majJj (2.405). (18.72) 
0 a 


The total cavity wall losses become finally with Vi¢ = Eoiod from (18.52) 


d 
Poy =4rennbye V3 (2. 492 (18.73) 
be 


It is convenient to separate fixed cavity parameters from adjustable parameters. 
Once the cavity is constructed, the only adjustable parameter is the strength of the 
electrical field Eoi9 or the effective cavity voltage V.y. Expressing the cavity losses 
in terms of an impedance, we get from (18.73) and (18.53) 


a (18.74) 


where the cavity shunt impedance including transient time factor is defined by! 


2 
i) 4 a 1 sin 24 
tet p ( 2v ) (18.75) 


TE) WSsE [lw ai(a1 + d) J7(2.405) \ 94 


2v 


The factor of 2 in (18.74) results from the fact that on average the rf-voltage 


is (v2, = Ve sin? wt) = ee In accelerator design, we prefer sometimes to use 


'The shunt impedance is defined in the literature sometimes by Pey = V2, /Rs in which case the 
numerical value of the shunt impedance is larger by a factor of two. 
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the shunt impedance per unit length or the specific shunt impedance. The required 
length depends on the accelerating voltage needed and the rf-power available. With 
the cavity shunt impedance per unit length 


R, 
= 18.76 
oar ( ) 
the cavity losses are instead of (18.74) 
V2 
fo = (18.77) 
2rsLey 


where L,, is the total length of all cavities producing the voltage Ve Since the 
cavity shunt impedance scales like R, « 1/./@ and the length for a resonant cavity 
like d « 1/q, the specific shunt impedance is proportional to the square root of the 
rf-frequency r, « ./@ favoring high frequencies. A practical limit is reached when 
the cavity apertures become too small for the particle beam to pass through or when 
the size of the cavities prevents an efficient cooling of wall losses. 

As an example, we calculate from (18.75) the shunt impedance for a pill box 
cavity designed for a resonance frequency of 358 MHz. The wavelength is A = 
85cm, the cavity length d = 42.5cm and the cavity radius aj = 32.535cm. This 
cavity was constructed with nose cones for the storage ring PEP [7] from aluminum. 
With a skin depth of 6, = 4.44 «1m the specific shunt impedance becomes r, = 15.2 
MQ2/m while the measured value for this cavity is 18.0 MQ/m. 

The difference is due to two competing effects. The open aperture along the axis 
for the beam has the tendency to reduce the shunt impedance while the nose cones 
being a part of the actual cavity increase the transient time factor and thereby the 
effective shunt impedance (18.75). The simple example of a pill box cavity produces 
rather accurate results, however, for more precise estimates computer programs 
have been developed to calculate the mode frequencies and shunt impedances for 
all modes in arbitrary rotational symmetric cavities (for example, SUPERFISH [8] 
or URMEL [9]). More sophisticated three-dimensional programs are available (for 
example, MAFIA [9]) to simulate rf-properties of arbitrary forms of cavities. 

The specific shunt impedance for a pill box cavity can be expressed in a simple 
form as a function of the rf-frequency only and is for realistic cavities approximately 


rs(M&82/m) ~ 1.28 / fre (MHz) for copper and 
. (18.78) 
rs(M&2/m) ~ 1.06./ fre (MHz) for aluminum. 


The shunt impedance should be maximum in order to minimize cavity losses for 
a given acceleration. Since the interior of the cavity must be evacuated up = € = | 
and {ty = | because we do not consider magnetic materials to construct a cavity. 
The only adjustable design parameters left are the skin depth and the transient time 


18.3 Rf-Parameters 623 


factor. The skin depth can be minimized by using well conducting materials like 
copper or aluminum. 

To derive the quality factor of the cavity the energy W stored in the electro- 
magnetic field within the cavity must be calculated. The field energy is the volume 
integral of the square of the electrical or magnetic field and we have in case of a 
TMo10-mode with W = 5&0 € 1 E? dV and (18.49) for the stored cavity energy 


W = 4e0€EpidazJ{ (2.405) . (18.79) 


The quality factor Q of a resonator is defined as the ratio of the stored energy to the 
energy loss per radian 


tored W 
O54 Storedenergy (18.80) 
energy loss/cycle Poy 


or with (18.73), (18.79) 


_ d fw a\ 


Q= i bad 


(18.81) 


The quality factor determines the cavity time constant since the fields decay 
exponentially like e~/"» due to wall losses, where Tey is the cavity time constant 
and the decay rate of the stored energy in the cavity is 


——wW. (18.82) 


The change in the stored energy is equal to the cavity losses P., and the cavity time 
constant is with (18.80) 


2w 2 
ty = = 20 (18.83) 
Poy wo 


which is equal to (18.62) and also called the cavity filling time because it describes 
the build up time of fields in a cavity following a sudden application of rf-power. 


18.3. Rf-Parameters 


A variety of rf-parameters has to be chosen for a circular accelerator. Some 
parameters relate directly to beam stability criteria and are therefore easy to 
determine. Other parameters have less of an impact on beam stability and are 
often determined by nonphysical criteria like availability and economics. Before 
rf-parameters can be determined a few accelerator and lattice parameters must be 
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known. Specifically, we need to know the desired minimum and maximum beam 
energy, the beam current, the circumference of the ring, the momentum compaction 
factor, and the bending radius of the magnets. Further, we make a choice of the 
maximum desired rate of particle acceleration per turn or determine the energy 
loss per turn to synchrotron radiation which needs to be compensated. During the 
following discussion we assume that these parameters are known. 

One of the most prominent parameters for rf-accelerating systems is the rf- 
frequency of the electromagnetic fields. For highly relativistic beams there is no 
fundamental reason for a particular choice of the rf-frequency and it can therefore 
be selected on technical and economic grounds. The rf-frequency must, however, 
be an integer multiple, the harmonic number, of the particle revolution frequency. 
The harmonic number can be any integer from a beam stability point of view. In 
specific cases, the harmonic number need to be a multiple of a smaller number. 
Considering, for example, a colliding beam facility with Np collision points an 
optimum harmonic number is divisible by Mp/2. In this case Njp/2 bunches 
could be filled in each of the two counter rotating beams leading to a maximum 
collision rate. Other such considerations may require the harmonic number to 
contain additional factors. In general, most flexibility is obtained if the harmonic 
number is divisible by small prime numbers. 

Within these considerations the harmonic number can be chosen from a large 
range of rf-frequencies without generally affecting beam stability. Given complete 
freedom of choice, however, a low frequency is preferable to a high frequency. 
For low rf-frequencies the bunch length is longer and electromagnetic interaction 
with the beam environment is reduced since high frequency modes are not excited 
significantly. A longer bunch length also reduces the particle density in the bunch 
and thereby potentially troublesome intra-beam scattering [10, 11]. In proton and 
heavy ion beams a longer bunch length leads to a reduced space charge tune 
shift and therefore allows to accelerate a higher beam intensity. For these reasons 
lower frequency systems are used mostly in low energy circular accelerators. The 
downside of low rf-frequencies is the fact that the accelerating cavities become very 
large or less efficient and rf-sources are limited in power capability. 

The size of circular accelerators imposes a lower limit on the rf-frequency 
since the synchronicity condition requires that the rf-frequency be at least equal 
to the revolution frequency in which case the harmonic number is equal to unity. 
A higher harmonic number to accommodate more than a single particle bunch 
further increases the required rf-frequency. Most electron and very high energy 
proton accelerators operate at rf-frequencies of a few hundred MHz, while lower 
frequencies are preferred for ion or medium energy proton accelerators. 

For some applications it is critical to obtain short particle bunches which is 
much easier to achieve with a high rf-frequency. The appropriate choice of the rf- 
frequency therefore dependents much on the desired parameters for the particular 
application and is mostly chosen as a compromise between competing requirements 
including economic considerations like cost and availability. 
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18.3.1 Synchronous Phase and Rf-voltage 


The most common use of an rf-system is for acceleration while particles pass 
through a resonant cavity at the moment when the voltage reaches the crest of the 
rf-wave and particles gain a kinetic energy equivalent to the full cavity voltage. 
This is the general accelerating mode in linear accelerators. In circular accelerators, 
however, the principle of phase focusing requires that particles be accelerated off 
the crest at a synchronous phase W;, where the effective accelerating voltage is 
Y= ee sin y;. The peak rf-voltage Ve and the synchronous phase are determined 
by the desired energy acceptance and acceleration per turn. 

The energy acceptance of a circular accelerator has been derived in Chap. 9, 
is proportional to the square root of the cavity voltage and must be adjusted for 
the larger of several energy acceptance requirements. To successfully inject a beam 
into a circular accelerator the voltage must be sufficiently large to accept the finite 
energy spread in the injected beam. In addition, any phase spread or timing error of 
the incoming beam translates into energy errors due to synchrotron oscillations. For 
acceleration of a high intensity beam an additional allowance to the rf-voltage must 
be made to compensate beam loading, which will be discussed later in more detail. 

After injection into a circular accelerator an electron beam may change con- 
siderably its energy spread due to quantum excitation as a result of emitting 
synchrotron radiation. This energy spread has a Gaussian distribution and to assure 
long beam lifetime the energy acceptance must be large enough to contain at 
least seven standard deviations. In proton and heavy ion accelerators some phase 
space manipulation may be required during the injection process which contributes 
another lower limit for the required rf-voltage. In general, there are a number of 
requirements that determine the ultimate energy acceptance of an accelerator and 
the most stringent requirement may very well be different for different accelerator 
designs and applications. Generally, circular accelerators are designed for an energy 
acceptance of a few percent. 


18.4 Linear Accelerator 


The phase velocity vp, must be equal to the particle velocity vp for efficient 
acceleration and we need therefore to modify or “load” the wave guide structure to 
reduce the phase velocity to become equal to the particle velocity. This can be done 
by inserting metallic structures into the aperture of the circular wave guide. Many 
different ways are possible, but we will consider only the disk loaded waveguide 
which is the most common accelerating structure for electron linear accelerators. 

In a disk loaded waveguide metallic plates are inserted normal to the waveguide 
axis at periodic intervals with iris apertures to allow for the passage of the particle 
beam as shown in Fig. 18.6. 
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beam path 


Fig. 18.6 Disk loaded accelerating structure for an electron linear accelerator (schematic) 


The boundary conditions and therefore the electromagnetic fields in such a 
structure are significantly more complicated than those in a simple circular tube. 
It would exceed the goal of this text to derive the theory of disk loaded waveguides 
and the interested reader is referred to the review article by Slater [12]. 

Insertion of disks in periodic intervals into a uniform waveguide essentially 
creates a sequence of cavities with electromagnetic coupling through either the 
central hole, holes at some radius on the disks or external coupling cavities. The 
whole arrangement of cells acts like a band pass filter allowing electromagnetic 
fields of certain frequencies to propagate. By proper choice of the geometric 
dimensions the pass band can be adjusted to the desired frequency and the phase 
velocity can be designed to be equal to the velocity of the particles. For electron 
linear accelerators the phase velocity is commonly adjusted to the velocity of light 
since electrons quickly reach such a velocity. 


18.4.1 Basic Waveguide Parameters 


Without going into structure design and detailed determination of geometric 
parameters we can derive parameters relating to the acceleration capability of such 
structures. Conservation of energy requires that 


OW OP 
— + — 4+ Py +nevE, = 0, (18.84) 
i dz 


where W is the stored energy per unit length, P the energy flux along z, P,, wall 
losses per unit length and nev E, the energy transferred to n particles with charge e 
each moving with the velocity v in the electric field E,. The wall losses are related 
to the quality factor Q of the structure defined by 


wow 
= —_, 18. 
Q P. (18.85) 
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where Py /q@ are wall losses per unit length and per radian of field oscillation. The 
energy flux P is with the group velocity v, 


P= vU,W. (18.86) 
In case of equilibrium, the stored energy in the accelerating structure does not 
change with time, OW/dt = 0, and 


oP : wP. 
— = —Py — iE, = ——~ —iEz, (18.87) 
Oz UgQ 


where i, = nev is the beam current. Considering the case of small beam loading 
ip E, < wP/(vgQ) we may integrate (18.87) to get 


P = Poexp (-=)) = Pye ™, (18.88) 
g 


where we have defined the attenuation coefficient 


(a) 


040° 


2a = (18.89) 


Equation (18.88) shows an exponential decay of the energy flux along the 
accelerating structure with the attenuation coefficient 2~. The wall losses are often 
expressed in terms of the total voltage or the electrical field defined by 


y FP 
Py = —-=—, 18.90 
ee ( ) 
where Z, = r,L is the shunt impedance for the whole section, E the maximum 


value of the accelerating field, E, = Ecos Ws, Ws the synchronous phase at which 
the particle interacts with the wave, r, the shunt impedance per unit length, and L 
the length of the cavity. From (18.90) we get with (18.87) and (18.89) for negligible 
beam current the accelerating field 

A @ Ts 


P= road = 2ar,P. (18.91) 
g 


The total accelerating voltage along a structure of length L is 


Es L 
Vo = / E.dz = Ecos, / e % dz (18.92) 
0 0 
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or after integration 


Vp = —~—Ecos yy. (18.93) 
a 
Defining an attenuation factor t by 
t=aL (18.94) 


we get with (18.91) for the total accelerating voltage per section of length L 


Se 


i= 
Vo Slew oe 


C 


COs Ws. (18.95) 


The maximum energy is obtained if the particles are accelerated at the crest of 
the wave, where wy, = 0. 

Tacitly it has been assumed that the shunt impedance r, is constant resulting 
in a variation of the electrical field strength along the accelerating section. Such a 
structure is called a constant impedance structure and is characterized physically by 
equal geometric dimensions for all cells. 

In a constant impedance structure the electric field is maximum at the beginning 
of the section and drops off toward the end of the section. A more efficient use 
of accelerating sections would keep the electric field at the maximum possible 
value just below field break down throughout the whole section. A structure with 
such characteristics is called a constant gradient structure because the field is now 
constant along the structure. 

As an example for an electron linear accelerator, the SLAC constant gradient 
linac structure has the following parameters [13] 


fit = 2856 MHz L= 10ft = 3.048m 
rs =53M2/m = aj = 0.040 m (18.96) 
OQ = 12000 Tt = 0.57 


A constant gradient structure can be realized by varying the iris holes in the disks 
to smaller and smaller apertures along the section. This kind of structure is actually 
used in the SLAC accelerator as well as in most modern linear electron accelerators. 
The field E = const and therefore from (18.88) with (18.94) 


dP P(L)—Po 97, Po 
oes oe ==(( 6") 18.97 
Oz L ( a L ( ) 
On the other hand, we have from (18.87) 
oP P. 
= ® ~ const (18.98) 


az Ovg 
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and to make dP/dz constant the group velocity must vary linearly with the local 
rf-power like 


aP 
vg ~ P(z) = Po+ ae (18.99) 


Furthermore, since dP/dz < 0 the group velocity is made to decrease along the 
section by reducing gradually the iris radii. From (18.98) 


w P(z) 
e(Z) = -— 18.100 
"2) =~ 6 ap/az mee 
or with (18.97) 
oP 2 
wPot+ 52 oL—(1—e7)z 
= a 18.101 
Ys) = — 9 pp ac OQ 1—e ( ) 
and the filling time is after integration of (18.101) 
L 
d 
gf Baoee. (18.102) 
0 Ug @ 


The electric field in the accelerating section is from (18.90) with (18.87) 


: aP 
B= yr | (18.103) 


and the total accelerating voltage Vo or gain in kinetic energy per section is 


L 
AEkin = eVo = ef E,dz = eV rsLPoV 1 — e~** cos Wy , (18.104) 
0 


where w, is the synchronous phase at which the particles travel with the electromag- 
netic wave. The energy gain scales with the square root of the accelerating section 
length and rf-power delivered. 

As a numerical example, we find for the SLAC structure from (18.104) the gain 
of kinetic energy per 10 ft section as 


AEkin (MeV) = 10.48. Po (MW) , (18.105) 


where Po is the rf-power delivered to the section. The energy gain (18.105) is 
the maximum value possible ignoring beam loading or energy extraction from the 
fields by the beam. The total accelerating voltage is reduced when we include beam 
loading due to a pulse current i,. Referring the interested reader to reference [13] 
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we only quote the result for the energy gain in a linear accelerator with constant 
gradient sections including beam loading 


2te 7" 
Vi = VrsLPoV1 — 0 — Sigrsh (1 - =) (18.106) 


For the SLAC linac structure this equation becomes with t = 0.57 


Exin (MeV) = 10.48. Po (MW) — 37.47ip (A). (18.107) 


The beam loading depends greatly on the choice of the attenuation factor t 
as is shown in Figs. 18.7 and 18.8 where the coefficients f, = V1 -—e?* and 


=2t ‘ . : 
fi=i (1 - a) are plotted as functions of rt. Both coefficients increase as the 


I 

attenuation factor is increased and reach asymptotic limits. The ratio f,/f,, however, 
decreases from infinity to a factor two which means that beam loading occurs much 
stronger for large values of the attenuation factor compared to low values. During the 


attenuation factor: T 


Fig. 18.7 Energy coefficient f, as a function of t 
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Fig. 18.8 Beam loading coefficient f, as a function of t 
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Fig. 18.9 Optimum beam current as a function of t 
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Fig. 18.10 Linac efficiency as a function of beam current 


design of the linac structure it is therefore useful to know the intended use requiring 
different optimization for high-energy or high-current acceleration. 

We may also ask for the efficiency of transferring rf-power into beam power 
which is defined by 


ip Vi uo emer 2 2re7* 
pe wh =a sins (1 = ) (18.108) 
0 


Po 0 7 1—e72" 
The linac efficiency has clearly a maximum and the optimum beam current is 


ee (1 _ ey 
rs5L 1 — (1 + 2t)e72* © 


(18.109) 


ip,opt = 


The optimum beam current is plotted in Fig. 18.9 as a function of the attenuation 
coefficient t and the linac efficiency is shown in Fig. 18.10 as a function of beam 
current in units of the optimum current with the attenuation factor as a parameter. 
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The optimum beam current increases as the attenuation factor is reduced while 
the linac efficiency reaches a maximum for the optimum beam current. 


18.4.2 Particle Capture in a Linear Accelerator Field* 


The capture of particles and the resulting particle energy at the end of the acceler- 
ating section depends greatly on the relative synchronism of the particle and wave 
motion. If particles with velocity vp are injected at low energy (vp <K c) into an 
accelerator section designed for a phase velocity Up, > vp the electromagnetic 
wave would roll over the particles with reduced acceleration. The particle velocity 
and phase velocity must be equal or at least close to each other. Because small 
mismatches are quite common, we will discuss particle dynamics under those 
circumstances and note that there is no fundamental difference between electron 
and proton linear accelerators. The following discussion is therefore applicable to 
any particle type being accelerated by traveling electromagnetic fields in a linear 
accelerator. 

We observe the relative motion of both the particle and the wave from the 
laboratory system. During the time Ar particles move a distance Az, = v, At and 
the wave a distance Az), = Upp At. The difference in the distance traveled can be 
expressed in terms of a phase shift 


Az 
Ay = —k(Azph — Azp) = —k(Uph — Up) (18.110) 
Pp 
The wave number k is 


2 
pe (18.111) 
Uph AxtUph 


and inserted into (18.110) the relative phase shift over a distance Az, becomes 


Ay = = Ag, (18.112) 


To complete the equation of motion we consider the energy gain of the particles 
along the same distance Az, which is 


AEkin = —eE-(y)Azp. (18.113) 


Equations (18.112) and (18.113) form the equations of motion for particles 
in phase space. Both equations are written as difference equations for numerical 
integration since no analytic solution exists. For the most trivial case Uph = Up 
and w = const allowing easy integration of (18.113). This trivial case becomes the 
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Fig. 18.11 Capture of electrons in a3 m linac section for initial phase Wo and initial kinetic energy 
Exin,o- Contour lines are lines of constant particle energy in MeV at the end of the section. The phase 
Wo = 0 corresponds to the crest of the accelerating wave 


overwhelming common case for electrons which reach a velocity very close to the 
speed of light. Consistent with this, most accelerating sections are dimensioned for 
a phase velocity equal to the speed of light. 

As an illustrative example, we integrate (18.112) and (18.113) numerically to 
determine the beam parameters at the end of a single 3 m long accelerating section 
(Uph = c) for an initial particle distribution in phase and momentum at the entrance 
to the accelerating section. This situation is demonstrated in Fig. 18.11 for a constant 
field gradient of E = 12.0 MeV/m. The momentum and phase at the end of the 
accelerating section are shown as functions of the initial momentum and phase. We 
note from Fig. 18.11 that particles can be captured in the accelerating field only 
in the vicinity of W ~ 0 to +90° at almost any initial phase and momentum. At 
phases from yo ~*~ —45 to —160° slow particles at sub-relativistic energies loose 
whatever little energy they had to move randomly in the rf-wave rolling over them. 
On the other hand, particles which enter the accelerating section ahead of the crest 
(Wo 2 0°) gain maximum momentum while the wave’s crest moves over them. 

Such diagrams calculated for particular parameters under consideration provide 
valuable information needed to prepare the beam for optimum acceleration. The 
most forgiving operating parameters are, where the contour lines are far apart. In 
those areas a spread in initial phase or energy has little effect on the final phase or 
energy. If a beam with a small energy spread at the end of acceleration is desired, 
the initial phase should be chosen to be at small positive values or just ahead of the 
wave crest as shown in Fig. 18.11. Even for a long bunch the final energy spread is 
small while reaching the highest total energy. 

On the other hand, if a short bunch length at the end of acceleration is of biggest 
importance, an initial phase of around wo ~ 100° seems to be more appropriate. In 
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this case, however, the final energy is lower than the maximum possible energy and 
the energy spread is large. 

Once the particular particle distribution delivered to the linear accelerator and the 
desired beam quality at the end is known one can use Fig. 18.11 for optimization. 
Conversely such diagrams can be used to judge the feasibility of a particular design 
to reach the desired beam characteristics. 


18.5 Preinjector and Beam Preparation* 


Although the proper choice of the initial rf-phase with respect to the particle beam 
greatly determines the final beam quality, the flexibility of such adjustments is 
limited. Special attention must be given to the preparation of the beam before 
acceleration. In most cases, particles are generated in a continuous stream or from a 
microwave source of different frequency. Depending on the particle source, special 
devices are used for initial acceleration and bunching of the beam. We will discuss 
basic principles of beam preparation. 


18.5.1 Prebuncher 


Many particle sources, be it for electrons, protons or ions, produce a continuous 
stream of particles at modest energies limited by electrostatic acceleration between 
two electrodes. Not all particles of such a beam will be accelerated because of the 
oscillatory nature of the accelerating field. For this reason and also in case short 
bunches or a small energy spread at the end of the linac is desired, the particles 
should be concentrated at a particular phase. This concentration of particles in 
the vicinity of an optimum phase maximizes the particle intensity in the bunch in 
contrast to a mechanical chopping of a continuous beam. To bunch particles requires 
specific beam manipulation which we will discuss here in more detail. 

A bunched beam can be obtained from a continuous stream of nonrelativistic 
particles by the use of a prebuncher. The basic components of a prebuncher is 
an rf-cavity followed by a drift space. As a continuous stream of particles passes 
through the prebuncher, some particles get accelerated and some are decelerated. 
The manipulation of the continuous beam into a bunched beam is best illustrated in 
the phase space diagrams of Fig. 18.12. 

Figure 18.12a shows the continuous particle distribution in energy and phase at 
the entrance of the prebuncher. Depending on the phase of the electric field in the 
prebuncher at the time of passage, a particle becomes accelerated or decelerated 
and the particle distribution at the exit of the prebuncher is shown in Fig. 18.12b. 
The particle distribution has been distorted into a sinusoidal energy variation. 
Since the particles are nonrelativistic the energy modulation reflects also a velocity 
modulation. We concentrate on the origin of the distribution at ¢ = O and 
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Fig. 18.12 Phase space diagrams for a continuous beam passing through a prebuncher. Before 
acceleration (a) and right after (b). A distance L downstream of the buncher cavity the phase space 
distribution shows strong bunching (c). [Note: the beam moves from left to right] 


AExin = 0 as the reference phase and note that particles ahead of this reference 
phase have been decelerated and particles behind the reference phase have been 
accelerated. Following this modulated beam through the drift space we observe due 
to the velocity modulation a bunching of the particle distribution which reaches a 
maximum at some distance as shown in Fig. 18.12c. A significant beam intensity 
has been concentrated close to the reference phase of the prebuncher. 

The frequency used in the prebuncher depends on the desired bunch distribution. 
For straight acceleration in a linear accelerator one would choose the same 
frequency for both systems. Often, however, the linear accelerator is only an injector 
into a bigger circular accelerator which generally operates at a lower frequency. 
For optimum matching to the final circular accelerator the appropriate prebuncher 
frequency would be the same as the cavity frequency in the circular accelerator 
cavity. 
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The effect of the prebuncher can be formulated analytically in the vicinity of the 
reference phase. At the exit of the prebuncher, operating at a voltage V = Vo sing, 
the energy spread is 


AEkin = eVo sing = mc” By? AB , (18.114) 


which is related to a velocity spread Af. Perfect bunching occurs a time Aft later 
when for sing ~ @ 


CApAr= Le, (18.115) 
20 


where A,- is the rf-wavelength in the prebuncher cavity. Solving for At we get for 
nonrelativistic particles with y = 1 and B < 1 


oem (18.116) 
2m eVo 
and optimum bunching occurs a distance L downstream from the cavity 
(=e (18.117) 
—=si bs = ¥ . 
. kweVo 


where vo is the velocity of the reference particle and kp = 27/A+-. The minimum 
bunch length in this case is then 


_ SExin 


6L= ; 
kreVo 


(18.118) 


where d£in is the total energy spread in the beam before the prebuncher. 

In this derivation, we have greatly idealized the field variation being linear instead 
of sinusoidal. The real bunching is therefore less efficient than the above result and 
shows some wings as is obvious from Fig. 18.12c. In a compromise between beam 
intensity and bunch length one might let the bunching go somewhat beyond the 
optimum and thereby pull in more of the particle intensity in the wings. 

There are still particles between the bunches which could either be eliminated by 
an rf-chopper or let go to be lost in the linear accelerator because they are mainly 
distributed over the decelerating field period in the linac. 


18.5.2 Beam Chopper 


A conceptually simple way to produce a bunched beam is to pass a continuous 
beam from the source through a chopper system, where the beam is deflected across 
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Fig. 18.13 Principal functioning of a chopper system 


a narrow slit resulting in a pulsed beam behind the slit. The principle components 
of such a chopper system are shown in Fig. 18.13. 

As was mentioned in the previous section this mode of bunching is rather 
wasteful and therefore an rf-prebuncher which concentrates particles from a large 
range of phases towards a particular phase is more efficient for particle bunching. 
However, we still might want to add a beam chopper. 

One such reason could be to eliminate most of the remaining particles between 
the bunches from the prebuncher. Although these particles most likely get lost 
during the acceleration process a significant fraction will still reach the end of the 
linac with an energy spread between zero and maximum energy. Because of their 
large energy deviation from the energy of the main bunches, such particles will be 
lost in a subsequent beam transport system and therefore create unnecessary high 
radiation levels. It is therefore prudent to eliminate such particles at low energies. A 
suitable device for that is a chopper which consists of an rf-cavity excited similar to 
the prebuncher cavity but with the beam port offset by a distance r from the cavity 
axis. In this case the same rf-source as for the prebuncher or main accelerator can 
be used and the deflection of particles is effected by the azimuthal magnetic field in 
the cavity. 

The prebuncher produces a string of bunches at the prebuncher frequency. For 
many applications, however, a different bunch structure is desired. Specifically it 
often occurs that only one single bunch is desired. To produce a single pulse, the 
chopper system may consist of a permanent magnet and a fast pulsed magnet. 
The permanent magnet deflects the beam into an absorber while the pulsed magnet 
deflects the beam away from the absorber across a small slit. The distance between 
the center of the pulsed magnet and the slit be D (Fig. 18.13), the slit aperture A 
and the rate of change of the magnetic field B. For an infinitely thin beam the pulse 
length behind the slit is then 


——— (18.119) 


where ¢ is the deflection angle, £ the effective magnetic length of the pulsed magnet 
and cp the momentum of the particles. In order to clean the beam between bunches 


638 18 Charged Particle Acceleration 


or to select a single bunch from a train of bunches the chopper parameters must be 
chosen such that only the desired part of the beam passes through. 


18.5.3. Buncher Section 


A buncher section is similar to an ordinary electron linac section but dimensioned 
for sub relativistic particle. Particles arriving from the source, prebuncher or chopper 
may not be at relativistic energies and therefore cannot follow the rf-wave in an 
ordinary linac section which have a phase velocity equal to the velocity of light. 
To optimize the whole acceleration system a buncher section is inserted as the first 
linac section. The length of each cell is shorter than in a normal linac structure 
and changes from a length appropriate for the velocity of the incoming particles to 
longer and longer cell lengths until the particle has reached the constant velocity of 
light for which the normal linac structure is dimensioned. In this case the particle 
would be accelerated from beginning at the desired phase. In case of an rf-gun the 
electrons emerge relativistic and no buncher is necessary. In many cases, especially 
at smaller facilities, the buncher section is omitted at some degradation of beam 
performance. 


Problems 


18.1 (S). Determine within a factor of two or less the longest TE or TM-mode 
wavelength that can propagate through a round tube of diameter 2R. 


18.2 (S). Consider a pill box cavity made of copper and calculate the frequency 
shift per 1°C temperature change. The linear expansion coefficient for copper is 
nr = 16.6-10~° m/(m°C). What is the temperature tolerance if the rf frequency 
should not change by more than +10~°. 


18.3 (S). Determine the frequency scaling of cavity dimensions, transit time factor, 
quality factor, shunt impedance, specific shunt impedance and cavity filling time. 


18.4 (S). In electron linear accelerators operating at 3 GHz accelerating fields of 
more than 50 MeV/m can be reached. Why can such high fields not be used in a 
storage ring? Discuss quantitatively, while scaling linac parameters to the frequency 
of your choice in the storage ring. 


18.5 (S). Discuss the graph in Fig. 18.11. Specifically explain in words the particle 
dynamics within random features. How come particles get accelerated even though 
they enter the linac while the accelerating field is negative? (note: interpretation 
of the graph for initial energies < 1MeV does not have enough resolution to be 
reliable.) 
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18.6 (S). Design a 500 MHz prebuncher system for a 3 GHz linear accelerator. 
Particles in a continuous beam from the source have a kinetic energy of Ey = 
100keV with an energy spread of +0.02%. Specify the optimum prebuncher 
voltage and drift length to compress the maximum intensity into a bunch with a 
phase distribution of less than +12° at 3 GHz. 


18.7 (S). Calculate for a SLAC type linac section the no-load energy gain and the 
energy gain for a pulse current of ij = 20mA. The rf-power is Pp = 15 MW per 
section at a pulse length of 2.5 jus. Compare the efficiency to the situation when only 
one bunch of n> = 10!° electrons is accelerated. What is the linac efficiency for this 
current and what is the energy gain in this case? 


18.8. Consider a rectangular box cavity with copper walls and dimensioned for an 
rf-wavelength of A = 10.5cm. Calculate the wall losses due to the fundamental 
field only and determine the shunt impedance per unit length r, and the quality 
factor Q for this cavity. These losses are due to surface currents within a skin depth 
generated by the rf-fields on the cavity surface. Compare these parameters with 
those of (18.96). Is the shape of the cavity very important? Determine the resonance 
width and temperature tolerance for the cavity. 


18.9. Plot the electrical and magnetic field distribution for the three lowest order 
modes in a rectangular and cylindrical cavity. Calculate the shunt impedance and 
compare the results. Which type of cavity is more efficient? 


18.10. Derive a general expression of the shunt impedance for general TM-modes 
in a cylindrical cavity. 


18.11. Derive expressions for the maximum electric field strength and the waveg- 
uide losses per unit length for the TE;9 mode in a rectangular waveguide. Use this 
result to design a waveguide for 3 GHz. Calculate the cut-off frequency, the phase 
and group velocities and the waveguide wavelength. What criteria did you use to 
choose the dimensions a and b? Sketch the electrical and magnetic fields. 


18.12. Consider a 8 GeV electron storage ring with a FODO lattice and a beam 
current of 200mA. Determine the equilibrium energy spread and specify rf- 
parameters which will be sufficient to compensate for synchrotron radiation losses 
and provide an energy acceptance for all particles in a Gaussian energy distribution 
up to 7o,/E. What is the synchrotron tune and the bunch length in your storage 
ring? 


18.13. Consider a pill box cavity with copper walls for a storage ring and choose 
a rf-frequency of 750 MHz. Derive an expression for the wall losses due to the 
fundamental field only and derive an expression for the shunt impedance of the 
cavity defined by Ry = a Py, where Vi¢ is the maximum rf-voltage and P,¢ 
the cavity wall losses. What are the rf-losses if this cavity is used in the ring of 
Problem 18.12? Assume that you can cool only about 150kW/m of cavity length. 
How many cavities would you need for your ring example? 
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18.14. The electromagnetic field for a cylindrical waveguide have been derived 
in Sect. 18.1.3. Derive in a similar way expressions for resonant field modes in a 
rectangular waveguide. 
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Chapter 19 
Beam-Cavity Interaction* 


The proper operation of the rf-system in a particle accelerator depends more than 
any other component on the detailed interaction with the particle beam. This results 
from the observation that a particle beam can induce fields in the accelerating 
cavities of significant magnitude compared to the generator produced voltages and 
we may therefore not neglect the presence of the particle beam. This phenomenon 
is called beam loading and can place severe restrictions on the beam current that 
can be accelerated. In this section, main features of such interaction and stability 
conditions for most efficient and stable particle acceleration will be discussed. 


19.1 Coupling Between rf-Field and Particles 


In our discussions about particle acceleration we have tacitly assumed that particles 
would gain energy from the fields in accelerating cavities merely by meeting the 
synchronicity conditions. This is true for a weak particle beam which has no 
significant effect on the fields within the cavity. As we try, however, to accelerate 
an intense beam, the actual accelerating fields become modified by the presence 
of considerable electrical particle beam currents. This beam loading can ultimately 
limit the maximum beam intensity. 

The phenomenon of beam loading will be defined and characterized in this 
section leading to conditions and parameters to assure positive energy flow from 
the rf-power source to the beam. Fundamental consideration to this discussion are 
the principles of energy conservation and linear superposition of fields which allow 
us to study field components from one source independent of fields generated by 
other sources. Specifically, we may treat beam induced fields separately from fields 
generated by rf-power sources. 
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19.1.1 Network Modelling of an Accelerating Cavity 


The electrical excitation of a rf-cavity can be accurately described by an oscillator 
as discussed in Sect. 18.2.4 and we will use therefore characteristic parameters 
and terminology of externally driven, damped oscillators in our further discussions 
of rf-systems. Electrically, an accelerating cavity can be represented by a parallel 
resonant circuit (Fig. 19.1) which is driven by an external rf-current source J, from 
a generator and the particle beam Jy. 

The amount of rf-power available from the generator in the accelerating cavity 
depends greatly on the relative impedance of cavity and generator. Both have to be 
matched to assure optimum power transfer. To derive conditions for that we define 
the internal impedance of the current source or rf-generator in terms of the cavity 
shunt impedance R, of an empty cavity as defined in (18.74) 


ie (19.1) 


where f is the coupling coefficient still to be defined. This coefficient depends on 
the actual hardware of the coupling arrangement for the rf-power from the generator 
at the entrance to the cavity and quantifies the generator impedance as seen from the 
cavity in units of the cavity shunt impedance R, (Fig. 19.1). Since this coupling 
coefficient depends on the hardware, we need to specify the desired operating 
condition to determine the proper adjustment of the coupling during assembly. This 
adjustment is done by either rotating a loop coupler with respect to the cavity axis 
or adjustment of the aperture in case of capacitive coupling through a hole. 

The inductance L and capacitance C form a parallel resonant circuit with the 
resonant frequency 


1 
oO, = —=. (19.2) 
VLC 
The rf-power available at the cavity from the generator is 
Pe = ZKV; s (19.3) 


Fig. 19.1 Network model for 


an rf generator and an I,/ 2 
accelerating cavity I J B 
g 
Ve RJ B Vey L R, C 


19.1 Coupling Between rf-Field and Particles 643 


where Y; is the loaded cavity admittance including energy transfer to the beam 
and V, is the generator voltage. Unless otherwise noted, the voltages, currents and 
power used in this section are the amplitudes of otherwise oscillating quantities. At 
resonance where all reactive power vanishes we use the generator current J, and 
network admittance Y = Y, + Y, to replace the generator voltage 


 & 


A 
yy, 


and get after insertion into (19.3) the generator power in the form 


Y, 
a (19.4) 


1 
P, = = ———— I... 
. 2(¥,+N)° * 


Noting that the generator power has a maximum, which can be determined from 
OP,/0Y_ = 0, we obtain the well-known result that the rf-power transfer from the 
generator becomes a maximum if the load is matched to the internal impedance of 
the generator by adjusting 


Y= Y, or RL = Sener (19.5) 


replacing the admittances by the respective impedances. The maximum available 
rf-power at the cavity is therefore with Y, = B/R, 


(19.6) 


To calculate the quality factor for a cavity, we note the stored energy is W = 
+CV? and the energy loss rate Py = $V? /R. Using the definition (18.80) the 
unloaded quality factor becomes with R = R, at resonance 


Qo = a,CR, . (19.7) 


The admittance for the total circuit as seen by the beam is that of cavity plus 
generator or 


1 fp 1 _14+8 


= 19.8 
Ry Rs 7 Rs Rg ee) 

From this and (19.7) we get the loaded quality factor 
0 =0,CR, = Qo (19.9) 


1+p 
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Off resonance the generator voltage and current are no more in phase. The phase 
difference can be derived from the complex impedance of the network, which is the 
same seen from the generator as well as seen from the beam 


1 1 1 
ses ti —* 19.1 
Z Bo (19.10) 


The complex impedance becomes with (19.2), (19.9) 


ee: page (19.11) 
= = 1 7 
Z Ry @ Wy 

and with J, = V,/Z the generator current is 


aS ny, a = 8 -itenw) (19.12) 
c= R, iQ a a itan : : 


T 


Close to resonance the tuning angle W becomes from (19.12) with w + a, 
2 152 = 
SE 90° (19.13) 

WW, Or 


tanW » —Q 


in agreement with (18.64) except for a phase shift of —90°, which was introduced 
here to be consistent with our definition of the synchronous phase w,. The variation 
of the tuning angle is shown in Fig. 19.2 as a function of the generator frequency. 
From (19.12), the generator voltage at the cavity is finally 


T.Rp 


=” _=],R Wei”. 19.14 
.i-imae  F Oro 


tuning angle (deg) 


= 90 T T T ] T 
ro) frequency 


Fig. 19.2. Tuning angle y as a function of the generator frequency 
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At frequencies below the resonance frequency the tuning angle is positive and 
therefore the generator current lags the voltage by the phase W. This case is also 
called inductive detuning since the impedance looks mainly inductive. Conversely, 
the detuning is called capacitive detuning because the impedance looks mostly 
capacitive for frequencies above resonance frequency. 

A bunched particle beam passing through a cavity acts as a current just like the 
generator current and therefore the same relationships with respect to beam induced 
voltages exist. In case of capacitive detuning, for example, the beam induced voltage 
V, lags in phase behind the beam current Jp. 

The effective accelerating voltage in the cavity is a composition of the generator 
voltage, the induced voltage, and the phase relationships between themselves and 
relative to the particle beam. To assure a stable beam, the resulting cavity voltage 
must meet the requirements of particle acceleration to compensate, for example, lost 
energy into synchrotron radiation. We determine the conditions for that by deriving 
first the generator voltage V,, at resonance and without beam loading while voltage 
and current are in phase. From Fig. 19.1 we get 


Ip i: Rle 
v= = = (19.15) 


and with (19.6) the generator voltage at resonance becomes 


2/28 pe Pe (19.16) 


Vor = 1+B 
The generator voltage at the cavity is therefore with (19.6) 
Ve = Ver cos We . (19.17) 


This is the cavity voltage seen by a negligibly small beam and can be adjusted to 
meet beam stability requirements by varying the tuning angle W and rf-power Py. 


19.2 Beam Loading and Rf-System 


For more substantial beam currents the effect of beam loading must be included 
to obtain the effective cavity voltage. Similar to the derivation of the generator 
voltage in a cavity, we may derive the induced voltage from the beam current passing 
through that cavity. Since there is no fundamental difference between generator and 
beam current, the induced voltage is in analogy to (19.17) 


Vp = —Vor cos W el , (19.18) 
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where the negative sign indicates that the induced voltage is decelerating the beam. 
The particle distribution in the beam occurs in bunches and the beam current 
therefore can be expressed by a Fourier series. Here we are only interested in the 
harmonic J, of the beam current and find for bunches short compared to the rf- 
wavelength 


In = 2Ip (19.19) 


where J, is the average beam current and / the harmonic number. The approximation 
for short bunches with £ < A, holds as long as sin kyg€ & keg € with ky. = 270/ Ae. 
For longer bunches the factor 2 becomes a more complicated formfactor as can be 
derived from an appropriate Fourier expansion. At the resonance frequency w, = 
hwo, the beam induced voltage in the cavity is then with (19.8) from (19.15) 


Rsln 2RgIy 
Vex = = . 
1+ 6 1+ 8 


(19.20) 


The resulting cavity voltage is the superposition of both voltages, the generator 
and the induced voltage. This superposition, including appropriate phase factors, is 
often represented in a phasor diagram. In such a diagram a complex quantity Z is 
represented by a vector of length |Z| with the horizontal and vertical components 
being the real and imaginary part of z, respectively. The phase of this vector 
increases counter clockwise and is given by tang =Im(Z) /Re(Z). In an application 
to rf-parameters we represent voltages and currents by vectors with a length equal 
to the magnitude of voltage or current and a counter clockwise rotation of the vector 
by the phase angle ¢. 

The particle beam current can be chosen as the reference being parallel to the 
real axis and we obtain from the quantities derived so far the phasor diagram as 
shown in Fig. 19.3. First we determine the relationships between individual vectors 
and phases and then the correct adjustments of variable rf-parameters. In Fig. 19.3 
the generator current is assumed to have the still to be determined phase 7? with 
respect to the beam current while the generator voltage and beam induced voltage 
lag by the phase W behind the beam current. The resulting cavity voltage Vy is the 
phasor addition of both voltages Vz + Vj as shown in Fig. 19.3. 

The adjustment of the rf-system must now be performed in such a way as to 
provide the desired gain in kinetic energy Up = e Ve sin Ws where Vey is the 
maximum value of the cavity voltage and w, the synchronous phase. To maximize 
the energy flow from the generator to the cavity the load must be matched such that 
it appears to the generator purely resistive. This is achieved by adjusting the phase 
We to get the cavity voltage V., and generator current J, in phase which occurs for 


We = in, (19.21) 


as shown in Fig. 19.4. Obviously, this is only true for a specific value of the beam 
current. General operation will deviate from this value and therefore we often match 
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Fig. 19.4 Phasor diagram with optimum tuning angle 


to the maximum desired beam current. For lower currents, where the energy transfer 
is not optimum anymore, some loss of efficiency is acceptable. 

The tuning angle adjustment for optimum matching can be derived from Fig. 19.4 
and applying the law of sines we have with (19.17) 


Vp Vor COS Win sin Wy sin Wn 


= = : 19.22 
Vey Vey sin W, cos Ws ( ) 
The optimum tuning angle is from (19.22) 
Vor 
tanY, = v. COS Ws . (19.23) 


cy 


This tuning is effected by a shift in the resonant frequency of the cavity 
with respect to the generator frequency by, for example, moving a tuner in or 
out. From (19.13) we get with (19.9), (19.20), (19.23) for the frequency shift or 
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frequency tuning 


a, Py 
60 =wW- a, = “703By (19.24) 
where the cavity power is defined by 
Ve, 
Py = OR. (19.25) 
and the beam power by 
Py = [pVey sin Ws . (19.26) 


To determine the required generator power the components of the cavity voltage 
vector can be expressed by other quantities and we get from Fig. 19.4 


Vey sin Ws = Vor COS Wn, COS (We — Yin) — Vor cos” Yn (19.27) 
and 
Vey COS Ws = Vor COS Mp sin (We _ Yn) + Vp, cos Wy sin Wy . (19.28) 


Combining both equations to eliminate the phase (We - Yin) , we get 


Sl i 2 cost mn ge a) (19.29) 
= 7 + COS Yn c — Vbr SIN Ym : 
er cos Wn i cos Win ‘ 


and with (19.16), (19.20) the required generator power for the condition of optimum 
matching is 


Ve, 1+ By | / sins 2Relp 
p= s : Duy 19.30 
®= OR, 4B (= Wn | Vy+B) °° ) om 


cos Ws 2Rslp ; . 
+ _ sin Wy ; 
cosWy = Vey (1 + B) 


This expression can be greatly simplified with (19.23) to become 


(1+ By 2Rolb . 
Peon = Sap V5 4 iz pany (19.31) 


Equation (19.31) represents a combination of beam current through Jy, rf- 
generator power P,, coupling coefficient 6, and shunt impedance R, to sustain 
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a cavity voltage V.y. Specifically, considering that the rf-power Py, and coupling 
coefficient f is fixed by the hardware installed a maximum supportable beam current 
can be derived as a function of the desired or required cavity voltage. Solving for 
the cavity voltage, (19.31) becomes after some manipulation 


a oe Cas thin) Bite 5 Bp r) (19.32) 


This expression exhibits a limit for the beam current above which the second 
square root becomes imaginary. The condition for real solutions requires that 


Pp < Peo (19.33) 


leading to a limit of the maximum sustainable beam current of 


(fee Pats... (19.34) 
~ 14+ B Vey sin Ws 

Inspection of (19.31) shows that the required generator power can be further 
minimized by adjusting for the optimum coupling coefficient 6. Optimum coupling 
can be derived from 0P,/08 = 0 with the solution 


2RslIp Py 
sinw, = 1+ —. (19.35) 


cy Poy 


Bopt = 1+ 


The minimum generator power required to produce an accelerating voltage 
Voy sin W, is therefore from (19.31) with (19.35) 


2 
Pg min = 5p Bove = BoptPey (19.36) 
and the optimum tuning angle from (19.23) 


—1 
tan Yon = cot Ws. (19.37) 


In this operating condition all rf-power from the generator is absorbed by the 
beam loaded cavity and no power reflection occurs. The maximum beam power is 
therefore P, = Py — P.y and the maximum beam current 


P. g Vey 


Ih< : - - : 
Vey sinw, 2R,sin Ys 


(19.38) 


650 19 Beam-Cavity Interaction* 


Conditions have been derived assuring most efficient power transfer to the beam 
by proper adjustment of the cavity power input coupler to obtain the optimum 
coupling coefficient. Of course this coupling coefficient is optimum only for a 
specific beam current which in most cases is chosen to be the maximum desired 
beam current. 

We are now in a position to determine the total rf-power flow. From conservation 
of energy we have 


Pg = Py +Pyp+Pr, (19.39) 


where P, is the reflected power which vanishes for the case of optimum coupling. 


19.3. Higher-Order Mode Losses in an Rf-Cavity 


The importance of beam loading for accurate adjustments of the rf-system has been 
discussed qualitatively but not yet quantitatively. In this paragraph, quantitative 
expressions will be derived for beam loading. Accelerating cavities constitute an 
impedance to a particle current and a bunch of particles with charge g passing 
through a cavity induces electromagnetic fields into a broad frequency spectrum 
limited at the high frequency end by the bunch length. The magnitude of the 
excited frequencies in the cavity depends on the frequency dependence of the 
cavity impedance, which is a function of the particular cavity design and need not 
be known for this discussion. Fields induced within a cavity are called modes, 
oscillating at different frequencies with the lowest mode being the fundamental 
resonant frequency of the cavity. Although cavities are designed primarily for one 
resonant frequency, many higher-order modes or HOM’scan be excited at higher 
frequencies. Such modes occur above the fundamental frequency first at distinct 
well-separated frequencies with increasing spectral densities at higher frequencies. 

For a moment we consider here only the fundamental frequency and deal with 
higher-order modes later. Fields induced by the total bunch charge act back on 
individual particles modifying the overall accelerating voltage seen by the particle. 
To quantify this we use the fundamental theorem of beam loading formulated by 
Wilson [1] which states that each particle within a bunch sees one half of the induced 
field while passing through the cavity. 

We prove this theorem by conducting a Gedanken experiment proposed by 
Wilson. Consider a bunch of particles with charge g passing through a lossless cavity 
inducing a voltage Vj, in the fundamental mode. This induced field is opposed to the 
accelerating field since it describes a loss of energy. While the bunch passes through 
the cavity this field increases from zero reaching a maximum value at the moment 
the particle bunch leaves the cavity. Each particle will have interacted with this field 
and the energy loss corresponds to a fraction f of the induced voltage V;,,, where 
the index h indicates that we consider only the fundamental mode. The total energy 
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lost by the bunch of charge q is 
AE, = —qif Vin. (19.40) 
This energy appears as field energy proportional to the square of the voltage 
Wi =i: (19.41) 


where c, is a constant. 

Now consider another bunch with the same charge q2 = gq, = q following 
behind the first bunch at a distance corresponding to half an oscillation period 
at the fundamental cavity frequency. In addition to its own induced voltage this 
second bunch will see the field from the first bunch, now being accelerating, and 
will therefore gain an energy 


AE, = QVin — Gof Vin = Vin —f). (19.42) 


After passage of the second charge, the cavity returns to the original state before 
the first charge arrived because the field from the first charge having changed sign 
exactly cancels the induced field from the second charge. The cavity has been 
assumed lossless and energy conservation requires therefore that AE, + AE, = 0 
or —gf Vin + qVin (1 —f ) = 0 from which we get 


f=q (19.43) 


proving the statement of the fundamental theorem of beam loading. The energy loss 
of a bunch of charge g due to its own induced field is therefore 


AE, = —4qVin. (19.44) 


This theorem will be used to determine the energy transfer from cavity fields to 
a particle beam. To calculate the induced voltages in rf-cavities, or in arbitrarily 
shaped vacuum chambers providing some impedance for the particle beam can 
become very complicated. For cylindrically symmetric cavities the induced voltages 
can be calculated numerically with programs like SUPERFISH [2], URMEL[3] or 
MAFIA [3]. 

For a more practical approach Wilson [1] introduced a loss parameter k which can 
be determined either by electronic measurements or by numerical calculations. This 
loss parameter for the fundamental mode loss of a bunch with charge q is defined by 


AEn = knq? (19.45) 
and together with (19.44) we get the induced voltage 


Vin = —2knq (19.46) 
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or after elimination of the charge 


Vin 
AE, = , 
4kp 


(19.47) 


where the index h indicates that the parameter should be taken at the fundamental 
frequency. The loss parameter can be expressed in terms of cavity parameters. From 
the definition of the cavity quality factor (18.80) and cavity losses from (18.77) we 
get 


2Rey _ V? 


OQ oaoW’ 


(19.48) 


where w is the frequency and W the stored field energy in the cavity. Applying 
this to the induced field, we note that AE; is equal to the field energy W;, and 
combining (19.47), (19.48) the loss parameter to the fundamental mode in a cavity 
with shunt impedance R;, and quality factor Q), is 


k= (19.49) 


The excitation of higher-order mode fields by the passing particle bunch leads 
to additional energy losses which are conveniently expressed in units of the energy 
loss to the fundamental mode 


AEnom = (Thom _ 1) AE; , (19.50) 


where /hom is the ratio of the total energy losses into all cavity modes to the loss into 
the fundamental mode only. The induced higher order field energy in the cavity is 
therefore 


Whom = (Thom _ 1) Wh. (19.51) 


Again we may define a loss parameter k, for an arbitrary nth-mode and get 
analogous to (19.49) 


On 2Rn 


ky SS ’ 
4 On 


(19.52) 


where R,, and Q,, are the shunt impedance and quality factor for the nth-mode or 
frequency @,, respectively. The total loss parameter due to all modes is by linear 
superposition 


k= y ha (19.53) 


19.3 Higher-Order Mode Losses in an Rf-Cavity 653 


The task to determine the induced voltages has been reduced to the determination 
of the loss parameters for individual modes or if this is not possible or desirable we 
may use just the overall loss parameter k as may be determined experimentally. This 
is particularly convenient for cases where it is difficult to calculate the mode losses 
but much easier to measure the overall losses by electronic measurements. 

The higher-order mode losses will become important for discussion of beam 
stability since these fields will act back on subsequent particles and bunches thus 
creating a coupling between different parts of one bunch or different bunches. 


19.3.1 Efficiency of Energy Transfer from Cavity to Beam 


Higher-order mode losses affect the efficiency by which energy is transferred to the 
particle beam. Specifically, since the higher-order mode losses depend on the beam 
current we must expect some limitation in the current capability of the accelerator. 

With these preparations we have now all information to calculate the transfer 
of energy from the cavity to the particle beam. Just before the arrival of a particle 
bunch let the cavity voltage as generated by the rf-power source be 


Vey = —Vg ele, (19.54) 


where V, is the generator voltage and WY, the generator voltage phase with respect 
to the particle beam. To combine the generator voltage with the induced voltage we 
use phasor diagrams in the complex plane. 

The generator voltage is shown in Fig. 19.5 as a vector rotated by the angle ¥, 
from the real axis representing the cavity state just before the beam passes. The 
beam induced voltage is parallel and opposite to the real axis. Both vectors add up 
to the voltage V just after the beam has left the cavity. 

The difference of the fundamental field energy before and after passage of the 
particle bunch is equal to the energy transferred to the passing particle bunch minus 


Fig. 19.5 Phasor diagram for 
cavity voltages with beam 
loading 


Vg 
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the higher-order field energy and is from the phasor diagram 
AE hom = 0 (v3, = v*) = Whom = & (2VeyVp cos Wp — V2) — Whom, (19.55) 


where a is the proportionality factor between the energy gain AE and the square of 
the voltage defined by a = AE/V?. With (19.45), (19.46), we get from (19.55) the 
net energy transfer to a particle bunch [4] 


AEnom = & (2VcyVp cos Wy — Reus) . (19.56) 


The energy stored in the cavity before arrival of the beam is W.y = ave, and the 
energy transfer efficiency to the beam becomes 


AE, _ .Vo Vb 


b 
4% (19.57) 
ie 


It is obvious from (19.57) that energy transfer is not guaranteed by the syn- 
chronicity condition or the power of the generator alone. Specifically, the second 
term in (19.57) becomes dominant for a large beam intensity and the efficiency may 


even become negative indicating reversed energy transfer from the beam to cavity 
cos W, 


fields. The energy transfer efficiency has a maximum for Vp = = ~ Vey and is 
cos” W, 
max = ; (19.58) 
Thom 


a result first derived by Keil et al. [5] and is therefore frequently called the Keil- 
Schnell-Zotter criterion. The maximum energy transfer efficiency is limited by the 
phase of the generator voltage and the higher-order mode losses in the cavity. 


19.4 Beam Loading 


Only one passage of a bunch through a cavity has been considered in the previous 
section. In circular accelerators, however, particle bunches pass periodically through 
the accelerating cavities and we have to consider the cumulative build up of induced 
fields. Whenever a particle bunch is traversing a cavity the induced voltage from this 
passage is added to those still present from previous bunch traversals. For simplicity, 
we assume a number of equidistant bunches along the circumference of the ring, 
where adjacent bunches are separated by an integer number m, of the fundamental 
rf-wavelength. The induced voltage decays exponentially by a factor e-? between 
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Fig. 19.6 Phasor diagram for 
the superposition of induced 
voltages in an accelerating 
cavity 


two consecutive bunches with 


t 
p=—, (19.59) 
ta 
where f, is the time between bunches and fg the cavity voltage decay time for the 
fundamental mode. The phase of the induced voltage varies between the passage of 
two consecutive bunches by 


Q = pty — 2M. (19.60) 


At the time a bunch passes through the cavity the total induced voltages are then 
the superposition of all fields induced by previous bunches 


Vi = Vin (1 +e7Pel? + e Pel? +.--) (19.61) 


shown in Fig. 19.6 as the superposition of all induced voltages in form of a phasor 
diagram together with the resultant induced voltage Vj. The sum (19.61) can be 
evaluated to be 


Wey (19.62) 


1—ePeiv ’ 
which is the total induced voltage in the cavity just after the last bunch passes; 
however, the voltage seen by this last bunch is only half of the induced voltage and 
the total voltage Vi, acting back on the bunch is therefore 


1 1 
Vo = Vin (— — 5) : (19.63) 


l—e-Pelv 2 


The voltage Vi, can be expressed in more practical units. Considering the 
damping time (18.62) for fields in a cavity we note that two damping times exist, 
one for the empty unloaded cavity fgg and a shorter damping time fg when there is 
also a beam present. For the unloaded damping time we have from (18.62) 
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2Qon 
tao = 2201 (19.64) 


Wh 


where Qo, is the unloaded quality factor. From (19.45), (19.47) we get with g = Iotp, 
where Jp is the average beam current, 


Wh 


Vin = 
20on 


Rrloty 
and with (19.64) 
ty 
Vin = Rplo— . (19.65) 
tao 


Introducing the coupling coefficient 6, we get from (19.9), (19.64) 


tao = (1+ B) tp. (19.66) 
In analogy to (19.59) we define 
a= (19.67) 
1+ tao 
and (19.65) becomes 
Vin = poRnlo . (19.68) 


We are finally in a position to calculate from (19.63), (19.68) the total beam 
induced cavity voltage V, in the fundamental mode for circular accelerators. 


19.5 Phase Oscillation and Stability 


In the course of discussing phase oscillations we found it necessary to select 
carefully the synchronous phase depending on the particle energy being below or 
above the transition energy. Particularly, we found that phase stability for particles 
above transition energy requires the rf-voltage to decrease with increasing phase. 
From the derivative of (19.27) with respect to y, we find with (19.21) and since 
Vor cos W > 0 the condition for phase stability sin(yg — Ym) < 0 or 


1 
50 <Wst Yn < 3a. (19.69) 
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From (19.28) and (19.69) we get 
—Vey |cos Ws| — Vor Sin Ym cos My < 0 
or with (19.23) 
Vor Sin Ys < Vey . (19.70) 


which is Robinson’s phase-stability criterion or the Robinson condition [6] for the 
tuning angle of the accelerator cavity. The maximum current that can be accelerated 
in a circular accelerator with stable phase oscillations is limited by the effective 
cavity voltage. In terms of rf-power, (19.70) is with (19.20) equivalent to 


Py < (1+ B) Prey (19.71) 
and the stability condition for the coupling coefficient is from (19.35) 
B > Bopt — 2. (19.72) 


The stability condition is always met for rf-cavities with optimum coupling B = 


Bopt: 


19.5.1 Robinson Damping 


Correct tuning of the rf-system is a necessary but not a sufficient condition for 
stable phase oscillations. In Chap. 12 we found the occurrence of damping or anti- 
damping due to forces that depend on the energy of the particle. Such a case occurs 
in the interaction of bunched particle beams with accelerating cavities or vacuum 
chamber components which act like narrow band resonant cavities. The revolution 
time of a particle bunch depends on the average energy of particles within a bunch 
and the Fourier spectrum of the bunch current being made up of harmonics of the 
revolution frequency is therefore energy dependent. On the other hand by virtue 
of the frequency dependence of the cavity impedance, the energy loss of a bunch 
in the cavity due to beam loading depends on the revolution frequency. We have 
therefore an energy dependent loss mechanism which can lead to damping or worse 
anti-damping of coherent phase oscillation and we will therefore investigate this 
phenomenon in more detail. Robinson [6] studied first the dynamics of this effect 
generally referred to as Robinson damping or Robinson instability. 

Above transition energy the revolution frequency is lower for higher bunch 
energies compared to the reference energy and vice versa. To obtain damping of 
coherent phase oscillations, we would therefore tune the cavity such that the bunch 
would loose more energy in the cavity while at higher energies (lower frequency) 
during the course of coherent synchrotron oscillation and loose less energy at 
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a) b) 
stable unstable unstable stable 
@=h®, o=ha, 
beam spectrum beam spectrum 


below transition above transition 


Fig. 19.7 Cavity tuning for positive Robinson damping below and above transition energy. (a) 
Below transition. (b) Above transition 


lower energies (higher frequency). In this situation, the impedance of the cavity 
should decrease with increasing frequency for damping to occur as demonstrated in 
Fig. 19.7. 

Here the resonance curve or impedance spectrum is shown for the case of a 
resonant frequency above the beam frequency hap in Fig. 19.7a and below the beam 
frequency in Fig 19.7b. Consistent with the arguments made above we would expect 
damping in case of Fig. 19.7b for a beam above transition and anti-damping in case 
of Fig. 19.7a. Adjusting the resonance frequency of the cavity to a value below 
the beam frequency hwpwhere wpis the revolution frequency, is called capacitive 
detuning. Conversely, we would tune the cavity resonance frequency above the beam 
frequency (@, > hwo) or inductively detune the cavity for damping below transition 
energy (Fig. 19.7a). 

In a more formal way we fold the beam-current spectrum with the impedance 
spectrum of the cavity and derive scaling laws for the damping as well as the 
shift in synchrotron frequency. During phase oscillations the revolution frequency 
is modulated and as a consequence the beam spectrum includes in addition to the 
fundamental frequency two side bands or satellites. The beam-current spectrum is 
composed of a series of harmonics of the revolution frequency up to frequencies 
with wavelength of the order of the bunchlength 


I(t) = Ip + Y~ Incos (nwt — ¢) . (19.73) 


n>0 


where /, is the average circulating beam current and g a phase shift with respect 
to the reference particle. The Fourier coefficient for bunches short compared to the 
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wavelength of the harmonic is given by 
In = 2h. (19.74) 


Here we restrict the discussion to the interaction between beam and cavity at 
the fundamental cavity frequency and the only harmonic of interest in the beam 
spectrum is therefore the Ath harmonic 


I,(t) = 21, cos (hot — ~) . (19.75) 


By virtue of coherent synchrotron oscillations the phase oscillates for each 
particle in a bunch like 


p(t) = go sin Qt, (19.76) 


where @p is the maximum amplitude and £2, the synchrotron oscillation frequency 
of the phase oscillation. We insert this into (19.75) and get after expanding the 
trigonometric functions for small oscillation amplitudes gy « 1 


I,(t) = 2h, cos (h@pt) — Ibo [cos (hapt + 2st) — cos (hapt — 2,f)]. (19.77) 


This expression exhibits clearly sidebands or satellites in the beam spectrum at 
hay + 92;. Folding the expression for the beam current with the cavity impedance 
defines the energy loss of the particle bunch while passing through the cavity. The 
cavity impedance is a complex quantity which was derived in (19.11) and its real 
part is shown together with the beam spectrum in Fig. 19.8. The induced voltage in 
the cavity by a beam J;,(t) = J, cos hwot is 


V, = —ZI,(t) = —Z,I, cos (haot) — ZIy sin (haot) , (19.78) 


Fig. 19.8 Cavity impedance 
and beam spectrum in the 
vicinity of the fundamental rf 
frequency w,¢ = hwo 
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where we have split the impedance in its real and imaginary part and have expressed 
the imaginary part of the induced voltage by a —/2 phase shift. Applying (19.78) 
to all components of the beam current (19.77) we get the induced voltage in the 
cavity 


Vi, = —Z°2h, cos hat — Z?21, sin hwot (19.79) 


+ Z;" Ingo cos hwot cos 2.t — Z;" Iy~o sin hwot sin S2<t 


+ Z;"Ipgo Sin haot cos 2st + Z;" Ingo cos hwot sin 2¢t 
— Z, Ingo cos haot cos 2,t — Z, Ino sin hwot sin S2t 


—Z, Ingo sinhaot cos 2st + Z; Ib~o cos hwot sin S2st , 


where Z°,Z+ and Z~ are the real r and imaginary i cavity impedances at the 
frequencies hay, hay + 2, hwy — 2; respectively. We make use of the expression 
for the phase oscillation (19.76) and its derivative 


P(t) = Yo, cos L2,t, (19.80) 


multiply the induced voltage spectrum (19.79) by the current spectrum (19.77) and 
get after averaging over fast oscillating terms at frequency hwo 


Zp, =Z, ; 


(Vila) = —215 )Z? -[Z)- 3 (Z + Zi )le-=35-*@ - (19.81) 


This is the rate of energy loss of the particle bunch into the impedance of the 
cavity. Dividing by the total circulating charge Ty/, we get the rate of relative energy 
loss per unit charge 


de = (eViln) _ Q 


dt TolbEy —-— Beka |nel’ 


(19.82) 


where Tp is the revolution time and /, the average beam current. 

We made use of the relation between the energy deviation from the ideal energy 
and the rate of change of the phase (9.17) on the r.h.s. of the equation. From (19.81), 
(19.82) and making use of the definition of the synchrotron frequency in (9.32) 

2 = coliel oy. |cos Ws|, we get a differential equation of the form 


G + 2ard + AQ’y =0 (19.83) 


with a Robinson damping decrement 


Qs S 
Eo 2a ie, (19.84) 


ae = ——————_. 
ss 2Vey |cos Ws| + * y 
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and a shift in synchrotron oscillation frequency 


2B2% 


AQ? = —— 
Vey |cos Ws| 


[22-3 (Z° +Z)|h. (19.85) 


The unperturbed phase equation (9.26) is 
G + 2a0G + Qyy = 0 (19.86) 


and combining both, we derive a modification of both the damping and oscillation 
frequency. The combined damping decrement is 


B25 (z;* 


Ae = Acn — 
: ” Vey |cos Ws] * 7 


—Z,)h>0 (19.87) 


where Qo is the radiation damping in electron accelerators. The total damping 
decrement must be positive for beam stability. The interaction of the beam with 
the accelerating cavity above transition is stable for all values of the beam current 
if Z* < Zor if the cavity resonant frequency is capacitively detuned. Due to 
the imaginary part of the impedance the interaction of beam and cavity leads to 
a synchrotron oscillation frequency shift given by 


Qs = Qo - 


Ss 


2822 
eA [Z? —5(Z +Z,)] bh. (19.88) 


This frequency shift has two components, the incoherent frequency shift due 
to the impedance Z? at the fundamental beam frequency hwo and a frequency 
shift for coherent bunch-phase oscillations due to the imaginary part of the cavity 


impedances. For small frequency shifts A92, = 2, — 250, (19.88) can be linearized 
for 


Qo Vey |cos Ws 


1 


: [2 -4(ZF +Z))]. (19.89) 


The cavity impedance is from (19.10) 


1=i0 = 
| A ae (19.90) 


1 +93 (Sat) 


From the imaginary part of the cavity impedance and capacitive detuning we 
conclude that above transition energy, the incoherent synchrotron tune shift is 
positive 


AQ, incoh > 0 (19.91) 
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while the coherent synchrotron tune shift is negative 
AQs.coh <0. (19.92) 


This conclusion may in special circumstances be significantly different due 
to other passive cavities in the accelerator. The shift in the synchrotron tune is 
proportional to the beam current and can be used as a diagnostic tool to determine 
the cavity impedance or its deviation from the ideal model (19.90). 

In the preceding discussion it was assumed that only resonant cavities contribute 
to Robinson damping. This is correct to the extend that other cavity like structures 
of the vacuum enclosure in a circular accelerator have a low quality factor Q for the 
whole spectrum or at least at multiples of the revolution frequency and therefore do 
not contribute significantly to this effect through a persistent energy loss over many 
turns. Later we will see that such low-@Q structures in the vacuum chamber may lead 
to other types of beam instability. 


19.5.2. Potential Well Distortion 


The synchrotron frequency is determined by the slope of the rf-voltage at the 
synchronous phase. In the last subsection the effect of beam loading at the cavity 
fundamental frequency was discussed demonstrating the need to include the induced 
voltages in the calculation of the synchrotron oscillation frequency. These induced 
voltages cause a perturbation of the potential well and as a consequence a change 
in the bunch length. In this subsection we will therefore also include higher-order 
interaction of the beam with its environment. 

It is not possible to derive a general expression for the impedance of all com- 
ponents of a vacuum chamber in a circular accelerator. However, measurements [7] 
have shown that the impedance spectrum of circular accelerator vacuum chambers, 
while excluding accelerating cavities, has the form similar to that of the SPEAR 
storage ring shown in Fig. 19.9. 


Fig. 19.9 SPEAR impedance Z(o) 
spectrum [7] Ts 
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Up to the transition frequency f,, which is determined by vacuum chamber dimen- 
sions, the impedance is predominantly inductive and becomes capacitive above the 
transition frequency. We are looking here only for fields with wavelength longer 
than the bunch length which may distort the rf-voltage waveform such as to change 
the slope for the whole bunch. Later we will consider shorter wavelength which 
give rise to perturbations within the bunch. Because the bunch length is generally of 
the order of vacuum chamber dimensions we only need to consider the impedance 
spectrum below transition frequency which is predominantly inductive. To preserve 
generality, however, we assume a more general but still purely imaginary impedance 
defined by 


Z(@)| = iw@Z). (19.93) 


Studying the modification of a finite bunch length due to potential-well distor- 
tions we use for mathematical simplicity a parabolic particle distribution [8] in phase 
(Fig. 19.10) normalized to the bunch current les I(p)dg =I, 


T 2 
I(y) = ~ (: - ) (19.94) 
Ge Qo 


where 2g, is the bunch length expressed in terms of a phase with respect to the 
fundamental rf-wavelength. The combined induced voltage in the whole vacuum 
chamber is 


V Zi haoZ as hi (Z / ) (19.95) 
— —_ = — = ny) —, . 
2 ll at MoI dg mM {AI do 
where we have introduced the normalized impedance 
al tg 1 
= iaZ| . (19.96) 
n 


which is the longitudinal impedance divided by the frequency in units of the 
revolution frequency or by the mode number n = w/w. Inserting (19.94) 


Fig. 19.10 Current 
distribution for potential-well 31,/4@, 
distortion 


-@, 0 9, 
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into (19.95) we get the induced voltage 


3hIy Im (Z| /n 
Vz= mei (19.97) 
29) 


which must be added to the rf-voltage Viz. = Vey sin (Ws + ~). Forming an effective 
voltage we get 


3hIp Im (Z; /n) 


Vere = Vey cos W, | 1 — 
7 " ( 20) Vey Cos Ws 


p + Vey sin Ws . (19.98) 


This modification of the effective cavity voltage leads to an incoherent shift of 
the synchrotron oscillation frequency 


a 3ncel 
=p — TE tm (Zj/n) , (19.99) 
Q 4g? Ev: 
where we used the definition of the synchrotron tune v2 = — 


Above transition energy 7,.cosW; > 0 and therefore the frequency shift is 
positive for Im(Z,/n) < 0 and negative for Im{Z,/n} > 0. We note specifically 
that the shift depends strongly on the bunch length and increases with decreasing 
bunch length, a phenomenon we observe in all higher-order mode interactions. 

Note that this shift of the synchrotron oscillation frequency does not appear for 
coherent oscillations since the induced voltage also moves with the bunch oscil- 
lation. The bunch center actually sees always the unaltered rf-field and oscillates 
according to the slope of the unperturbed rf-voltage. The coherent synchrotron 
oscillation frequency therefore need not be the same as the incoherent frequency. 
This has some ramification for the experimental determination of the synchrotron 
oscillation frequency. 

The shift in incoherent synchrotron oscillation frequency reflects also a change 
in the equilibrium bunch length which is different for proton or ion beams compared 
to an electron bunch. The energy spread of radiating electron beams is determined 
only by quantum fluctuations due to the emission of synchrotron radiation and 
is independent of rf-fields. The electron bunch length scales therefore inversely 
proportional to the synchrotron oscillation frequency and we get with §2,/Qs9 = 
ovo /o¢ from (19.99) after solving for o¢ /aeo 


OL eB eely (4) = (19.100) 
Fin Fe0 91? /210;)Eve . 


where we replaced the parabolic current distribution by a Gaussian distribution with 
equal total bunch current and equal intensity in the bunch center by setting gy = 
3/27 /4ho;/R and where oxo is the unperturbed bunch length. 
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Non-radiating particles, in contrast, must obey Liouville’s theorem and the 
longitudinal beam emittance € Ap will not change due to potential-well distortions. 
For proton or ion bunches we employ the same derivation for the bunch lengthening 
but note that the bunch length scales with the energy spread in such a way that the 
product of bunch length £ and momentum spread Ap remains constant. Therefore 
£ x 1/./Q, and the perturbed bunch length is from (19.99) with £ = (R/h) ge 


¢4 eh R. (Z 
Sfeklt im(41) ¢ 1=0, (19.101) 


 4nEv2 n} lo 


Of course, along with this perturbation of the proton or ion bunch length goes an 
opposite perturbation of the energy spread. 


Problems 


19.1 (S). Consider an electron storage ring to be used as a damping ring for a linear 
collider. The energy is E = 1.21 GeV, circumference C = 35.27m, bending radius 
p = 2.037m, momentum compaction factor a, = 0.01841, rf harmonic number 
h = 84, cavity shunt impedance of Ry = 8.4 MQ. An intense bunch of N. = 
5 x 10!° particles is injected in a single pulse and is stored for only a few msec 
to damp to a small beam emittance. Specify and optimize a suitable rf-system and 
calculate the required rf-cavity power, cavity voltage, coupling factor first while 
ignoring beam loading and then with beam loading. Assume a quantum lifetime 
of Lh. 


19.2 (S). Show that for bunches short compared to the rf-wavelength the harmonic 
amplitudes are J, = 2/p. 
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Part VII 
Coupled Motion 


Chapter 20 
Dynamics of Coupled Motion* 


In linear beam dynamics transverse motion of particles can be treated separately in 
the horizontal and vertical plane. This can be achieved by proper selection, design 
and alignment of beam transport magnets. Fabrication and alignment tolerances, 
however, will introduce, for example, rotated quadrupole components where only 
upright quadrupole fields were intended. In other cases like colliding beams for 
high energy physics large solenoid detectors are installed at the collision points 
to analyse secondary particles. Such solenoids cause coupling which must be 
compensated. The perturbation caused creates a coupling of both the horizontal 
and vertical oscillation and independent treatment is no longer accurate. Such linear 
coupling can be compensated in principle by additional rotated or skew quadrupoles, 
but the beam dynamics for coupling effects must be known to perform a proper 
compensation. 

Since coupling is caused by linear as well as nonlinear fields, we observe this 
effect in virtually any accelerator. In order to be able to manipulate coupling in a 
controlled and predictable way, we need to understand its dynamics in more detail. 
In this chapter, we will derive first the equations of motion for the two most general 
sources of coupling, the solenoid field and the field of a rotated quadrupole, solve 
the equations of motion and formulate modifications to beam dynamics parameters 
and functions of linear uncoupled motion. 


20.1 Equations of Motion in Coupled Systems 


The most generally used magnets that introduce coupling in beam transport systems 
are rotated quadrupoles and solenoid magnets and we will restrict our discussion 
of coupled beam dynamics to such magnets defining the realm of linear coupling. 
Equations (6.95), (6.96) include all linear and nonlinear coupling terms up to third 
order while longitudinal fields are treated in Sect. 6.6. The equations of motion in 
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the presence of upright and rotated quadrupoles as well as solenoid fields are 


x" +kx = —ky + Sy’ + 3S'y, 


20.1 
y" ky = kx — Sxl — 58'x ey) 
where the solenoid field is expressed by 
e 
S(z) = Pe (20.2) 


We use in this chapter the symbol S for the solenoid field not to be confused with 
the sine-like solution. In the following subsections we will derive separately the 
transformation through both rotated quadrupoles and solenoid magnets. 


20.1.1 Coupled Beam Dynamics in Skew Quadrupoles 


The distribution of rotated or skew quadrupoles and solenoid magnets is arbitrary 
and therefore no analytic solution can be expected for the differential equa- 
tions (20.1). Similar to other beam line elements, we discuss solutions for the 
equations of motion within individual magnets only and assume that strength 
parameters within hard-edge model magnets stay constant. We discuss first solutions 
of particle motion in skew quadrupoles alone and ignore solenoid fields. The 
equations of motion for skew quadrupoles are from (20.1) 


x 4 k y=0, 
tks SO, (20.3) 

These equations look very similar to the equations for ordinary upright 
quadrupoles except that the restoring forces now depend on the particle amplitude 
in the other plane. We know the solution of the equation of motion for an upright 
focusing and defocusing quadrupole and will try to apply these solutions to (20.3). 
Combining the observation that each quadrupole is focusing in one plane and 
defocusing in the other with the apparent mixture of both planes for a skew 
quadrupole, we will try an ansatz for (20.3) which is made up of four principal 
solutions 


x=acosg+ — sing +c coshg + ai sinh @ , 


fe fe (20.4) 
y=Acosg+ VE sing + C coshe + 7 sinhg , 


where g = af kz and the variable z varies between zero and the full length of the 
quadrupole, 0 < z < fy. The coefficients a, b,c,...D must be determined to be 
consistent with the initial parameters of the trajectories (xo, x4. yo. Yo). For z = 0 we 
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get 
— = A 
haa Se es ee: (20.5) 
X =b+d, y =B+D. 
Solutions (20.4) must be consistent with (20.3) from which we find 
a=A c=-C 
, , 20.6 
b=B, d=-D. ( ) 


From (20.5), (20.6) we get finally the coefficients consistent with the initial 
conditions and the differential equations (20.3) 


a=A=S(%+y0), D=B=F0H+N), 


(20.7) 
c=—-C=5@0-y0), d= —-D = 5) —-%)- 
With these definitions the transformation through a skew quadrupole is 
xX XO 
x x 
= Mag: | * (20.8) 
y Yo 
sf Yo 
where M,, is the transformation matrix for a skew quadrupole, 
+ Lot = Lo 
c ye Cc oye 
1] —/kS~ ct —/kSt C7 
Mag(s10) = 5 se i Ae ay (20.9) 
vi vi 


ayes = SEs Gt 


with C+ = C*(g) = cosy + coshg and S~ = S*(y) = sing + sinhg and 
p= Jk. 

This transformation matrix is quite elaborate and becomes useful only for 
numerical calculations on computers. We employ again thin lens approximation 
where the quadrupole length vanishes (h4 > 0) in such a way as to preserve the 
integrated magnet strength or the focal length f. The matrix (20.9) then reduces to 
the simple form 


i ty © 0 
0 1-I/f 0 

Meq(0 [Esq ) = (20.10) 
eee 0 0 I. & 
-1/f0 0 1 
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where the focal length is defined as f~! = kK Lg. Note, that we have not set 
sq = 0 but retained the linear terms in ¢,g, which is a more appropriate thin-lens 
approximation for weak skew quadrupoles of finite length. Along the diagonal, the 
transformation matrix looks like a drift space of length ¢,q while the off-diagonal 
elements describe the coupling due to the thin skew quadrupole. 


20.1.2. Particle Motion in a Solenoidal Field 


The equations of motion in a solenoid can be derived from (6.92a), neglecting all 
transverse beam deflection and electric fields 


x de’? e Sis 
we = 50 (Bs — By), 


1 
x 22 de, (20.11) 
y 14% = 52 (B: — xB), 


where the solenoid field component B,, assumed to be colinear with the z-direction, 
can be derived from (6.103) 


B= (—3B,x,—3B.y, Bs) « (20.12) 


Following the same derivation as in Sect.6.5, the general equations of motion in a 
solenoid field including up to third-order terms are 


e 
fed pale Oey (20.13) 
P P 
1 @ /2,) 2. pi 13 /2 pl 
+ a y By + x*yB + 2y°B, + yy” Bi) + O(4), 
y’= —"B, x — 1o py (20.14) 
P “?P 
-} 7 (2x'y?B, + xy? Bi, + 2x3B, + xx?B‘) + O(4). 
P 


Considering only linear terms, the equations of motion in a solenoidal field simplify 
to 


x" = +£Byy + SEBly, 
a a (20.15) 


exhibiting clearly coupling terms. In a uniform field, where BY = 0, the particle 
trajectory assumes the form of a helix parallel to the axis of the solenoid field. 

The equations of motion (20.15) have been derived under the assumption of 
paraxial rays so that we can set v ~ v,. Ina solenoid field this approximation is not 
generally acceptable since we may, for example, be interested in using a solenoid 
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to focus particles emerging from a target at large angles. We therefore replace all 
derivatives with respect to z by derivatives with respect to the time, use the particle 
velocity v, and replace 4 > ia. In a uniform solenoid field the equations of 
motion are then 


t=+ (£B.v) 3 =0,), 


. : ; ; (20.16) 
y= (£8.v) x= -a,X, 
where the Larmor frequency w, is defined by 
Bs a 
oo = a = 8, , (20.17) 
Pp E 


and E is the total particle energy. Multiplying (20.16) by x and y, respectively, and 
adding both equations we get d(x? + 3*)/dt = 0 or 


P+y¥= v? = const. (20.18) 
The transverse particle velocity vy; or total transverse momentum of the particle 
cpr stays constant during the motion in a uniform solenoid field. For x» = 0 and 


yo = UY, for example, the transverse velocities can be expressed by 


xX = YU sina, f, 


, (20.19) 
y= y;, cosa, t 
and the solutions of the equations of motion are 
x(t) = x9 — “+ cosa@,t, 
ou ‘ (20.20) 


y(t) = yo + . sina, t. 


The amplitude of the oscillating term in (20.20) is equal to the radius of the helical 
path 


eB,’ 


Pn (20.21) 


where we have used the Larmor frequency (20.17) and set the transverse momentum 
Pt = ymy,. The longitudinal motion is unaffected for not too strong solenoid fields 
and v, = 0 as can be derived from the Lorentz equation since all transverse field 
components vanish and 


z(t) = 2 + et. (20.22) 
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The time to complete one period of the helix is 


T= — (20.23) 


during which time the particle moves along the z-axis a distance 


Pz 
eB,” 


Az = 2n (20.24) 


where p-, is the z-component of the particle momentum. 

The solutions of the equations of motion for a solenoid magnet are more complex 
since we must now include terms that depend on the slope of the particle trajectories 
as well. Ignoring skew quadrupoles the differential equations of motion in a solenoid 
magnet becomes from (20.15) 


x” —S()y — 4S’) y=0, 


20.25 
y +S@x + 7S'@x=0. a 


Coupling between both planes is obvious and the variation of coordinates in 
one plane depends entirely on the coordinates in the other plane. We note a high 
degree of symmetry in the equations in the sense that both coordinates change 
similar as a function of the other coordinates. This suggests that a rotation of the 
coordinate system may help simplify the solution of the differential equations. We 
will therefore try such a coordinate rotation in complex notation by defining 

R=(«t+iy)e79®, (20.26) 


where the rotation angle @ may be a function of the independent variable z. A single 
differential equation can be formed from (20.25) in complex notation 


(x + iy)” +i1S(z) (e+ iy) +i5S'(2 @ + iy) = 0. (20.27) 
The rotation (20.26) can now be applied directly to (20.27) and with 
(x + iy)’ = R’ el? + if’ Ret? 
and 
(x + iy)” = R’e'* + 2id/R'e'? + if" Re'® — pb” Re'®. (20.28) 
After insertion into (20.26) and sorting of terms 


R" — [S(z)¢’ + $7 ]R + i2[b’ + 55(2)]R’ + ilo” + 55'(2]R = 0. (20.29) 
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At this point, the introduction of the coordinate rotation allows a great simplifica- 
tion (20.28) by assuming a continuous rotation along the beam axis with a rotation 
angle defined by 


b(z) = —-4 / “ s(t) de (20.30) 


20 


where the solenoid field starts at z. We are able to eliminate two terms in the 
differential equation (20.28). Since a positive solenoid field generates Lorentz forces 
that deflect the particles onto counter clockwise spiraling trajectories, we have 
included the negative sign in (20.30) to remain consistent with our sign convention. 
From (20.30) it follows that ¢’ = —3S (z) and 6” = —}3' (z), which after insertion 
into (20.28) results in the simple equation of motion 


R" +59 (QR=0. (20.31) 
With R = v + iw, we finally get two uncoupled equations 


vo” + 49'(z)v =0, (20.32) 


w" + 59° (2)w=0. 


Introducing a coordinate rotation allow us to reduce the coupled differential 
equations (20.25) to the form of uncoupled equations of motion exhibiting focusing 
in both planes. At the entrance to the solenoid field ¢ = 0 and therefore vp = xo and 
wo = yo. To determine the particle motion through the solenoid field of length L, 
we simply follow the particle coordinates (v, w) through the solenoid as if it were 
a quadrupole of strength k, = + S?(Ls) followed by a rotation of the coordinate 
system by the angle —¢(L;) thus reverting to Cartesian coordinates (x, y). 


20.1.3 Transformation Matrix for a Solenoid Magnet 


Similar to the transformation through quadrupoles and other beam transport mag- 
nets, we may formulate a transformation matrix for a solenoid magnet. Instead of 
uncoupled 2 x 2 transformation matrices, however, we must use 4 x 4 matrices to 
include coupling effects. Each coordinate now depends on the initial values of all 
coordinates, x(z) = (x0.xp.Yo.¥p), etc. The transformation through a solenoid is 
performed in two steps in which the first is the solution of (20.32) in the form of the 
matrix M,, and the second is a coordinate rotation introduced through the matrix 
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M,. The total transformation is therefore 


XxX Xo 
J J 
~*~ 1—m,m, | ~° (20.33) 
y Yo 
y’ Yo 


In analogy to the transformation through an upright quadrupole, we get 
from (20.32) the transformation matrix MM, from the beginning of the solenoid field 
at z) to a point z inside the solenoid magnet. The strength parameter in this case is 
iS? assumed to be constant along the length of the magnet and the transformation 
matrix is 


cosP <sing 0 0 
—Ssing cosd 0 0 
s s) = 2 : A 20.34 
M:s(Zol2) 0 0 cos = sing ( ) 
0 0 —Ssing cos 


where d = +Sz. The next step is to introduce the coordinate rotation M, which we 
derive from the vector equation 


(x tiy) = (v+iw)e #9 , (20.35) 


where the vectors are defined like x = (x, x’), etc. Note that the value of the rotation 
angle ¢ is proportional to the strength parameter and the sign of the solenoid field 
defines the orientation of the coordinate rotation. Fortunately, we need not keep 
track of the sign since the components of the focusing matrix M, are even functions 
of z and do not depend on the direction of the solenoid field. 

By separating (20.35) into its real and imaginary part and applying Euler’s 
identity e = cosa + isina, we get for the rotation matrix at the point z within 
the solenoid magnet 


cos @ 0 sing 0 
S . AY . 
—5sing cosp 5cosd sing 
—sing 0 cos @ 0 


S cos — sing —Ssing cos d 


M, = (20.36) 


The total transformation matrix for a solenoid magnet from zp = 0 to z finally is 
the product of (20.34) and (20.36) 


cos’? tsin2d fsin2d zsin’ 
S gi 5 : 
—ssin2¢ cos2 5cos2@ sin2 
Msoi(O|z < L) = To ? 2 o i mee) ? ie ig 
— 7 sin2¢ —Fsin’d cos d 3 sin 26 


—3 cos 2¢ —sin2¢d —$ sin2 cos 2 


(20.37) 
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This transformation matrix is correct inside the solenoid magnet but caution must 
be taken applying this transformation matrix for the whole solenoid by setting z = 
L,. The result would be inaccurate because of a discontinuity caused by the solenoid 
fringe field. Only the focusing matrix M, for the whole solenoid becomes a simple 
extension of (20.34) to the end of the solenoid by setting ¢(L;) = ® = SSL. 

Due to the solenoid fringe field, which in hard-edge approximation adopted here 
is a thin slice, the rotation matrix exhibits a discontinuity. For z = L, + €, where 
€ — 0 the phase is ¢(Ls;) = @® but the solenoid strength is now zero, S = 0. 
Therefore, the rotation matrix (20.36) assumes the form 


cos 0 sn®@ 0 
0 cos @ 0 sin@® 
—_ : 20. 
aa —sin ® 0 cosh 0 aise) 


0 —sn® 0 cos® 


Notice that this matrix at the solenoid entrance is just the unit matrix. This does 
not mean that we ignored the entrance fringe field, it only indicates that this effect 
is already included in (20.37). After multiplication of (20.34) with (20.38), the 
transformation matrix for a complete solenoid magnet is finally 


cos @ isin2d Fsin2d <sin’d 
Sas 2 “S orn2 lo: 
—2sin2® cos*-® —s sin ® ~sin2@ 
Meso O|L) = rf F Fi 2 2 20.39 
(012) —sin2® —< sin?® cos? 1sin2@® ( ) 


S sin? @ —4 sin 2 -* sin2® cos? @® 


Comparing matrices (20.37), (20.39), we find no continuous transition between 
both matrices since only one matrix includes the effect of the fringe field. In reality, 
the fringe field is not a thin-lens and therefore a continuous transition between both 
matrices could be derived. To stay consistent with the rest of this book, however, we 
assume for our discussions hard-edge magnet models. 

From the matrix (20.34) some special properties of particle trajectories in a 
solenoid can be derived. For @ = 51 a parallel beam becomes focused to a point at 
the magnet axis. A trajectory entering a solenoid with the strength ® = 5 SL= 2/2 
at say yo will follow one quarter of a spiraling trajectory with a radius p = yo/2 and 
exit the solenoid at x = y = 0. Similarly, a beam emerging from a point source 
on axis and at the start of the solenoid field will have been focused to a parallel 
beam at the end of the solenoid. Such a solenoid is used to focus, for example, a 
divergent positron beam emerging from the target source and is called a A/4-lens or 
quarter-wavelength solenoid for obvious reasons. 

The focusing properties of the whole solenoid are most notable when the field 
strength is weak and the focal length is long compared to the length of the solenoid. 
In this case, the focal length can be taken immediately from the M2; and M43 element 
of the transformation matrix as we did for quadrupoles and other focusing devices 
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and is with @ = 5SLs 


1 


Z = My; = —$S sing cosd, (20.40) 
1 iat 

Fa = My = —35S sing cos¢. (20.41) 
; 


In contrast to quadrupole magnets, the focal length of a solenoid magnet is the 
same in both planes and is in thin-lens approximation, @ = 5SL, — 0 while 
S?L, =const. 


1 
fool 


The thin lens transformation matrix for a weak solenoid is thereby 


1 2 
= 197, = (<) B? L,. (20.42) 
4\p : 


100 0 
med 0 0 

Msoi(0|L) = Ha 0.4 6 (20.43) 
0 Ee nm 


The focal length is always positive and a solenoid will therefore always be 
focusing independent of the sign of the field or the sign of the particle charge. 

Transformation matrices have been derived for the two most important coupling 
magnets in beam transport systems, the skew quadrupole and the solenoid magnet, 
which allows us now to employ linear beam dynamics in full generality including 
linear coupling. Using (4 x 4)-transformation matrices any particle trajectory can 
be described whether coupling magnets are included or not. Specifically, we may 
use this formalism to incorporate compensating schemes when strongly coupling 
magnets must be included in a particular beam transport line. 


20.2 Betatron Functions for Coupled Motion 


For the linear uncoupled motion of particles in electromagnetic fields we have 
derived powerful mathematical methods to describe the dynamics of single particles 
as well as that of a beam composed of a large number of particles. Specifically, the 
concept of phase space to describe a beam at a particular location and the ability 
to transform this phase space from one point of the beam transport line to another 
allow us to design beam transport systems with predictable results. These theories 
derived for particle motion in one degree of freedom can be expanded to describe 
coupled motion in both the horizontal and vertical plane. 
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20.3 Conjugate Trajectories 


Lattice functions have been defined to express solutions to the equations of motion 
for individual trajectories. Conversely, there must be a way to express these lattice 
functions by the principal solutions of the equation of motion. This would enable 
us to determine lattice functions for coupled particle motion by integrating the 
equations of motion for two orthogonal trajectories. To do this, we start from the 
differential equation of motion in normalized coordinates for which two linearly 
independent principal solutions are given by 


wi(g) = cos (vg) , 20.44 
wa(g) = sin (vg) . —— 


For simplicity, we set the initial amplitudes equal to unity and get in regular 
coordinates with u(z) = w./ f(z) the conjugate trajectories are 


ui (z) = vy B(z) cos y(z), 20.45 
w(2) = VB@ sin (2), ai 


where u(z) stands for x(z) or y(z), and their derivatives 


Wi (2) = A cos Ww) — Fas sin (2), 


a(z) 


uz) = AG sin W(z) + Tis cos y(z). 


(20.46) 


Using (20.45), (20.46) all lattice functions can be expressed in terms of conjugate 
trajectories like 


B(z) = uy (2) + u5(2) , 
a(z) = —uy(z) uw’ (z) — ua(z) u5(z) , (20.47) 
y(z) = uP (2) + uF (2). 


The betatron phase advance AW = y — Wo between the point z = 0 and the 
point z can be derived from 


cos(w — Wo) = cos cos Wo + sin sin Wo, 
where Wo = (0) and y = w(z). With (20.45), (20.47) we get 


uy (Zz) _ uy(z) 
VBQ) ge (z) + u5(z) 


cos W(z) = (20.48) 
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and similarly, 


u2(z) _ uz(z) 
VPQ@) /w2(z) + (2) 


The betatron phase advance then is given by 


sin ¥(z) = (20.49) 


j= (20.50) 


eae ie 2 
Ve + Uy V/ Ui9 + 429 


where uj; = uj;(z) and uj = u;(0). Finally, we can express the elements of the 
transformation matrix from z = 0 to z by 


M(z\0) = ie a) = bane Uso U10 es (20.51) 


/ A / / / / 
M>, Mo Uy Uyy — Uy Ung U0 Uy — U2 Uy 


The two linearly independent solutions (20.45) also can be used to define and 
characterize the phase space ellipse. At the start of a beam line we set z = 0 and 
w(0) = 0 and define an ellipse by the parametric vector equation 


u(0) = a[u,(0) cos¢ — un(0) sing], (20.52) 


where 
u(0) = (“”) and =u,(0) = & ) (20.53) 
Ug Hig 
As ¢ varies over a period of 27, the vector follows the outline of an ellipse. To 
parametrize this ellipse we calculate the area enclosed by the phase ellipse. The area 
element is dA = u'duo, from (20.52) we get 
dup = a[ujo sind — ux9 cos d] dd (20.54) 


and the area enclosed by the ellipse is 


A= 20° | (ui) Cosh — us, sind) (uo sind — u29 cos d) dd (20.55) 
0 
= AH (Uquyg — Ujgun0) = a7, 


since the expression in the brackets is the Wronskian, which we choose to normalize 
to unity. The Wronskian is an invariant of the motion and therefore the area of the 
phase ellipse along the beam transport line is preserved. The vector equation (20.52) 
describes the phase ellipse enclosing a beam with the emittance a* = e. 
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The formalism of conjugate trajectories has not produced any new insight into 
beam dynamics that we did not know before but it is an important tool for the 
discussion of coupled particle motion and provides a simple way to trace individual 
particles through complicated systems. 

Ripken [1] developed a complete theory of coupled betatron oscillations and 
of particle motion in four-dimensional phase space. In our discussion of coupled 
betatron motion and phase space transformation we will closely follow his theory. 
The basic idea hinges on the fact that the differential equations of motion provide 
the required number of independent solutions, two for oscillations in one plane and 
four for coupled motion in two planes, to define a two- or four-dimensional ellipsoid 
which serves as the boundary in phase space for the beam enclosed by it. Since the 
transformations in beam dynamics are symplectic, we can rely on invariants of the 
motion which are the basis for the determination of beam characteristics at any point 
along the beam transport line if we only know such parameters at one particular 
point. 

Before we discuss coupled motion in more detail it might be useful to recollect 
some salient features of linear beam dynamics. The concept of conjugate trajectories 
can be used to define a phase ellipse at z = 0 in parametric form. Due to the 
symplecticity of the transformations we find the area of the phase ellipse to be a 
constant of motion and we may describe the phase ellipse at any point z along the 
beam line is given by (20.52). The Wronskian is a constant of motion normalized 
to unity in which case the phase ellipse (20.52) has the area A = ze, where € is 
the beam emittance for the beam enclosed by the ellipse. The solutions are of the 
form (20.45) and forming the Wronskian we find the normalization 


Bo =1 (20.56) 


as we would expect. 
To describe coupled motion we try analogous to (20.52) the ansatz 


v(z) = /€[vi(z) cos 8, — v2(z) sin B,] cos x (20.57) 
+./€;[v3(z) cos 3, — v4(z) sin 3,,] sin x. 


As the independent variables y, 3, and ¥, vary from 0 to 2 the vector v covers all 
points on the surface of a four-dimensional ellipsoid while the shape of the ellipse 
varies along the beam line consistent with the variation of the vector functions vj. 
In this ansatz we chose two modes of oscillations indicated by the index I and II. 
If the oscillations were uncoupled, we would identify mode-I with the horizontal 
oscillation and mode-II with the vertical motion and (20.57) would still hold with 
x = 0 having only horizontal nonvanishing components while v3,4 contain nonzero 
components only in the vertical plane for y = 2/2. For independent solutions v; of 
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coupled motion, we try 


)= By (2) CoS Py, (Z) , yi(z) = By, (2) COs py, (2) 5 

x2(z) = VBu(2) singy(z), y2(z) = VBy (2) singy(2), (20.58) 
) = V Buy) cos bry (Z), y3(2) = By (z) cos py, (Zz), 

) = VB (Z) sings (2), ya(z) = By (z) sin dy, (z) , 


which is consistent with the earlier definitions of conjugate trajectories. Earlier in 
this section we defined conjugate trajectories to be independent solutions normal- 
ized to the same phase ellipse and developed relationships between these trajectories 
and betatron functions. These relationships can be expanded to coupled motion by 
defining betatron functions for both modes of oscillations similar to (20.47) 


Ba =e, Bu =O+2%, (20.59) 
By, = +93. Pn = Ye +3. (20.60) 


The phase functions can be defined like (20.48) by 


x) x3 
cos ox a le) COs xn SS ———— (20.61) 
Vt $3 x3 +34 
Ji y3 


(20.62) 


| 
II 


cos fy, = ————., cos @y, = ————.. 
yity yty 


All other lattice functions can be defined in a similar way. By following the 
conjugate trajectories and utilizing the (4 x 4)-transformation matrices including 
coupling effects we are able to determine the betatron functions at any point along 
the coupled beam transport line. To correlate parameters of the four-dimensional 
phase ellipse with quantities that can be measured, we write the solutions in the 
form 


x1(z) = J Bx (2) cos¢ (2), x2(z) = Bx) sind, @), (20.63) 
X(2) = Vm (Z) 608 Wy (2), X5(Z) = V Ym (Z) Sin Vx, (2) , 


and similar for all other solutions. Comparing the second equations in (20.63) with 
the derivative of the first equations we find the definitions 


0, ee 
ee eae (20.64) 
Buy 


and 


Vu = bx, — arctan p be : (20.65) 
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The other parameters y,,,, etc. are defined similarly and the phase ellipse (20.57) 
can now be expressed by the four-dimensional vector 


oY Bur COS (du + 07) 
Nes JS Vat COS (War + V7) 
V(z) = Jer Bi coe ea) cos x (20.66) 
J Vy COS (Wy 7 dy) 
V But cos (dx + Vir) 
JV COS (War + Vir) |. 
ese Byn cos (bu + 811) men 
JV COS (Wy + B11) 


This vector covers all points on the surface of the four-dimensional ellipsoid as 
x, ¥, and #,, vary independently from 0 to 27. For one-dimensional oscillations we 
know from the definition of the phase ellipse that the product ,/€, i Be is equal to 
the beam size or beam envelope E,, and ,/€, ,/, equal to the angular beam envelope 
A,, where u = x or y. These definitions of beam envelopes can be generalized to 
coupled motion but we find from (20.66) that the envelopes have two contributions. 
Each point on the phase ellipse for an uncoupled beam appears now expanded into 
an ellipse with an area zre,, as shown in Fig. 20.1. 

In a real beam transport line we are not able to observe experimentally the four- 
dimensional phase ellipse. By methods of emittance measurements, however, we 
may determine the area for the projection of the four-dimensional ellipsoid onto the 
(x — x’), the (y — y’) or the (x — y)-plane. 

To do that we note in (20.66) that the maximum amplitude of a particle in the u- 


Euqy Buy 
jp 5 


u 


plane occurs for $,,,, = —Uw, and a projection angle y given by sin? y = 
where the beam envelope for coupled motion is given by 


Ey, = Vv Eu Puy + Eu Bug : (20.67) 


Fig. 20.1 Phase space ellipse 
for coupled motion 


(eB),!? 


A= TE, 


(eB),'” 


A= TE, 
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Similarly, we get from the second component of (20.66) the angular envelope 


Ay = J €u Vuy + Eun Yun (20.68) 
for Ving = Dun and a projection angle given by 
sin? y = Sa Pon (20.69) 


To completely determine the phase ellipse we calculate also the slope x’ for the 
particle at x = E, which is the slope of the envelope E’. Taking the derivative 
of (20.67) we get 


E! = Eu Cu a Eu Cun (20.70) 


u ? 
Vv €un Buy + €vy Buy 


Expressing the equation of the phase ellipse in terms of these envelope definitions 
we get 


Aw? —2EE, wl + Eu” =e (20.71) 


u 


and inserting u = E, and u’ = E’ into (20.71) we get for the emittance of the 


projection ellipse 
€y = Ey A2 — El’. (20.72) 


The envelope functions can be measured noting that FE? = o1,A* = on 
and EE’ = —o,2 where the oj are elements of the beam matrix. Because of 
the deformation of the four-dimensional phase ellipse through transformations, we 
cannot expect that the projection is a constant of motion and the projected emittance 
is therefore of limited use. 

A more important and obvious projection is that onto the (x, y)-plane which 
shows the actual beam cross section under the influence of coupling. For this 
projection we use the first and third equation in (20.66) and find an elliptical beam 
cross section. The spatial envelopes EF, and FE, have been derived before in (20.67) 
and become here 


Ey = V Ex Bx, + €xy Brn ; (20.73) 
Ey = y €y By, ote €yn Byn : (20.74) 
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The y-coordinate for E,, which we denote by E,,, can be derived from the third 
equation in (20.66) noting that now ¥,,, = —Pxyy, X is given by (20.69) and 


i €, V Bx, By, Cos AQ, + &, Pry By, Cos AG, 
xy , 
€x, Bxy a €xy Bry 


where Ad,,, = $x — Py. The beam cross section is tilted due to coupling 
whenever E,,, # 0. The tilt angle w of the ellipse is determined by 


(20.75) 


oe 
Ey — ES 


tan2y = (20.76) 


or more explicitly 


tan2y = 71 Vv Bx By, cos Ady + €, Vv Bx By, COS Abr (20.77) 
€y ABI + €xy, AB 

The beam cross section of a coupled beam is tilted as can be directly observed, 
for example, through a light monitor which images the beam cross section by 
the emission of synchrotron light. This rotation vanishes as we would expect for 
vanishing coupling when f,,, — 0 and £,, — 0. The tilt angle is not a constant of 
motion and therefore different tilt angles can be observed at different points along a 
beam transport line. 

We have discussed Ripken’s theory [1] of coupled betatron motion which allows 
the formulation of beam dynamics for arbitrary strength of coupling. The concept 
of conjugate trajectories and transformation matrices through skew quadrupoles 
and solenoid magnets are the basic tools required to determine coupled betatron 
functions and the tilt of the beam cross section. 


20.4 Hamiltonian and Coupling 


In practical beam transport systems particle motion is not completely contained in 
one or the other plane although special care is being taken to avoid coupling effects 
as much as possible. Coupling of the motion from one plane into the other plane can 
be generated through the insertion of actual rotated magnets or in a more subtle 
way by rotational misalignments of upright magnets. Since such misalignments 
are unavoidable, it is customary to place weak rotated quadrupoles in a transport 
system to provide the ability to counter what is known as linear coupling caused by 
unintentional magnet misalignments. Whatever the source of coupling, we consider 
such fields as small perturbations to the particle motion. 

The Hamiltonian treatment of coupled motion follows that for motion in a single 
plane in the sense that we try to find cyclic variables while transforming away 
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those parts of the motion which are well known. For a single particle normalized 
coordinates can be defined which eliminate the z-dependence of the unperturbed 
part of the equations of motion. Such transformations cannot be applied in the case 
of coupled motion since they involve the oscillation frequency or betatron phase 
function which is different for both planes. 


20.4.1 Linearly Coupled Motion 


We will derive some properties of coupled motion for the case of linear coupling 
introduced, for example, by a rotated quadrupole. Equations of linearly coupled 
motion are with k = p(z) of the form 


x" +kx = —p(z)y, 


20.78 
y'-yx= —p(z)x, ( ) 

which can be derived from the Hamiltonian for linearly coupled motion 
Hagx? tty? + dk — Sky? +ploxy. (20.79) 


This Hamiltonian is composed of an uncoupled Hamiltonian Hp and the pertur- 
bation Hamiltonian for linear coupling 


My = p(z)xy. (20.80) 


The solutions for the uncoupled equations with integration constants c, and ¢ are of 
the form 


u(z) = Cur Bu cos [Wu(2) + 4] , 


u (z) = — Tk fa,(z) cos [W(z) + @] + sin[Wi(z) + A]} (20.81) 


Applying the method of variation of integration constants, we try the ansatz 


u(z) = V 2au(z) V'Bu Cos Wul(z) + d(2)] . 


ae : (20.82) 
uw! (2) = —y/ © fau(2) cos [Yu(z) + $@)] + sin Yul) + O@D 
for the coupled motion. Now we use the integration constants (a, @) as new variables 
and to show that the new variables are canonical, we use the Hamiltonian equations 
0H/du' =du/dz and dH/du = —du'/dz and get 


0H dH OM, du du oduda du 0d 
— — — 2 . 
a 0 OO Se: ae Ob De ce 
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A similar expression exists for the second Hamiltonian equation of motion 


0H _ 0H 4 0M, du’ _—du'_—s du da_—sout' 0 (20.84) 
du Ou du—s—sddz—“‘<iés:SC<“‘«iéi CSC , 


For uncoupled oscillators we know that a = const. and ¢@ = const. and therefore 
du/dz = OHo/du' and du'/dz = —dHp/du. With this we derive from (20.81)- 
(20.84) the equations 


OH, __ OH, du + dH, du’ _ _ da 

dd du op du’ Ob dz’ 

oH, __ OH du de dM, du’ __ dd (20.85) 
da du da du’ da ~~ dz? 


demonstrating that the new variables (@,a) are canonical variables and (20.82) 
are canonical transformations. Applying (20.82) to the perturbation Hamilto- 
nian (20.80) with appropriate indices to distinguish between horizontal and vertical 
plane, the perturbation Hamiltonian becomes 


Ay = 2p(2) V BxBy Jaxay COS (Wx + bx) cos (Wy + oy) , (20.86) 
where z is still the independent variable. The dynamics of linearly coupled motion 


becomes more evident after isolating the periodic terms in (20.86). For the 
trigonometric functions we set 


cos (Wy + bu) = 5 jean + aiiaid (20.87) 


and the Hamiltonian takes the form 


Hy = 572) BxBy Jad Yel Utd Hier), (20.88) 


Ay 
where the non-zero integers /, and /, are integers defined by 
l,l) 4(-1, 1). (20.89) 


Similar to the one-dimensional case we try to separate constant or slowly varying 
terms from the fast oscillating terms and expand the exponent in (20.88) like 


LW + hy Wy — Lvoxe — by voye 
+ LVorp + LVvoyQ + lby + lydy , (20.90) 


where ¢@u = Vou, Vou are the tunes for the periodic lattice, g@ = 27z/L and L is 
the length of the lattice period. The first four terms in (20.90) are periodic with the 
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period g (L) = 2x + (0). Inserting (20.90) into (20.88) we get with y, (L) = 
27 Vou + Wu (0) 


Ay = ~ 0) Bx yell th vy—bvag—hvore] 
Tals 
x Jay 2 eilevaethvoeth wy thoy] - (20.91) 
In this form we recognize the periodic factor 


A (g) = P(z) BxByeile¥st b¥y—k roe breve] (20.92) 


since betatron functions and perturbations p(z) =k(z) are periodic. After expand- 
ing (20.92) into a Fourier series 


L . 
5A) =D kane (20.93) 
q 


coupling coefficients can be defined by 


1 20 L ; 

= — —A(v)e"d 

Kadedy = 57 oe (p)e p 

1 a 4 22 

= — | ky/B.Byell thy Goreet buna) F 4] ae (20.94) 

2m Jo “Yo - 
Since kg.1,1 = Kg—1,-1 and Kg1,-1 = Kg—-1,1, we have with -1 </< +1 
1 L FH 20 
Kai = / k BxByeil™ +1 Wry (vox +1 voy qN ) L z] dz. (20.95) 
; 7 ae i 


The coupling coefficient is a complex quantity indicating that there are two 
orthogonal and independent contributions which require also two orthogonally 
independent corrections. Now that the coupling coefficients are defined in a 
convenient form for numerical evaluation we replace the independent variable z by 
the angle variable gy = 27z/L and obtain the new Hamiltonian A, = 7H 1 Or 


H= ee Gedy cos ($y + Idy + Aggy) , (20.96) 


where 


A, = Vox + L Voy = qN : (20.97) 
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Most terms in (20.96) are fast oscillating and therefore cancel before any damage 
can be done to particle stability. One term, however, is slowly varying for g = r 
defining the resonance condition for coupled motion (A, ~~ 0) or 


rN & vor + Loy. (20.98) 


In this resonant case, the quantity A, is the distance of the tunes from the coupling 
resonance as defined by (20.97) with g = r. Neglecting all fast oscillating terms 
we apply one more canonical transformation (¢,,4,) —> (bus Gu) to eliminate 
the independent variable g from the Hamiltonian. In essence, we thereby use a 
coordinate system that follows with the unperturbed particle and exhibits only the 
deviations from the ideal motion. From the generating function 


G = a, ($y + $Ag) + & (6) + Ag) (20.99) 
we get for the new variables 


bx = PHHy+pArg, ip = Te = Gy, 
Pe = Bi Ox Ob (20.100) 
by =, = by FIZArg, oy ag, O93 


and the new Hamiltonian for the rotating coordinate system is 


~ ~ 0G 
H, = H+ 


aa H+ tA,a,+lhAr-ay. (20.101) 
g 


For simplicity we drop the tilde on the amplitudes and use (a,, ay) . The resonant 
Hamiltonian becomes after this transformation 


H, = 4A, (a, + lay) + k1,/aa, cos ($y + 1 dy) (20.102) 


and the equations of motion are 


day _ OH, _ ra ard re 

Te = Tage = Ket (Gq sin (dx +1 gy) , o-68) 
day oH, wpe ~ . 
n= 7 = 1k,1,/Gdy sin (~ + 1 dy) , 


and 


Ibe a o p p 

a Bem Lat be (Zeal 18), 20.08 
aby alt oe p p | 
Ba Baltare ta yomti +) 


From these equations we can derive criteria for the stability or resonance 
condition of coupled systems. Depending on the value of / we distinguish a sum 
resonance if / = +1 or a difference resonance if / = —1. 
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Linear Difference Resonance 


In case of a difference resonance (J = —1) we add both Eqs. (20.103) and get 


d 
7 (a, + ay) =0. (20.105) 


The coupled motion is stable because the sum of both amplitudes does not 
change. Both amplitudes a, and a, will change such that one amplitude increases 
at the expense of the other but the sum of both will not change and therefore 
neither amplitude will grow indefinitely. Since a, and ay are proportional to the 
beam emittance, we note that the sum of the horizontal and vertical emittance stays 
constant as well, 


€, + €) = const. (20.106) 
The resonance condition (20.98) for a difference resonance becomes [2] 
v»—vym, =N. (20.107) 


Our discussion of linear coupling resonances reveals the feature that a difference 
resonances will cause an exchange of oscillation amplitudes between the horizontal 
and vertical plane but will not lead to beam instability. This result is important 
for lattice design. If emittance coupling is desired, one would choose tunes which 
closely meet the resonance condition. Conversely, when coupling is to be avoided 
or minimized, tunes are chosen at a save distance from the coupling resonance. 

There exists a finite stop-band width also for the coupling resonance just as 
for any other resonance and we have all the mathematical tools to calculate that 
width. Since the beam is not lost at a difference coupling resonance, we are also 
able to measure experimentally the stop-band width by moving the tunes through 
the resonance. The procedure becomes obvious after linearizing the equations of 
motion (20.103), (20.104). Following a suggestion by Guignard [3], we define new 
variables similar in form to normalized coordinates 


= idx 
ta Vee (20.108) 
v= /a,e'*y : 


Taking derivatives of (20.108) with respect to g and using (20.103), (20.104) we 
get after some manipulation the linear equations 


dw _ ;1 

dp = 17 (ku + Aw), 
dv — jl 
dg 2 


(20.109) 


where we have set for simplicity k,-; = Kk. 
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These equations can be solved analytically and will provide further insight into 
the dynamics of coupled oscillations. We will look for characteristics of coupled 
motion which do not depend on initial conditions but are general for all particles. 
Expecting the solutions w and v to describe oscillations, we assume that the motion 
in both planes depends on the initial conditions wo, vo in both planes due to the 
effect of coupling. For simplicity, however, we study the dynamics of a particle 
which starts with a finite amplitudes wo 4 0 in the horizontal plane only and set 
vo = 0. The ansatz for the oscillations be 


= ive —ive 
w(~) = Wo (a a + Oe. ), (20.110) 
v(y) = wo (ce? + de") , 
where we define an as yet undefined frequency v. Inserting (20.110) into (20.109) 
the coefficients of the exponential functions vanish separately and we get from the 
coefficients of e'”? the two equations 


2va=xkc+ A,a, 


20.111 
2ve=Kkat+A,c, ( ) 


from which we may eliminate the unknowns a and c to get the defining equation for 
the oscillation frequency 


v=t/A2+x«2. (20.112) 


While determining the coefficients a,b,c,d, we note that due to the initial 
conditions a+ b= | andc +d = 0. Similar to (20.111) we derive another pair of 
equations from the coefficients of e~'”? 


2vb = xkd—A,b, 


20.11 
2vd=xb+A,d, ee 


which completes the set of four equations required to determine with (20.112) the 
four unknown coefficients 


fy 3 eae 20.114 
poe ga (20.114) 


With this, the solutions (20.110) become 
w(~) = wo cos ve + iwos sinvg, 


20.115 
v(g) = +iwo;, sinvg, ( ) 
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and by multiplication with the complex conjugate and (20.108) we get expressions 
for the coupled beam emittances (€, = 2a,) 


dy = a0 (A; + «* cos* vo) , (20.116) 


— K +2 
ay = x0 452 sin" v@. 


The ratio of maximum values for beam emittances in both planes under the 

influence of linear coupling is from (20.116) 
2 20.117 
a ae oe (20.117) 

The emittance coupling increases with the strength of the coupling coefficient 
and is equal to unity at the coupling resonance or for large values of «. At the 
coupling resonance we observe complete exchange of emittances at the frequency 
v. If on the other hand, the tunes differ and A, 4 0, there will always be a finite 
oscillation amplitude left in the horizontal plane because we started with a finite 
amplitude in this plane. A completely symmetric result would be obtained only for 
a particle starting with a finite vertical amplitude as well. 

We may now collect all results and derive the particle motion as a function of 
time or g. For example, the horizontal particle position is determined from (20.82) 
where we set ,/a, = we '% and further replace w by (20.110). Here, we are 
only interested in the oscillation frequencies of the particle motion and note that 
the oscillatory factor in (20.82) is Re[ei(”+*)]. Together with other oscillatory 


quantities e~#: and w we get both in the horizontal and vertical plane terms with 
oscillatory factors 


Re [rer (20.118) 


where the index wu stands for either x or y. The phase W, = v,g and from (20.100) 
and / = —1 for the difference resonance ¢, = $, + SArg. These expressions used 
in (20.118) define two oscillation frequencies 


Vit = Vey F ZAr =v (20.119) 


or with (20.112) 


Vit = Vey F ZAr + $f A2 + x2. (20.120) 


We have again found the result that under coupling conditions the betatron 
oscillations assume two modes. In a real accelerator only these mode frequencies 
can be measured while close to the coupling resonance. For very weak coupling 
(k ~ 0) the mode frequencies are approximately equal to the uncoupled frequencies 
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Vx, respectively. Even for large coupling this equality is preserved as long as the 
tunes are far away from the coupling resonance or A, > k. 

The mode frequencies can be measured while adjusting quadrupoles such that the 
beam is moved through the coupling resonance. During this adjustment the detuning 
parameter A, varies and changes sign as the coupling resonance is crossed. For 
example, if we vary the vertical tune across a coupling resonance from below, we 
note that the horizontal tune or v; does not change appreciably until the resonance is 
reached, because —A, + J A2 + x? x 0. Above the coupling resonance, however, 
A, has changed sign and v; increase with A,. The opposite occurs with the 
vertical tune. Going through the coupling resonance the horizontal tune has been 
transformed into the vertical tune and vice versa without ever getting equal. 

Actual tune measurements [4] are shown in Fig.20.2 as a function of the 
excitation current of a vertically focusing quadrupole. The vertical tune change is 
proportional to the quadrupole current and so is the parameter A,. While increasing 
the quadrupole current, the vertical tune is increased and the horizontal tune 
stays practically constant. We note that the tunes actually do not cross the linear 
coupling resonance during that procedure, rather the tune of one plane is gradually 
transformed into the tune of the other plane and vice versa. Both tunes never become 
equal and the closest distance is determined by the magnitude of the coupling 
coefficient k. 

The coupling coefficient may be nonzero for various reasons. In some cases 
coupling may be caused because special beam characteristics are desired. In most 
cases, however, coupling is not desired or planned for and a finite linear coupling of 
the beam emittances is the result of rotational misalignments of upright quadrupoles. 
Where this coupling is not desired and must be minimized, we may introduce 
a pair or two sets of rotated quadrupoles into the lattice to cancel the coupling 
due to misalignments. The coupling coefficient (20.95) is defined in the form of 
a complex quantity. Both orthogonal components must therefore be compensated 


tunes -} / 
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| vertical tune ~~ 

0.23 4 me 


“7 
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0.22 4 Y 


0.21 oe ea ee ee horizontal tune 


0.20 4 Kg 
7 «ko ‘S measurements 
0.19 - a 
| oa 
a _“* vertical tune 
0.18 7 P. 
017 __sttength of QD quadrupole 


Fig. 20.2 Measurements of mode frequencies as a function of detuning for linearly coupled 
motion [4] 
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by two orthogonally located skew quadrupoles and the proper adjustment of these 
quadrupoles can be determined by measuring the width of the linear coupling 
resonance. 


Linear Sum Resonance 


To complete the discussion, we will now set / = +1 and get from (20.98) the 
resonance condition for a sum resonance 


by + vy = m,N . (20.121) 


Taking the difference of both Eqs. (20.103), we get 


d 
ie (a,—a) =0, (20.122) 


which states only that the difference of the emittances remains constant. Coupled 
motion in the vicinity of a sum resonance is therefore unstable allowing both 
emittances to grow unlimited. To solve the equations of motion (20.103), (20.104), 
we try the ansatz 


u= Jae'™ +i fae. (20.123) 


From the derivative du/dg, we get with (20.103), (20.104) 


d 
a =i} (A,u—Ku’) , (20.124) 
and for the complex conjugate 
oe = -i} (A, u* +« u) (20.125) 
dy ~~ ; 


Solving these differential equations with the ansatz 
u=ae?+be?, (20.126) 
and the complex conjugate 


u* =ae'? +be?, (20.127) 


we get after insertion into (20.124), (20.125) analogous to (20.111) the oscillation 
frequency 


v= 5 /A2—k?. (20.128) 
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This result shows that motion in the vicinity of a linear sum resonance becomes 
unstable as soon as the detuning is less than the coupling coefficient. The condition 
for stability is therefore 


A, >k. (20.129) 


By a careful choice of the tune difference to avoid a sum resonance and careful 
alignment of quadrupoles, it is possible in real circular accelerators to reduce 
the coupling coefficient to very small values. Perfect compensation of the linear 
coupling coefficient eliminates the linear emittance coupling altogether. However, 
nonlinear coupling effects become then dominant which we cannot compensate for. 


20.4.2 Higher-Order Coupling Resonances 


So far all discussions on coupled motions and resonances have been based on linear 
coupling effects caused by rotated quadrupole fields. For higher-order coupling the 
mathematical treatment of the beam dynamics is similar although more elaborate. 
The general form of the nth-order resonance condition (20.98) is 


Lvy+hvy=mN with — [ie] + || <n. (20.130) 


The factors /, and J, are integers and the sum |/,| + [4,| is called the order of the 
resonance. In most cases it is sufficient to choose a location in the resonance diagram 
which avoids such resonances since circular accelerators are generally designed for 
minimum coupling. In special cases, however, where strong sextupoles are used 
to correct chromaticities, coupling resonances can be excited in higher order. For 
example, the difference resonance 2v, — 2v, has been observed at the 400 GeV 
proton synchrotron at the Fermi National Laboratory [5]. 


20.4.3 Multiple Resonances 


We have only discussed isolated resonances. In general, however, nonlinear fields 
of different orders do exist, each contributing to the stop-band of resonances. A 
particularly strong source of nonlinearities occurs due to the beam-beam effect in 
colliding-beam facilities where strong and highly nonlinear fields generated by one 
beam cause significant perturbations to particles in the other beam. The resonance 
patterns from different resonances are superimposed creating new features of 
particle instability which were not present in any of the resonances while treated 
as isolated resonances. Of course, if one of these resonances is unstable for any 
oscillation amplitude the addition of other weaker resonances will not change this 
situation. 
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Combining the effects of several resonances should cause little change for small 
amplitude oscillations since the trajectory in phase space is close to a circle for 
resonances of any order provided there is stability at all. Most of the perturbations 
of resonance patterns will occur in the vicinity of the island structures. When island 
structures from different resonances start to overlap, chaotic motion can occur and 
may lead to stochastic instability. The onset of island overlap is often called the 
Chirikov criterion after Chirikov [6], who has studied extensively particle motion in 
such situations. 

It is beyond the scope of this text to evaluate the mathematical criteria of multi 
resonance beam dynamics. For further insight and references the interested reader 
may consult articles in [7-10]. A general overview and extensive references can also 
be found in [11]. 


Problems 


20.1 (S). Consider a lattice made of 61 FODO cells with 90° per cell in both planes. 
The half cell length be L = 5m and the full quadrupole length € = 0.2 m. Introduce 
a Gaussian distribution of rotational quadrupole misalignments. Calculate and plot 
the coupling coefficient for the ring and the emittance ratio as a function of the 
rms misalignment. If the emittance coupling is to be held below | % how must the 
lattice be retuned and how well must the quadrupoles be aligned? Insert two rotated 
quadrupoles into the lattice such that they can be used to compensate the coupling 
due to misalignments. Calculate the required quadrupole strength. 


20.2 (S). Use the measurement in Fig. 20.2 and determine the coupling coefficient 
K. 


20.3. Can we rotate a horizontally flat 10 GeV beam by 90° with a solenoid? If yes, 
what is the strength of the solenoid and where along the z-axis do we have a flat 
vertical beam? 


20.4. In circular accelerators rotated quadrupoles may be inserted to compensate 
for coupling due to misalignments. Assume a statistical distribution of rotational 
quadrupole errors which need to be compensated by special rotated quadrupoles. 
How many such quadrupoles are required and what criteria would you use for 
optimum placement in the ring? 


20.5. Consider a point source of particles (e.g. a positron conversion target) on the 
axis of a solenoidal field. Determine the solenoid parameters for which the particles 
would exit the solenoid as a parallel beam. Such a solenoid is also called a A /4-lens, 
why? Let the positron momentum be 10 MeV/c. What is the maximum solid angle 
accepted from the target that can be focused to a beam of radius r = 1cm? What 
is the exit angle of a particle which emerges from the target at a radius of | mm? 
Express the transformation of this A /4-lens in matrix formulation. 
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20.6. Choose a FODO lattice for a circular accelerator and insert at a symmetry 
point a thin rotated quadrupole. Calculate the tilt of the beam cross section at this 
point as a function of the strength of the rotated quadrupole. Place the same skew 
quadrupole in the middle of a FODO half cell and determine if the rotation of 
the beam aspect ratio at the symmetry point requires a stronger or a weaker field. 
Explain why. 


20.7. Assume two cells of a symmetric FODO lattice and determine the betatron 
functions for a phase advance of 90° per cell. Now introduce a rotational mis- 
alignment of the first quadrupole by an angle w which generates coupling of the 
horizontal and vertical betatron oscillations: a.) Calculate and plot the perturbed 
betatron functions 6; and By and compare with the unperturbed solution. b.) If the 
beam emittances are €; = €; mm-mrad, what is the beam aspect ratio and beam 
rotation at the end of cell one and two with and without the rotation of the first 
quadrupole? 


20.8. Use the Fokker-Planck equation and derive an expression for the equilibrium 
beam emittance of a coupled beam 
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Part VIII 
Intense Beams 


Chapter 21 
Statistical and Collective Effects* 


Transverse and longitudinal beam dynamics as discussed in earlier chapters is 
governed by purely single-particle effects where the results do not depend on the 
presence of other particles or any interactive environment. Space-charge effects 
were specifically excluded. This restriction is sometimes too extreme and collective 
effects must be taken into account where significant beam intensities are desired. In 
most applications high beam intensities are desired and it is therefore prudent to test 
for the appearance of space charge and other intensity effects. 

Collective effects can be divided into two distinct groups according to the 
physics involved. The compression of a large number of charged particles into a 
small volume increases the probability for collisions of particles within the same 
beam. Because particles perform synchrotron and betatron oscillations, statistical 
collisions occur in longitudinal, as well as transverse phase space often causing a 
mixing of phase space coordinates. The other group of collective effects includes 
effects which are associated with electromagnetic fields generated by the collection 
of all particles in a beam. 

The study and detailed understanding of the cause and nature of collective effects 
or collective instabilities with corrective measures is important for a successful 
design of the accelerator. Most accelerator design and developments are conducted 
to eliminate collective effects as much as possible through self-imposed limitation 
on the performance or installation of feedback systems and other stabilizing control 
mechanisms. Beyond that, we also must accept limitations in beam performance 
imposed by nature or lack of understanding and technological limits. Pursuit of 
accelerator physics is the attempt to explore and push such limits as far as nature 
and general understanding of the subject allows. 
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21.1 Statistical Effects 


Coupling of individual particles to the presence of other particles may occur through 
very short range forces in collisions with each other. In this section, we will 
discuss statistical effects related to the finite number of particles and from collision 
processes within a particle bunch. 


21.1.1 Schottky Noise 


Electrical current is established by moving charged particles. The finite electrical 
charge and finite number of particles gives rise to statistical variations of the 
electrical current. This phenomenon has been observed and analyzed by Schottky 
[1] and we will discuss this Schottky noise in the realm of particle dynamics 
in circular accelerators. The information included in the Schottky noise is of 
great diagnostic importance for the nondestructive determination of particle beam 
parameters, a technique which has been developed at the CERN Intersecting Storage 
Ring (ISR) [2] and has become a standard tool of beam diagnostics. 

We consider a particle k with charge q orbiting in an accelerator with the angular 
revolution frequency w, and define a particle line density by 27RA(t) = 1 where 
2nR is the circumference of the ring. On the other hand, we may describe the 
orbiting particle by delta functions 


2n +00 
a=af YS S(coxt + 6% — 20m) dd , 
0 


m>=—oo 


where a, is the angular revolution frequency of the particle k and 6; its phase at time 
t = 0. The delta function can be expressed by a Fourier series and the line-charge 
density at time t becomes 


2nR 


n=0 


gA,(t) = [ +2) ‘cos(negt + | (21.1) 


From a pick up electrode close to the circulating particle, we would obtain a 
signal with a frequency line spectrum w = na, where n is an integer. In a real 
particle beam there are many particles with a finite spread of revolution frequencies 
@x and therefore the harmonic lines na, spread out proportionally to n. For not too 
high harmonic numbers the frequency spreads do not yet overlap and we are able to 
measure the distribution of revolution frequencies. Tuning the spectrum analyzer to 
w, we observe a signal with an amplitude proportional to N(@/n) te where N(w/n) 
is the particle distribution in frequency space and 6a the frequency resolution of 
the spectrum analyzer. The signal from the pick up electrode is proportional to the 
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line-charge density which is at the frequency w from (21.1) 


Ghims(@) = v4 4 Ivo /n 2 (21.2) 


and has been derived first by Schottky for a variety of current sources [1]. The 
spread in the revolution frequency originates from a momentum spread in the beam 
and measuring the Schottky spectrum allows its nondestructive determination. 

Individual particles orbiting in an accelerator perform transverse betatron oscil- 
lations which we describe, for example, in the vertical plane by 


Ye(t) = ay COS(Ve@RL + Wr), (21.3) 


where a, is the amplitude and 7; the phase of the betatron oscillation for the particle 
k at time t = 0. The difference signal from two pick up electrodes above and below 
the particle beam is, in linear approximation, proportional to the product of the 
betatron amplitude (21.3) and the line-charge density (21.1) and of the form 


Dx(t)= Ax) cos[(n — vg) (Wet + $0)] (21.4) 


n=0 


+A, y. cos[(n + ve) (@et + Gx)] . 


n=0 


where we have ignored terms at frequencies na,. The transverse Schottky signal is 
composed of two side bands for each harmonic at frequencies 


@ = (n+ Vg) @. (21.5) 


which are also called the fast wave for @ = (n + vx)@g and the slow wave for 
o= (n = VE)Ok. 

The longitudinal Schottky noise depends on the rms contribution of all particles 
which are spread over a range of revolution frequencies due to a momentum spread 
and over betatron frequencies by virtue of the chromaticity. For A@mns = Nc@odrms 
and A’yms = &d:ms where wo is the revolution frequency of the bunch center, dims = 
APprms/po the rms relative momentum error, 7, the momentum compaction and &, 
the vertical chromaticity, the frequency distribution of the signal from the pick up is 


wo = [n+ (vy0 + &dx)] (Wo + Newodx) (21.6) 
= (nt vy) @ + [(n + vy0) Ne + &] wo d% + O(S?). 
The momentum spread 6; causes a frequency spread which is different for the 


slow and fast wave. For example, for positive chromaticity above transition, 7. < 0 
and the frequency spreads add up for the slow wave and cancel partially for the fast 
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wave. This has been verified experimentally for a coasting proton beam in the ISR 
[2]. 

A transverse Schottky scan may exhibit the existence of weak resonances which 
may dilute the particle density, specifically in a coasting proton or ion beam. To 
control coasting beam instabilities, it is desirable to make use of Landau damping 
by introducing a large momentum and tune spread. This tune spread, however, can 
be sufficiently large to spread over higher order resonances and blow up that part 
of the beam which oscillates at those resonance frequencies. A Schottky scan can 
clearly identify such a situation as reported in [2]. 

In this text we are able to touch only the very basics of Schottky noise and the 
interested reader is referred to references [3-6] for more detailed discussions on the 
theory and experimental techniques to obtain Schottky scans and how to interpret 
the signals. 


21.1.2 Stochastic Cooling 


The “noise” signal from a circulating particle beam includes information which can 
be used to drive a feedback system in such a way as to reduce the beam emittance, 
longitudinal as well as transverse. Due to the finite number of particles in a realistic 
particle beam, the instantaneous center of a beam at the location of a pick up 
electrode exhibits statistical variations. This statistical displacement of a slice of 
beam converts to a statistical slope a quarter betatron wavelength downstream. The 
signal from the small statistical displacement of the beam at the pick up electrode 
can be amplified and fed back to the beam through a kicker magnet located an odd 
number of quarter wavelength downstream, assuming that the statistical variations 
do not smear out between pick up electrode and kicker. Van der Meer [7] proposed 
this approach to reduce the transverse proton beam emittance in ISR for increased 
luminosity and the process is now known as stochastic cooling. 

This process of correction is not a statistical process and we must ask ourselves 
if this is an attempt to circumvent Liouville’s theorem. It is not. Due to the finite 
number of particles in the beam, the phase space is not uniformly covered by 
particles but rather exhibits many holes. The method of stochastic cooling detects 
the moment one of these holes appears on one or the other side of the beam in phase 
space. At the same moment, the whole emittance is slightly shifted with respect 
to the center of the phase space and this shift can be both detected and corrected. 
The whole process of stochastic cooling therefore only squeezes the “air” out of the 
particle distribution in phase space. The most prominent application of this method 
occurs in the cooling of an antiproton beam to reach a manageable beam emittance 
for injection into high energy proton antiproton colliders. To discuss this process in 
more detail, theoretically as well as technically, would exceed the scope of this text 
and the interested reader is referred to a series of articles published in [8]. 


21.1 Statistical Effects 705 
21.1.3 Touschek Effect 


The concentration of many particles into small bunches increases the probability for 
elastic collisions between particles. This probability is further enhanced considering 
that particles perform transverse betatron as well as longitudinal synchrotron 
oscillations. In each degree of freedom, we have acceptance limits and if a particle’s 
oscillation amplitude exceeds such limits due, for example, to a collision with 
another particle one or both particles can get lost. In this section, we discuss the 
process of single collisions where the momentum transfer is large enough to lead 
to the loss of both particles involved in the collision and postpone the discussion of 
multiple collisions with small momentum transfer to the next section. 

We may consider two collision processes which could lead to beam loss. First, 
we observe two particles performing synchrotron oscillations and colliding head- 
on in such a way that they transfer their longitudinal momentum into transverse 
momentum. This collision process is insignificant in particle accelerators because 
the longitudinal motion includes not enough momentum to increase the betatron 
oscillation amplitude enough for particle loss. On the other hand, transverse 
oscillations of particles represent large momenta and a transfer into longitudinal 
momenta can lead to the loss of both particles. This effect was discovered on the 
first electron storage ring ever constructed [9, 10] and we therefore call this the 
Touschek effect. 

In this text, we will not pursue a detailed derivation of the collision process 
and refer the interested reader to references [11-13]. Of particular interest is the 
expression for the beam lifetime as a result of particle losses due to a momentum 
transfer into the longitudinal phase space exceeding the rf-bucket acceptance of 
Ap/Po|rr. Whenever such a transfer occurs both particles involved in the collision 
are lost. The beam decay rate is proportional to the number of particles in the 
bunch and the beam current therefore decays exponentially. Last, but not least, a 
loss occurs only if there is sufficient momentum in the transverse motion to exceed 
the rf-momentum acceptance. We assume the momentum acceptance to be limited 
by the rf-voltage and combining these parameters in a collision theory results in a 
beam lifetime for a Gaussian particle distribution given by 


1 1dM oo recM WV 


Die), (21.7) 


T N, dt 87 Ox Oy Of Y? 


where r, is the classical particle radius, 0,,0,,0¢ are the standard values of the 
Gaussian bunch width, height and length, respectively, and A~! = Ap/polis the 
momentum acceptance parameter. The function D(e) (Fig. 21.1) is defined by [13] 


e (* Inu 


D(e)= ve|-te + a ak du (21.8) 


j CO eu 
+5(3e —€ Ine + 2) — du ; 
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Fig. 21.1 Touschek lifetime function D (€) 


where the argument is 


Ap \* : 
ce (=) with o,= 2". (21.9) 
Y Op Bx 


Particle losses due to the Touschek effect is particularly effective at low energies 
and where the rf-acceptance is small. For high particle densities My /(o; 0) o¢) the 
rf-acceptance should therefore be maximized. This seems to be the wrong thing to 
do because the bunch length is reduced at the same time and the particle density 
becomes even higher but a closer look at (21.7) shows us that the Touschek lifetime 
increases faster with rf-acceptance than it decreases with bunch length. 


21.1.4 Intra-Beam Scattering 


The Touschek effect describes collision processes which lead to immediate loss 
of both colliding particles. In reality, however, there are many other collisions 
with only small exchanges of momentum. While these collisions do not lead to 
immediate particle loss, there might be sufficiently many during a damping time 
in electron storage rings or during the storage time for proton and ion beams to 
cause a significant increase in the bunch volume, or in the case of a coasting beam 
an increase in beam cross section. During the discussion of the Touschek effect 
we neglected the transfer from the longitudinal momentum space into transverse 
momentum space because the transverse momentum acceptance is larger than the 
longitudinal acceptance and particles are generally not lost during such an exchange. 
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This is not appropriate any more for the multiple Touschek effect or intra-beam 
scattering where we are interested in all collisions. 

The multiple Touschek effect was observed in the first ever constructed storage 
ring, AdA (Anello di Accumulatione) in Frascati, Italy. The Touschek effect had 
been expected and analyzed before but did give too pessimistic beam lifetimes 
compared to those observed in AdA. A longer beam lifetime had been obtained 
because of multiple elastic scattering between particles increasing the bunch volume 
and thereby reducing the Touschek effect [10]. 

During the exchange of momentum as a consequence of collisions between 
particles within the same bunch or beam, each degree of freedom can increase its 
energy or temperature because the beam is able to absorb any amount of energy 
from the rf- system. We are particularly interested in the growth times of transverse 
and longitudinal emittances to asses the long-term integrity of the particle beam. 
The multiple Touschek effect or intra-beam scattering has been studied extensively 
[14, 15] and we will not repeat here the derivations but merely recount the results. 

The growth time of the beam emittances for Gaussian particle distributions are 
for the longitudinal phase space or momentum and bunch distribution [14, 15] 


1 doy op 

—1 P h 
= —— =A —f(a,),c), 21.10 
2 20, dt ont 2 ( ) 


where the particle bunch density is expressed by 


rcNp 
64170, Op Ox Oy Oy Oy BF y4 


(21.11) 


with the standard dimensions of a Gaussian distribution for the bunch length o-, the 
relative momentum spread o,, horizontal and vertical betatron amplitudes (ox, oy) 
and divergences (ov, oy’) and number of particles per bunch N,. The constants 7, 
and B = v/c, finally, are the classical particle radius and velocity in units of the 
velocity of light. 

The function 


fla,b a= sx f'tin[ S (=+-)]-0s7 ea (21.12) 
UY; — 5 2 VP Va . . Pa ‘i 


where 


pH=@?+xr(1—-@), q=P+x(1-D’), 


a= 2, b= 2 
Voy” yoy 
2 ed 5 
OF On A 2 _ p22 V2may 
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The transverse emittance growth times are similarly given by 


1 do? 1b coy 
tis o = al¢(-,7,5) 4+ e209], (21.13) 
202 dt aaa Ox 
and 
-_ 1 de _,,(lae (21.14) 
y= ag ae AT pea) 


These expressions allow the calculations of the emittance growth rate, which 
for most electron accelerators is small compared to radiation damping but become 
significant in proton and ion storage rings where high particle densities and long 
storage times are desired. Progress in the design of modern synchrotron radiation 
facilities allow ever smaller emittances which have reached a level where intra- 
beam scattering is significant again. From the density factor A it is apparent that high 
particle density in six-dimensional phase space increases the growth rates while this 
effect is greatly reduced at higher beam energies. 


21.2 Collective Self Fields 


The electric charges of a particle beam can become a major contribution to the 
forces encountered by individual particles while travelling along a beam transport 
line or orbiting in a circular accelerator. These forces may act directly from beam 
to particle or may originate from electromagnetic fields being excited by the beam 
interaction with its surrounding vacuum chamber. In this section, we will derive 
expressions for the fields from a collection of particles and determine the force due 
to these fields on an individual test particle. We use the particle charge q rather than 
the elementary charge e to cover particles with multiple charges like ions for which 
q = eZ. For all cases to be correct, we should distinguish between the electrical 
charge of particles in the beam and that of the individual test particle. This, however, 
would significantly complicate the expressions and we use therefore the same charge 
for both the beam and test particle. In a particular situation whenever particles of 
different charges are considered, the sign and value of the charge factors in the 
formulas must be reconsidered. 

Individual particles in an intense beam are under the influence of strong repelling 
electrostatic forces creating the possibility of severe stability problems. Particle 
beam transport over long distances could be greatly restricted unless these space- 
charge forces can be kept under control. First, it is interesting to calculate the 
magnitude of the problem. 

If all particles would be at rest within a small volume, we would clearly expect 
the particles to quickly diverge from the center of charge under the influence of the 
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repelling forces from the other particles. This situation may be significantly different 
in a particle beam where all particles propagate in the same direction. 

In Sect. 1.5.10 we obtained the encouraging result that at least relativistic particle 
beams become stable under the influence of their own fields. For lower particle 
energies, however, significant diverging forces must be expected and adequate 
focusing measures must be applied. The physics of such space charge dominated 
beams is beyond the scope of this book and is treated elsewhere, for example in 
considerable detail in [16]. 


21.2.1 Self Field for Elliptical Particle Beams 


The self fields of a beam depend on beam parameters like particle type, particle 
distribution, bunching, and energy of the particle. Here, we will derive the nature 
and effect of these self fields in a more restricted way for common particle beam 
cases in accelerators. 

To determine self fields, we consider a continuous beam of particles with a line 
charge A, or a volume charge p(x, y). The electric fields within a beam are derived 
from a potential V defined by 


1 
AV = —-—p(x,y), (21.15) 
i) 


where p, being the electric charge density in the beam, is finite within and zero 
outside the beam. Similarly, the magnetic vector potential is defined by 


1 
AA = ——vp (x,y). (21.16) 
0 


For a particle beam, we may set v ~ (0,0, v) and the vector potential therefore 
contains only a longitudinal component A = (0, 0, Az). 

In Sect. 1.5.10 we discussed the self fields of a round beam. Generally, however, 
particle beams have an elliptical cross section and the solution to (21.15) for such 
a beam with constant charge density (o = const) has been derived by Teng [17, 
18]. Within the elliptical beam cross section, where x < a and y < J, the electric 
potential is 


Oe eer aes (21.17) 
pie en a+b aS . 


and a, b are the horizontal and vertical half axis respectively. The vector potential 
for the magnetic field is from the discussions above 


t wa [x 
A,(x, y) = — 21.18 
2(x, y) Perle +3 (21.18) 
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and both the electric and magnetic field can be derived by simple differentiations 


E =-VV and B=VxA (21.19) 
for 
1 4qr 1 Agr 
£ = 2 —— 8 Ly = a (21.20) 
4m€) a(a + b) ; 4m€9 bD(a + b) 
and 
4qr 4qgr 
baa lo. SaN8 y, Bia A (21.21) 
4n b(a+b) : 4a a(a+b) 


where 6 = v/c and the linear charge density A is defined by 
A = mab p(x, y). (21.22) 


Comparing (21.20) and (21.21) reveals the relationship between electric and 
magnetic self fields of the beam to be (21.22) 


cB, = —BE,, cB, = +BEy. (21.23) 


The electric as well as the magnetic field scales linearly with distance from the beam 
center and therefore both cause focusing and a tune shift in a circular accelerator. 
In many applications it is not acceptable to assume a uniform transverse charge 
distribution. Most particle beams either have a bell shaped particle distribution or a 
Gaussian distribution as is specially the case for electrons in circular accelerators. 
We therefore use in the transverse plane a Gaussian charge distribution given by 


je es (21.24) 
x,y) = ——— exp| -—~ -=— ]. ; 
ee 21005 " 202 20? 


which also well describes a beam with bell shaped distribution. Although many 
particle beams, but specifically electron beams, come in bunches with a Gaussian 
distribution in all degrees of freedom, we will only introduce a bunching factor for 
the longitudinal particle distribution and refer the interested reader for the study of 
a fully six dimensional Gaussian charge distribution to reference [19]. 

The potential for a transverse bi-Gaussian charge distribution (21.24) can be 
expressed by [18] 


V(x, y) = 


oe 
e 7 2e2+) 262+) a (21.25) 


ae ((o2 + (02 +18) 
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Equation (21.25) can be verified by back insertion into (21.15). From this potential 
we obtain for example the vertical electric field component by differentiation 


et ae ears cee 
_ dV(x,y) oe ay f P| 26749 — Woz+H 


y= dt. (21.26) 
by Aro 5 + D2 +063 +9) 


0 


No closed analytical expression exists for these integrals unless we restrict 
ourselves to a symmetry plane with x = 0 or y = 0 and small amplitudes y < oy 
or x < 0, respectively. These assumptions are appropriate for most space-charge 
effects and the potential in the vertical midplane becomes 


A 2 


Vix =0, ~~ Amey Oy(Oy +0)> 
(x y <oy) 4m €9 0y(o, + oy) : 


(21.27) 


For reasons of symmetry a similar expression can be derived for the horizontal 
mid plane by merely interchanging x and y in (21.27). The associated electric fields 
are for x = Oandy < oy 


1 2a 1 2a 
x= %, y= TY), (21.28) 
Are Ox(0x + Oy) “ — Arre€o Oy(Ox + Oy) 
and the magnetic fields according to (21.23) are from (21.28) 
2a 2A 
[ee a a p24 —_ (21.29) 


© Am 0\(o, + oy) 4 0x(0x + Oy) 

All fields increase linearly with amplitude and we note that the field components 
in the horizontal midplane are generally much smaller compared to those in the 
vertical midplane because most particle beams in circular accelerators are flat and 
Oy K Ox. 


Forces from Space-Charge Fields 


The electromagnetic self fields generated by the collection of all particles within a 
beam exert forces on individual particles of the same beam or of another beam. The 
Lorentz force due to these fields can be expressed by 


F = cEf, + e[v x B ff. (21.30) 


where we have added to the usual expression for the Lorentz force the factors fz 
and fy. Because the fields act differently depending on the relative directions and 
charge of beam and individual particle distinct combinations occur. We set f. = 1 if 
both the beam particles and the test particle have the same sign of their charge and 
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Table 21.1 Self field force 
factors 


+-— tt 
—+ tt 


+(1=6") —(1 +B?) 


fe = —1 if their charges are of opposite sign. Similarly we set f, = 1 or fy = —1 
depending on whether the beam and test particle have the same or opposite direction 
of movement with respect to each other. 

The vertical force from the self field, for example, of a proton beam on 
an individual proton within the same beam moving with the same velocity is 
from (21.30) 


Fy(qp4+) = +e — B°)Ey. (21.31) 


An antiproton moving in the opposite direction through a proton beam would feel 
the vertical force 


Fy(4y.4—-) = —e(1 + BE. (21.32) 


Expansion to other combinations of particles and directions of velocities are 
straightforward. For ions the charge multiplicity Z must be added to the fields or 
the individual particle or both depending on the case. The possible combinations of 
the force factors +(1 + 8?) are summarized in Table 21.1. 

The +-signs in Table 21.1 indicate the charge polarity of beam and test particle 
and the arrows the relative direction. We note a great difference between the case, 
where particles move in the same direction, and the case of beams colliding head 
on. 


21.2.2 Beam—Beam Effect 


In colliding beam facilities two counter rotating beams within one storage ring or 
counter rotating beams from two intersecting storage rings are brought into collision 
to create a high center of mass energy at the collision point which transforms into 
known or unknown particles to be studied by high energy experimentalists. The 
event rate is given by the product of the cross section for the particular event and the 
luminosity which is determined by storage ring operating conditions. By definition, 
the luminosity is the density of collision centers in the target multiplied by the 
number of particles colliding with this target per unit time. In the case of a colliding 
beam facility a bunch of one beam is the target for the other beam. For simplicity 
we assume here that both beams have the same cross section. We also assume that 
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each beam consists of mp bunches. In this case the luminosity is 


N 
L= ——Noveey, (21.33) 
npA 


where N and N> are the total number of particles in each beam, A the cross section 
of the beams, and v,.y the revolution frequency in the storage ring. In most storage 
rings the transverse particle distribution is Gaussian or bell shaped and since only the 
core of the beam contributes significantly to the luminosity we may define standard 
beam sizes for all kinds of particles. For a Gaussian particle distribution the effective 
beam cross section is 


Ag = 4110,0) (21.34) 
and the luminosity 
N: 
f=-_—* _Niie. (21.35) 
470,.0, B 


The recipe for high luminosity is clearly to maximize the beam intensity and 
to minimize the beam cross section. This approach, however, fails because of 
the beam-beam effect which, due to electromagnetic fields created by the beams 
themselves, causes a tune shift and therefore limits the amount of beam that can 
be brought into collision in a storage ring. The beam-beam effect has first been 
recognized and analyzed by Amman and Ritson [20]. 

In case of counter rotating beams colliding at particular interaction points in a 
colliding-beam facility, we always have f, = —1 but the colliding particles still may 
be of equal or opposite charge. In addition, there is no contribution from magnetic 
image fields since collisions do not occur within magnets. Even image fields from 
vacuum chambers are neglected because the beam-beam interaction happens only 
over a very short distance. A particle in one beam will feel the field from the 
other beam only during the time it travels through the other beam which is equal 
to the time it takes the particle to travel half the effective length of the oncoming 
bunch. With these considerations in mind, we obtain for the beam-beam tune shift 
in the vertical plane from (21.63) with foo = 1 and assuming head on collisions of 
particle-antiparticle beams (f. = —1) 


< Mo p. 
A= (21.36) 
2nBy oy (of + o;') 
and in the horizontal plane 
< Mto : 
Avia NS _—_Pe (21.37) 


2nBy of (G+ +0*) 
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where * indicates that the quantities be taken at the interaction point. In cases 


where other particle combinations are brought into collision or when both beams 
cross under an angle these equations must be appropriately modified to accurately 
describe the actual situation. 

From (21.32) and (21.28)we find for two counter rotating beams of particle and 
antiparticle a vertical beam-beam force of 


1 e(1 + B2)2a 


= 21.38 
4m€9 0y(0, + oy) ( ) 


fy= 


This force is attractive and therefore focusing, equivalent to that of a quadrupole of 
strength 


PF, 
eT (21.39) 
c?B2ym 
causing a vertical tune shift of 
1 
bvy = — | Bykdz. (21.40) 
7 4n / 


coll 


Integrating over the collision length which is equal to half the bunch length ¢ 
because colliding beams move in opposite directions, we note that the linear charge 
density is A = eN/ (BC), where N is the total number of particles per beam and 
B the number of bunches per beam. With these replacements the beam tune shift 
becomes finally 


es (21.41) 
 — 2NBy0y(0x + Oy) 
where r, is the classical particle radius of the particle which is being disturbed. 
Obviously, the tune shift scales linear with particle intensity or particle beam current 
and inversely with the beam cross section. Upon discovery of this effect it was 
thought that the particle beam intensity is limited when the tune shift is of the order 
of + 0.15 — 0.2 which is the typical distance to the next resonance. Experimentally, 
however, it was found that the limit is much more restrictive with maximum tune 
shift values of ~ 0.04 — 0.06 for electrons [20-23] and less for proton beams [24]. 
A definitive quantitative description of the actual beam-beam effect has not 
been possible yet due to its highly nonlinear nature. Only particles with very 
small betatron oscillation amplitudes will experience the linear tune shift derived 
above. For betatron oscillations larger than one o, however, the field becomes very 
nonlinear turning over to the well known 1/r-law at large distances from the beam 
center. 
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In spite of the inability to quantitatively describe the beam-beam effect by the 
linear tune shift it is generally accepted practice to quantify the beam-beam limit 
by the value of the linear tune shift. This is justified since the nonlinear fields of a 
particle beam are strictly proportional to the linear field and therefore the linear tune 
shift is a good measure for the amount of nonlinear fields involved. 


21.2.3. Transverse Self Fields 


Expressions for space-charge fields originating from a beam of charged particles 
have been derived earlier and we obtained for a Gaussian transverse distribution 
of particles with charge q the electric fields in (21.28) and the magnetic fields 
in (21.29). 

The local linear particle density A is defined by 


A(z) = Jf eeeraevay. (21.42) 


where p(x, y,z) is the local particle density normalized to the total number of 
particles in the beam | eee A(z)dz = Np. With these fields and the Lorentz equation, 
we formulate the transverse force acting on a single particle within the same particle 
beam. Since both expressions for the electrical and magnetic field differ only by the 
factor B we may, for example, derive from the Lorentz equation the vertical force 
on a particle with charge q 


1 2qr 


F, = q(— B’)Ey = . 
y=q— BYE, havo Fo) 


(21.43) 


The space-charge force appears at its strongest for nonrelativistic particles 
and diminishes quickly like 1/y? for relativistic particles. In accelerator physics, 
however, particle beams are carried from low to high energies and therefore space- 
charge effects may become important during some or all phases of acceleration. This 
is specifically true for heavy particles like protons and ions for which the relativistic 
parameter y is rather low for most any practically achievable particle energies. 


21.2.4 Fields from Image Charges 


Discussing space charges, we ignored so far the effect of metallic and magnetic 
surfaces close to the beam. The electromagnetic self fields of the beam circulating 
in a metallic vacuum chamber and between ferromagnetic poles of magnets must 
meet certain boundary conditions on such surfaces. Laslett [25] derived appropriate 
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corrections to free space electromagnetic fields by adding the electromagnetic fields 
from all image charges to the fields of the particle beam itself. 

Following his reasoning, we consider a particle beam with metallic and ferromag- 
netic boundaries as shown in Fig. 21.2. For full generality, let the elliptical particle 
beam be displaced in the vertical plane by y from the midplane, the metallic vacuum 
chamber and magnet pole are simulated as pairs of infinitely wide parallel surfaces 
at +b and +g, respectively, and the observation point of the fields be at y. The linear 
particle density is 


Mo Ng 
A=“ = ak (21.44) 


nmyly — nyV 2700 


where Not is the total number of particle in the circulating beam, m, the number of 
bunches, ¢, = JV 270% the effective bunch length and o, the standard bunch length 
for a Gaussian distribution. 

The locations and strength of the electrical images of a line current in the 
configuration of Fig.21.2 are shown in Fig.21.3. The boundary condition for 
electric fields is E,(b) = O on the surface of the metallic vacuum chamber and 
is satisfied if the image charges change sign from image to image. To calculate the 


Fig. 21.2 Particle beam with 
metallic and ferromagnetic 
boundaries 


reference 


vacuum chamber path 


magnet pole 
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Fig. 21.3 Location and 5b 
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electrical field E,(y), with (21.28) we add the contributions from all image fields in 
the infinite series 


2A (21.45) 


Ey image (V) = Ame 
0 


1 1 1 1 
x = = = = = =F = 
(a> 2b+y+y 4b+y-y 4b-y+y 
1 1 1 1 


+ = - = = = ++ = 
6b—y-y 6b+y+y 8b+y-y 8b-y+y 


1 1 
~<a. eee 


These image fields must be added to the direct field of the line charge to meet 
the boundary condition that the electric field enter metallic surfaces perpendicular. 
Equation (21.45) can be split into two series with factors (y + y) and (y — y) in the 
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numerator. We get after some manipulations with y+ y << bandy—y <b 


oe Oe ee ay 21.46 
'y,image (Y) Ame re) (2m = 1? + ine ; ( ‘ ) 
m=1 m=1 
1 AT. ae a 
- ian [Ste +O- Ma] 
1 Aw 1 4gA 
So Oye = a, 
ah! ae, 


The electric image fields depend linearly on the deviations y and y from the axis 
of bunch center and test particle, respectively, and act therefore like a quadrupole 
causing a tune shift. 

A similar derivation is used to get the magnetic image fields due to ferromagnetic 
surfaces at +g above and below the midplane. The magnetic field lines must enter 
the magnetic pole faces perpendicular and the image currents therefore flow in the 
same direction as the line current causing a magnetic force on the test particle which 
is opposed to that by the magnetic field of the beam itself. 

Bunched beams generate high frequency electromagnetic fields which do not 
reach ferromagnetic surfaces because of eddy current shielding by the metallic 
vacuum chamber. For magnetic image fields we distinguish therefore between dc 
and ac image fields. The dc Fourier component of a bunched beam current is equal 
to twice the average beam current c6AB, where the Laslett bunching factor B is the 
bunch occupation along the ring circumference defined by 


X nyly 
Le = : 
A 2nR 


(21.47) 


The dc magnetic image fields are derived similar to electric image fields with 
By = —2AB/r from (21.29) and are with (21.47) 


cio 2AB | ay +y A -y 
By ima e,dc = ————B a ar 21.48 
image.de () 4n g? p (2m — 1)? 7" dX 4m? ( ) 
cuo ABT a x 
= Se Ep pa — yy 
ee |o+y a Wa 
Clo 4A 2 
ge! OP en(25 +9). 
4n g 


The magnetic image fields must penetrate the metallic vacuum chamber to reach 
ferromagnetic poles. This is no problem for de or low frequency field components 
but in case of bunched beams relevant frequencies are rather high and eddy current 
shielding of the vacuum chamber for ac magnetic fields must be taken into account. 
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In most cases we may assume that they do not penetrate the thick metallic vacuum 
chamber. Consequently, we ignore here the effect of ferromagnetic poles and 
consider only the contribution of magnetic ac image fields due to eddy currents in 
vacuum chamber walls. Similar to electric image fields, the magnetic image fields 
are in analogy to (21.46) 


CcLL0 Ap a S 
By image,ac(¥) = -— pe — BB + y), 
Clio 4A 7 
= Ho BE 1 Be 29-+y), (21.49) 


where the factor (1 — B) accounts for the subtraction of the dc component BAB. 
Similar to the electric image fields, the magnetic image fields must be added to the 
direct magnet fields (21.29) from the beam current to meet the boundary condition 
of normal field components at ferromagnetic surfaces. The coefficients €; and €2 are 
the Laslett form factors which are for infinite parallel plate vacuum chambers and 
magnetic poles 


a a 


€, = — and €2 


=_, 21.50 
48 24 ery 


The vacuum chamber and ferromagnetic poles are similar to infinitely wide surfaces. 
While this is a sufficiently accurate approximation for the magnet poles, corrections 
must be applied for circular or elliptical vacuum chambers. Laslett [25] has derived 
what we call now Laslett form factors for vacuum chambers with elliptical cross 
sections and variable aspect ratios which are compiled in Table 21.2. 

All relevant field components have been identified and we collect these fields first 
for y = 0 and obtain from (21.28), (21.46) for the electric field in the vertical mid 
plane 


C7 [Lo 2a 20, (0, + Oy) 
Ey(y) = 1 —_ : 21.51 
4n Oy(Ox + Oy) - b? «| 
From (21.29), (21.48) the dc magnetic field is 
clo 2A BB 20;(0% sr Oy) 
Bye = — 1- — - 21.52 
.d dy Oy (o, + Oy) 2 €21y ( ) 


siestarreoaree o/b x1 [oe 

incoherent tune shift form 

factors for elliptical vacuum ai Lo | SOE ee 

chambers *a is the horizontal and b the vertical half-axis of an 
elliptical vacuum chamber 
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and from (21.49) the ac magnetic field 


Byac = 


clo 2AB 20y(6x + dy 

ae ate) E + pp a| (1—B)y. (21.53) 
Tacitly, we have assumed that the transverse particle distribution is Gaussian which 
is a true representation of an electron beam but may not be correct for proton or ion 
beams. The standard deviations o of a Gaussian distribution are very well defined 
and can therefore be replaced by other quantities like the full-width half maximum 
or as the particle distribution may require. 

The electromagnetic force due to space charge on individual particles in a beam 
has been derived and it became obvious that image field effects can play a significant 
role in the perturbation of the beam. The fields scale linear with amplitude for very 
small amplitudes and act therefore like focusing quadrupoles. At larger amplitudes, 
however, the fields reach a maximum and then evanesce like 1/r. Consequently, the 
field gradient is negative decaying quickly with amplitude. 

A complete set of direct and image fields have been derived which must be 
considered to account for space-charge effects. Similar derivations lead to other 
field components necessary to determine horizontal space-charge forces. In most 
accelerators, however, the beam cross section is flat and so is the vacuum chamber 
and the magnet pole aperture. As a consequence, we expect the space-charge forces 
to be larger in the vertical plane than in the horizontal plane. 


21.2.5  Space-Charge Effects 


The Lorentz force on individual particles can be calculated from the space-charge 
fields and we get 


1 2fAC — BA) 


~ Arey Oy(Ox + Oy) 


y Foon Y = GF y, (21.54) 


where the correction factor due to image fields is with B?y? = y? — 1, 


20, (0% + Oy 
b2 


b2 
Posie dell +07 — DB] + a? - 5B (21.55) 


and 


1 2fA(0— BA) 


Are) 0y(0x + Oy) 


feorr » (21.56) 


The factors f, and f, determine signs depending on the kind of particles interacting 
and the direction of travel with respect to each other. Specifically, f, = sign(q qp) 
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where q is the charge of a test particle and q the charge of the field creating particles, 
e.g. the charge of a bunch. Similarly, f, = sign(vvp) where v is the direction of 
travel for the test particle and vy, the direction of travel of the bunch. To calculate the 
space-charge force of head-on colliding proton and antiproton beams, for example, 
we would set f, = —1 and f, = —1. 

There is a significant cancellation of two strong terms, the repulsive electrical 
field and the focusing magnetic field, expressed by the factor 1 — f? for space- 
charge forces within a highly relativistic beam. This cancellation can be greatly 
upset if particle beams become partially neutralized by collecting other particles of 
opposite charge within the beams potential well. For example, proton beams can 
trap electrons in the positive potential well as can electron beams trap positive ions 
in the negative potential well. To avoid such partial neutralization and appearance 
of unnecessarily strong space-charge effects, clearing electrodes must be installed 
over much of the ring circumference to extract with electrostatic fields low energy 
electrons or ions from the particle beam. 

The electromagnetic space-charge force on an individual particle within a particle 
beam increases linearly with its distance from the axis. A similar force occurs for 
the horizontal plane and both fields therefore act like a quadrupole causing a tune 
shift. This has been recognized and analyzed early by Kerst [26] and Blewett [27]. 
A complete treatment of space charge dominated beams can be found in [16]. The 
equation of motion under the influence of space charge forces can be written in the 
form 


Fy 


0 
my u+ Du = 
Ou 


u with u= (x,y). (21.57) 
We get the regular form u” + (ko + Ak) u = 0 with i = u" (cB) and f, = 1, where 
ko describes the quadrupole strength and the space-charge strength is expressed by 


_ 1 OF, _ 2K Xr 
~ me?yB? du B23 oy (0% + oy) 


Soorr (21.58) 


where r, is the classical particle radius. For ions with charge multiplicity Z and 
atomic number A the classical particle radius is rion = rp Vie /A. 


Space Charge Dominated Beams 


So far, space-charge effects or space-charge focusing has been consistently 
neglected in the discussions on transverse beam dynamics. In cases of low beam 
energy and high particle densities, it might become necessary to include space- 
charge effects. They are defocusing in both planes and compensation therefore 
requires additional focusing in both planes. However, it should be noted that 
particles closer to the beam surface will not experience the same linear space-charge 
defocusing as those near the axis and therefore a compensation of space-charge 
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focusing works only for part of the beam. Here, we will not get involved with the 
dynamics of heavily space charge dominated particle beams but try to derive a 
criterion by which we can decide whether or not space-charge forces are significant 
in transverse particle beam optics. 

This distinction becomes obvious from the equation of motion including space 
charges. From (21.57), (21.58) we get the equation of motion 


ii 2re Xr 
uo+|ko- yfeom u= 0, (21.59) 


py? Oy (0x + oy 


where we ignored the image current corrections. Space-charge forces can be 
neglected if the integral of the space-charge force over a length L which is 
characteristic for the average distance between quadrupoles in the beam line is small 
compared to the typical integrated quadrupole length ko¢,or if 


2 | fe dz <K kolg (21.60) 
f Y L Oy(0> Oy) 


The effect of space-charge focusing is most severe where the beam cross section 
is smallest and (21.60) should therefore be applied specifically to such sections of 
the beam transport line. Obviously, the application of this formula requires some 
subjective judgement as to how much smaller space-charge effects should be. To 
aid this judgement, one might also calculate the average betatron phase shift caused 
by space-charge forces and compare it with the total phase advance along the beam 
line under investigation. In this case we look for 


2re Bud feorr 
py L Oy(Ox + Oy) 


dz K Wo(L) (21.61) 


to determine the severity of space-charge effects. The nominal phase advance 
Wo,u(L) is defined such that Wo,,(0) = 0 at the beginning of the beam line. 


Space-Charge Tune Shift 


Space-charge focusing may not significantly perturb the lattice functions but may 
cause a big enough tune shift in a circular accelerator moving the beam onto a 
resonance. The beam current is therefore limited by the maximum allowable tune 
shift in the accelerator which is for a linear focusing force F(z) given by 


To 


1 Lint 
Av, = -—-=— F(z)Bu dz. 21.62 
v= gage | FOB sd (21.62) 


The integration in (21.62) is taken over that part of the path in each revolution where 
the force is effective. For the effect on particles within the same beam this is the 
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circumference and for the beam-beam effect it is the total length of all head on 
collisions per turn. 

The tune shifts are not the same for all particles due to the nonuniform charge 
distribution within a beam. Only particles close to the beam center suffer the 
maximum tune shift while particles with increasing betatron oscillation amplitudes 
are less affected. The effect of space charge therefore introduces a tune spread rather 
than a specific tune shift and we refer to this effect as the incoherent space-charge 
tune shift. 

As a particular case, consider the space-charge tune shift of a particle within a 
beam of equal species particles. Applying the Lorentz force (21.54) with (21.56) the 
space-charge tune shift becomes from (21.62) 


eet B°f\) Bu 


2n 0, (0x + Oy) 


Avuse = Foor AZ, (21.63) 


where the local linear particle density A is defined by (21.44). 
The maximum incoherent space-charge tune shift is from (21.63) with f, = 1, 


fo = 1, I — B*) = 1/y? and (21.56) 


x 2nR 7 
AVu,sc,incoh = ee / Pu dz (21.64) 
2nB?y? | Jo Ful(Ox + Oy) 
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where the integration length Ly. is equal to the total length of the vacuum chamber 
and Lmag is the total length of magnets along the ring circumference. Note, however, 
that this last term appears only at low frequencies because of eddy-current shielding 
in the vacuum chamber at high frequencies. Observing the tune on a betatron side 
band at a high harmonic of the revolution frequency may not exhibit a tune shift due 
to this term while one might have a contribution at low frequencies. 

A coherent space-charge tune shift can be identified by setting y = y in the field 
expressions (21.46), (21.48), (21.49) to determine the fields at the bunch center. The 
calculation is similar to that for the incoherent space-charge tune shift except that 
we define new Laslett form factors for this case 


= (21.65) 


for the image fields from the magnetic pole and form factors €; which depend on 
the aspect ratio of an elliptical vacuum chamber (Table 21.3). 


(21.66) 
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elliptical vacuum chambers 
*a is the horizontal and b the vertical half-axis of an 
elliptical vacuum chamber 


The coherent space-charge tune shift is analogous to (21.64) 


A mA / rk Bu d (21.67) 
Vu,sc,coh = — DG a = Re : 
ee 2n By? | Jo — Fu(Ox + Oy) : 
Lyac o Imag - Lb? 
+2(1 + pe) f ps dz + 2py° | p i“ a : 
0 0 


In both cases, we may simplify the expressions significantly for an approximate 
calculation by applying smooth approximation 8, ~ R/V, and assuming a uniform 
vacuum chamber and magnet pole gaps. With these approximations, (21.63) 
becomes 


_ Lite Mot R fp el _ B’f.) ( feorr) 


Aties = a —; 21.68 
Ma 27 Vo, B p2y 0, (Ox + Oy) : 
where 
5, (Ge + 5 b 
(feorr) =1+ re av + B°y’B) - cb?y 8 | ; (21.69) 


Symbols with an overbar are the values of quantities averaged over the circumfer- 
ence of the ring and vo, is the unperturbed tune in the plane (x, y). The incoherent 
tune shift (21.64) becomes then 

Te Mot R 1 
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where 7) = Lag / (27R) is the magnet fill factor and the coherent tune shift (21.67) 
becomes 


re Mo R 1 
2 V9,B p2y3 Ou (O~ + Oy) 
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(21.71) 
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The tune shift diminishes proportional to the third power of the particle energy. As 

a matter of fact in electron machines of the order of | GeV or more, space-charge 
tune shifts are generally negligible. For low energy protons and ions, however, this 
tune shift is of great importance and must be closely controlled to avoid beam loss 
due to nearby resonances. While a maximum allowable tune shift of 0.15—0.25 
seems reasonable to avoid crossing a strong third order or half-integer resonance, 
practically realized tune shifts can be significantly larger of the order 0.5—0.6 [28- 
30]. Independent of the maximum tune shift actually achieved in a particular ring, 
space charge forces ultimately lead to a limitation of the beam current. 


21.2.6 Longitudinal Space-Charge Field 


Within a continuous particle beam travelling along a uniform vacuum chamber 
we do not expect longitudinal fields to arise. We must, however, consider what 
happens if the longitudinal charge density is not uniform since this is a more realistic 
assumption. For the case of a round beam of radius 7p in a circular vacuum tube of 
radius ry (Fig. 21.4), the fields can be derived by integrating Maxwell’s equation 
VxE=- B and with Stoke’s law 


ed =—> | Bas, (21.72) 


where dA is an element of the area enclosed by the integration path s. The integration 
path shown in Fig. 21.4 leads to the determination of the electrical field Eo in the 
center of the beam. 


Eo 
Soper reer pa eeeeeaeee 
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1 integration path 4 
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vacuum chamber 


Fig. 21.4 Space-charge fields due to a particle beam travelling inside a circular metallic vacuum 
chamber 


726 21 Statistical and Collective Effects* 


Integrating the |.h.s. of (21.72) along the integration path we get with (21.28) fora 
round beam (r = 0) 


n 


E,Az+ i; E,(z + Az)dr — E-wAz -[ E,(z)dr (21.73) 
0 0 
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where a Taylor’s expansion was applied to the linear particle density A(z + Az) and 
only linear terms were retained. E,, is the longitudinal electrical field on the vacuum 
chamber wall. 

For the r.h.s. of (21.72) we use the expressions for the magnetic field (21.29) and 
get with [ B,dA = Az { Bydr 


w\ OA w\ OA 
—Pg(1 42%) Fac = p'g(1 + 2m) Faz (21.74) 
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while using the continuity equation 
oA oA 
= = =0. 21.75 
ae? nae) 
The longitudinal space-charge field is therefore 
q 1 rw \ 0A 
E. = Ex — —+ {14+ 2In— | — (21.76) 
‘ i Are y? ro ) Oz 


and vanishes indeed for a uniform charge distribution because E,, = 0 for a dc 
current. However, variations in the charge distribution cause a longitudinal field 
which together with the associated ac field in the vacuum chamber wall, acts on 
individual particles. 

The perturbation of a uniform particle distribution in a circular accelerator is 
periodic with the circumference of the ring and we may set for the longitudinal 
particle distribution keeping only the nth harmonic for simplicity 


Aether , (21.77) 


where w, is the nth harmonic of the perturbation (@, = n@o). Of course a real beam 
may have many modes and we need therefore to sum over all modes n. In case of 
instability, it is clear that the whole beam is unstable if one mode is unstable. 

With the derivative dA/dz, smooth approximation and 6 = z/R with R the 
average ring radius an integration of (21.76) around the circular accelerator gives 
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the total induced voltage due to space-charge fields 


i _ I, 2m0n Te, eee 
Vo = 20RE wy — i —; [1+2In—]e eee (21.78) 
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In this expression we have also introduced the nth harmonic of the beam-current 
perturbation J, = BcgdA,. Equation (21.78) exhibits a relation of the induced 
voltage to the beam current. Borrowing from the theory of electrical currents, it 
is customary to introduce here the concept of a frequency dependent impedance 
which will become a powerful tool to describe the otherwise complicated coupling 
between beam current and induced voltage. We will return to this point in Chap. 22. 


21.3  Beam-Current Spectrum 


In the last section a beam stability issue appeared based on instantaneous current 
variations. This is particularly true in circular accelerators where the particle 
distribution is periodic with the circumference of the ring. On one hand, we have 
an orbiting particle beam which constitutes a harmonic oscillator with many eigen- 
frequencies and harmonics thereof and on the other hand, there is an environment 
with a frequency dependent response to electromagnetic excitation. Depending on 
the coupling of the beam to its environment at a particular frequency, periodic 
excitations occur which can create perturbations of particle and beam dynamics. 
This interaction is the subject of this discussion. In this text, we will concentrate in 
Chap. 22 on the discussion of basic phenomena of beam-environment interactions 
or beam instabilities. For a more detailed introduction into the field of beam 
instabilities, the interested reader is referred to the general references for this 
chapter. In this discussion, we will follow mainly the theories as formulated by 
Chao [31], Laclare [32], Sacherer [33] and Zotter [34]. 

Since the coupling of the beam to its environment depends greatly on the 
frequency involved, it seems appropriate to discuss first the frequency spectrum of 
a circulating particle beam. 


21.3.1 Longitudinal Beam Spectrum 


In case of a single circulating particle of charge g in each of np equidistant bunches, 
a pick up electrode located at azimuth g would produce a signal proportional to the 
single-particle beam current which is composed of a series of delta function signals 


4 +00 F y T, 
k=—oo 
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where T is the longitudinal offset of the particle from the reference point, m, the num- 
ber of equidistant bunches and 79 the revolution time (Fig. 21.4). With the revolution 
frequency w) = 27/To, we use the mathematical relations 27 pra 5(y—2zk) = 


yore. e”” and |c| (cy) = 8(y) for 


p=—0O0 


a 20k edi. 5 
Y 5 (« = ) ee - elPndoo (21.80) 
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where x = t— ae to replace the delta functions. We also replace the exponential 


function 


+00 
eivsiny — = In(y) lt (21.81) 
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and replace t by the synchrotron oscillation t = T cos[(m + v.)@ot + €;| where v, is 
the synchrotron oscillation tune. The term mmf reflects the mode of the longitudinal 
particle distribution in all buckets. This distribution is periodic with the periodicity 
of the circumference and the modes are the harmonics of the distribution in terms 
of the revolution frequency (Fig. 21.5). 

Inserting (21.80) on the rh.s. of (21.79) and replacing the term 7 ipmeor 
with (21.81) one gets 


ny +00 +00 
; 0 ny a 
i.e) = DIY EP n(gnyeont) (21.82) 
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Fig. 21.5 Particle 
distribution along the 
circumference of a circular 
accelerator and definition of 
parameters 
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Fig. 21.6 Current spectrum of a single particle orbiting in a circular accelerator and executing 
synchrotron oscillations 


Performing a Fourier transform 


+00 
I i 
i(@,~) = an i i(t,p) e 1 dt (21.83) 
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we get instead of (21.82) the single particle longitudinal current spectrum 


gnyo +00 +00 : 
iy(@, ~) = _ >> SLi (pmpeoo?) eH?" 82), (21.84) 
p= n= 
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where 82 = w — (pny +m + nvg)@o and making use of the identity et d¢ = 
276(w). This spectrum is a line spectrum with harmonics of the revolution 
frequency separated by nywo. Each of these main harmonics is accompanied on 
both sides with satellites separated by 2, = v,wo. Schematically, some of the more 
important lines of this spectrum are shown in Fig. 21.6 for a single particle. 

In the approximation of small synchrotron oscillation amplitudes, one may 
neglect all terms with |n| > 1 and the particle beam includes only the frequencies 
@ = [pry £ (m+ vs)]@o. In Sect. 19.5.1 the interaction of this spectrum for p = h 
with the narrow-band impedance of a resonant cavity was discussed in connection 
with Robinson damping. 

A real particle beam consists of many particles which are distributed in initial 
phase ¢; as well as in oscillation amplitudes t. Assuming the simple case of equal 
and equidistant bunches with uniform particle distributions in synchrotron phase ¢; 
we may set n = 0. The time independent particle distribution is then Do(t,7) = 
oo(t) which is normalized to unity and the total beam-current spectrum is given by 


+00 : +00 
1y(o,9) = Ib Y> 8 — Qo) 0? / Jo(pnyoo?) do(@)d?, (21.85) 
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Fig. 21.7 Dipole mode oscillation (a) and quadrupole bunch shape oscillations (b) 


where J, = q/Tp is the bunch current and 29) = pmpwo. All synchrotron 
satellites vanished because of the uniform distribution of synchrotron phases and 
lack of coherent bunch oscillations. Observation of synchrotron satellites, therefore, 
indicates a perturbation from this condition either by coherent oscillations of one 
or more bunches (n # 0,t # 0) or coherent density oscillations within a bunch 
Po(t,t) =f (Gi). 

The infinite sum over p represents the periodic bunch distribution along the 
circumference over many revolutions whether it be single or multiple bunches. The 
beam-current spectrum is expected to interact with the impedance spectrum of the 
environment and this interaction may result in a significant alteration of the particle 
distribution (t,t). As an example for what could happen, the two lowest order 
modes of bunch oscillations are shown in Fig. 21.7. 

In lowest order a collection of particles contained in a bunch may perform dipole 
mode oscillations where all particles and the bunch center oscillate coherently 
(Fig. 21.7a). In the next higher mode, the bunch center does not move but particles at 
the head or tail of the bunch oscillate 180° out of phase. This bunch shape oscillation 
is in its lowest order a quadrupole mode oscillation as shown in Fig. 21.7b. Similarly, 
higher order mode bunch shape oscillations can be defined. 


21.3.2 Transverse Beam Spectrum 


Single particles and a collection of particles in a bunch may also perform transverse 
betatron oscillations constituting a transverse beam current which can interact with 
its environment. Again, we observe first only a single particle performing betatron 
oscillations 


u = ucos y(t), (21.86) 
where u = x or y, y(t) is the betatron phase, and the transverse current is 


it (t,~) = ij(t,g) dos W(t). (21.87) 


Note that the transverse current has the dimension of a current moment represented 
by the same spectrum as the longitudinal current plus additional spectral lines due 
to betatron oscillations. The betatron phase is a function of time and depends on the 
revolution frequency and the chromaticity, which both depend on the momentum 
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of the particle. From the definition of the momentum compaction dw/wo = 14, 
chromaticity €, = dv/6é and relative momentum deviation 6 = dp/po, the variation 
of the betatron phase with time is 


Wr) =o = wo (14 £45) oy (Lt m8). (21.88) 
= won + (0+) 
X Vowo + ae WoT , 


where we have kept only linear terms in 6 and used t = —7.6. Equation (21.88) can 
be integrated for 


W(t) = Vvowo(t— Tt) — oper + Wo (21.89) 


c 


and (21.87) becomes with (21.74), (21.81), (21.84) 


in (t,g) = i(t, p)icos y(t) (21.90) 
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Following the derivation for the longitudinal current and performing a Fourier 
transform we get the transverse beam spectrum 


it (@,¢) = ane > > i "Jn 


p=—0O n=—00 


x Ee tpy—nEi) 5(Q2,), 
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where (2, = w — (p + Vo)Mp@o + n Q, defines the line spectrum of the transverse 
single particle current (Fig. 21.8). 

We note that the betatron harmonics (p + v9)mpwo are surrounded by synchrotron 
oscillation satellites, however, in such a way that the maximum amplitude is shifted 
in frequency by w&,/ 7c. It is interesting to note at this point that the integer part of 
the tune vp cannot be distinguished from the integer p of the same value. This is the 
reason why a spectrum analyzer shows only the fractional tune Av wo. 

The transverse current spectrum is now just the sum of all contributions from 
each individual particles. If we assume a uniform distribution ®(f, 7, i) in betatron 
phase, we get no transverse coherent signal because (e'¥°) = 0, although the 
incoherent space-charge tune shift is effective. Additional coherent signals appear 
as a result of perturbations of a uniform transverse particle distribution. 
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Fig. 21.8 Oscillation spectrum of a single particle orbiting in a circular accelerator and executing 
betatron and synchrotron oscillations 


Problems 


21.1 (S). The linear focusing of the beam-beam effect changes also the betatron 
function. Derive an expression that relates the change in the value of the betatron 
function iM at the collision point to the beam-beam tune shift dv. 


21.2. Verify that (21.17) and (21.25) are indeed solutions of the respective Poisson 
equation. 


21.3. Prove that (21.27) is indeed the potential for small vertical amplitudes and 
x=0. 


21.4. Calculate the linear beam-beam tune shift for each beam under the following 
head on colliding beam conditions: 


a) A 250GeV proton beam colliding with a fully ionized 30 GeV/u Au ion beam. 

(proton emittance €,, = 20 mm-mrad, gold ion emittance €,, = 33 mm-mrad, 
* = 2.0m, proton intensity 10!’ p/bunch, a total of 60 bunches per beam, gold 
ion intensity 10° Au ions/bunch). 

b) A 250GeV proton beam colliding with a fully ionized 100 GeV/u Au ion beam 
(parameters same as in a) but gold ion emittance €,, = 10 mm-mrad). 

c) A 30GeV electron beam colliding with a 820 GeV proton beam. The circumfer- 
ence of the rings is 6336 m, there are 2.1 x 10! protons and 0.8 x 10!° electrons 
in 210 bunches and the horizontal and vertical beam sizes at the collision point 
are 0x/y = 0.29/0.07 mm for the proton beam and 0.26/0.02 mm for the electron 
beam, respectively. 

d) A 1.5 GeV electron beam colliding with a 1.5 GeV positron beam at a collision 
point with €, = 0.67 mm-mrad, emittance coupling 27.7 %, BY = 1.3m, By = 
0.1m and a beam current of 66mA [35]. , 


21.5. Estimate the strength of the octupole field component of the proton beam 
in RHIC at the collision point. Would an octupole be technically feasible to 
compensate for the beam-beam octupole term? 
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21.6. At the Stanford Linear Collider, SLC, an electron beam collides with a 
positron beam at up to 50 GeV per beam. Each bunch contains 5 x 10!! particles and 
is focused to a beam diameter of 2.0 1m at the collision point where the betatron 
functions in both planes are 6* = 0.005 m. Calculate the beam-beam tune shift and 
the focal length of the beam lens for a bunch length of £ = 1 mm. Compare with 
beam-beam limits in storage rings. Why can we tolerate a much greater beam-beam 
tune shift in a linear collider compared with a storage ring? 


21.7. Show that the horizontal damping partition number is negative in a fully 
combined function FODO lattice as employed in older synchrotron accelerators. 
Why, if there is horizontal antidamping in such synchrotrons, is it possible to retain 
beam stability during acceleration? What happens if we accelerate a beam and keep 
it orbiting in the synchrotron at some higher energy? 


21.8. Future colliding beam facilities for high-energy physics experimentation are 
based on two linear accelerators aimed at each other and producing beams of very 
high energy for collision. In this arrangement synchrotron radiation is avoided 
compared to a storage ring. We assume that such beams can be directed to different 
detectors. Design an S-shaped beam transport system based on a FODO lattice, 
which would allow the beams to be directed into a detector being displaced by the 
distance D normal to the linac axis. The beams have an energy of Eg = 1,000 GeV 
and a beam emittance of € = 1.0x 107!” m which should not be diluted in this beam 
transport system by more than 10%. Determine quadrupole and bending magnet 
parameters. 


21.9. Strong focusing is required along a 500 GeV linear accelerator. Misalign- 
ments and path correction introduce dipole fields which are the source of syn- 
chrotron radiation and quantum excitation. Assume a normalized emittance of 
ye = 10~°m and an initial beam energy of 1 GeV at the entrance to the linac. 
The high-energy linac has a circular aperture of 3mm diameter. Design a FODO 
cell with sufficient focusing to contain this beam within a radius of 0.5 mm leaving 
the rest for path distortions. The distance between quadrupoles increases linearly 
with energy. Determine with statistical methods the number and strength of the 
quadrupoles for an acceleration of 100 MeV/m. Determine the alignment tolerances 
for these quadrupoles to keep the emittance increase due to quantum excitation in 
the dipole field from misaligned quadrupoles and due to correctors to 10 %. 


21.10. Consider the FODO lattice along the linear accelerator in Problem 21.9 and 
estimate the increase in beam energy spread due to synchrotron radiation from the 
finite beam size in quadrupoles. 


21.11. Consider an electron beam in a 6 GeV storage ring with a bending radius 
of p = 20m in the bending magnets. Calculate the rms energy spread o, /Eo and 
the damping time t. What is the probability for a particle to emit a photon with 
an energy of o, and 20,. How likely is it that this particle emits another such 
photon within a damping time? In evaluating the particle distribution, do we need to 
consider multiple photon emissions? 
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21.12. Consider one of the storage rings in Table 10.1 and calculate the equilibrium 
beam emittance and energy spread. To manipulate the beam emittance we vary the 
rf-frequency. Determine the maximum variation possible with this method. 


21.13. A large hadron collider LHC operates in the LEP tunnel of 28 km circum- 
ference at CERN in Geneva. The maximum proton energy is 15 TeV. Determine 
the magnetic bending field required if 80 % of the circumference can be used for 
bending magnets. Calculate the synchrotron radiation power for a circulating proton 
current of 200 mA, damping times, equilibrium beam emittance and energy spread. 


21.14. Determine basic FODO lattice parameters for a 2GeV et /e -colliding 
beam storage ring with two collision points to reach a design luminosity of 
L. = 10°! cm~?s~!. The betatron functions at the collision point be py = 5 
cm and B* = 1.3m and the emittance coupling 10%. Calculate beam sizes 
in the arc, aperture requirements, circumference and beam current. What is the 
total synchrotron radiation power? Adjust, if necessary, your design to keep the 
maximum synchrotron radiation power at the vacuum chamber wall below a 
practical limit of 5 kW/m. 
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Chapter 22 
Wake Fields and Instabilities* 


While discussing self fields of a charged particle bunch, we noticed a significant 
effect from nearby metallic surfaces. The dynamics of individual particles as well 
as collective dynamics of the whole bunch depends greatly on the electromagnetic 
interaction with the environment. Such interactions must be discussed in more detail 
to establish stability criteria for particle beams. 

The electric field from a charge in its rest frame extends isotropic from the 
charge into all directions. In the laboratory frame, this field is Lorentz contracted 
and assumes for a charge in a uniform beam pipe the form shown in Fig. 22. La. 
The contracted field lines spread out longitudinally only within an angle +1/y. 
This angle is very small for most high energy electron beams and we may describe 
the single-particle current as well as its image current by a delta function. Some 
correction must be made to this assumption for lower energy protons and specifically 
ions for which the angle 1/y may still be significant. In the following discussions, 
however, we will assume that the particle energy is sufficiently large and y > 1. 

Electron storage rings are being planned, designed, constructed, and operated for 
a variety of applications. While in the past such storage rings were optimized mostly 
as colliding beam facilities for high energy physics, in the future most applications 
for storage rings seem to be connected with the production of synchrotron radiation. 
Some of these radiation sources will be designed for higher energy particle beams 
(few GeV) to produce hard X-rays while others have moderate to low beam energies 
(= 100 MeV) to, for example, produce VUV and soft X-rays or to drive free 
electron lasers. 

The beam in an electron storage ring is composed of bunches which are typically 
a few centimeters long and are separated by a distance equal to one or more rf- 
wavelengths. The total number of bunches in a storage ring can range from one 
bunch to a maximum of / bunches, where h is the harmonic number for the storage 
ring system. The particle beam covers therefore a wide frequency spectrum from the 
kHz regime of the order of the revolution frequency up to many GHz limited only 
by the bunch length or size of the vacuum chamber. On the other hand, the vacuum 
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Fig. 22.1 Coupling of a charged particle beam to the environment; uniform chamber cross section 
(a), and obstacle on vacuum chamber surface (b) 


chamber environment constitutes an impedance which can become significant in the 
same frequency regime and efficient coupling can occur leading to collective effects. 
The most important impedance in an accelerator is that of the accelerating cavity at 
the cavity fundamental frequency. Since the particle beam is bunched at the same 
frequency, we observe a very strong coupling which has been extensively discussed 
in Sect. 19.4 in connection with beam loading. In this section, we will therefore 
ignore beam loading effects in resonant cavities at the fundamental frequency and 
concentrate only on higher-order mode losses and interaction with the general 
vacuum chamber environment. 

Depending on the particular application and experiment conducted, it may be 
desirable to store only a single bunch with the highest intensity possible. In other 
cases the maximum total achievable intensity is desired in as many bunches as 
possible and the particular bunch distribution around the ring does not matter. In 
either case the ultimate electron beam intensity will most probably be limited by 
instabilities caused by electromagnetic interaction of the beam current with the 
environment of the vacuum chamber. We ignore here technical limitations due to, 
for example, insufficient available rf-power or inability to cool the radiation heating 
of the vacuum chamber. 

Since the radiation intensity produced is directly proportional to the stored 
electron beam current, it is obvious that the usefulness of such a radiation source 
depends among other parameters on the maximum electron beam current that can 
be stored in each bunch or in the storage ring. 


22.1 Definitions of Wake Field and Impedance 


The image currents of a charge gq travelling along the axis of a uniform and perfectly 
conducting tube move with the charge without losses and no forces are generated 
that would act back on the particle. This is different for a resistive wall where the 
image fields drag a significant distances behind the charge or in case of an obstacle 
extending into the tube or any other sudden variation of the tube cross section 
(Fig. 22.1b). 
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In any of these cases, wake fields are created which have the ability to pull or 
push the charge q or test particles following that charge. Because of causality, no 
such fields exist ahead of a relativistically moving charge. 

Energy losses and gains of a single or collection of particles can cause significant 
modifications in the dynamics of particle motion. Specifically, we are concerned 
that such forces may lead to particle or beam instability which must be understood 
in detail to determine limitations or corrective measures in a particular accelerator 
design. The interaction of a charged particle beam with its environment can be 
described in time domain or frequency domain where both have their advantages 
and disadvantages when it comes to evaluate their effect on particle dynamics. 


22.1.1 Parasitic Mode Losses and Impedances 


In time domain, the interaction is described by wake fields which then act on 
charges. In frequency domain, vacuum chamber components can be represented 
as a frequency dependent impedance. We used this picture before while discussing 
properties of accelerating cavities. Many vacuum chamber components or sudden 
changes in cross section behave like cavities and represent therefore frequency 
dependent impedances. Together with the frequency spectrum of the beam, we 
find strong coupling to the vacuum chamber if the impedance and particle beam 
have a significant component at the same frequency. The induced voltage V(w) 
from this interaction is proportional to the collective particle current /(@) and the 
impedance Z(w) acting as the proportionality factor, describes the actual coupling 
from the particle beam via the vacuum chamber environment to the test particle. 
Mathematically, we set 


V(@) = —Z(@) I(w) (22.1) 


indicating by the minus sign that the induced voltage leads to an energy loss for 
beam particles. The impedance is in general complex and depends for each piece of 
vacuum chamber including accelerating cavities or accidental cavities, on its shape, 
material and on the frequency under consideration. The coupling impedance for a 
particular vacuum chamber component or system may be narrow band with a quality 
factor Q > | like that in an accelerating cavity or broad band with Q ~ | due toa 
sudden change in the vacuum chamber cross section. 

Fields induced by the beam in a high Q structure are restricted to a narrow 
frequency width and persist for a long time and can act back on subsequent 
particle bunches or even on the same particles after one or more revolutions. Such 
narrow-band impedances can be the cause for multi-bunch instabilities but rarely 
affect single bunch limits. The main source for a narrow-band impedance in a 
well-designed accelerator comes from accelerating cavities at the fundamental as 
well as higher-order mode frequencies. There is little we can or want do about 
the impedance at the fundamental frequency which is made large by design for 
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efficiency. The design of modern accelerator cavities exhibit significantly reduced 
impedances for higher-order modesor HOMs. 

The source for broad-band impedances are discontinuities in cross section or 
material along the vacuum chamber including accelerating cavities, flanges, kicker 
magnets with ferrite materials, exit chambers electrostatic plates, beam position 
monitors, etc. Many higher order modes over a wide frequency range can be excited 
in such discontinuities by a passing short particle bunch, but all modes decoher very 
fast. Only for a very short time are these mode fields in phase, adding up to a high 
field intensity but at the time of arrival of the next particle bunch or the same bunch 
after one or more revolutions these fields have essentially vanished. Broad-band 
wake fields are therefore mainly responsible for the appearance of single-bunch 
beam instabilities. 

Due to tight particle bunching by the rf-system to about 5 % of the rf-wavelength, 
we have large instantaneous currents with significant amplitudes of Fourier compo- 
nents at harmonics of the revolution frequency up to about 20 times the rf-frequency 
or down to wavelength of a few centimeters. Strong electromagnetic interaction 
between electron bunches and cavity like structures as part of the vacuum enclosure 
must therefore be expected. Any but the smallest steps in the cross section of 
the vacuum chamber constitute cavity like structures. A bunch passing by such 
a structure deposits electromagnetic energy which in turn causes heating of the 
structure and can act back on particles in a later segment of the same bunch or 
in a subsequent bunch. Schematically such fields, also called wake fields, are shown 
in Fig. 22.1 where the beam passes by a variation in the cross section of the vacuum 
chamber. We will discuss the nature and the frequency spectrum of these wake fields 
to determine the effect on the stability of the beam and to develop counter measures 
to minimize the strength and occurrence of these wake fields. 

We distinguish broad band parasitic losses where the quality factor Q is of the 
order of unity from narrow band losses with higher Q values. Fields from broad 
band losses last only a very short time of the order of one period and are mainly 
responsible for single bunch instabilities, where the fields generated by electrical 
charges in the head of the bunch act back on the particles in the tail of the same 
bunch. Due to the low value of the quality factor (Q ~ 1) these broad band wake 
fields decay before the next bunch arrives. 

Wake fields can appear as longitudinal or transverse modes and cause corre- 
spondingly longitudinal or transverse instabilities. Obviously, a perfect vacuum 
chamber would have a superconducting surface and be completely uniform around 
the ring. This is not possible in real accelerators because we need rf-systems which 
by their nature are not smooth, injection/ejection components, synchrotron light 
ports, bellows, and beam position monitors. While we cannot avoid such lossy 
components we are able by proper design to minimize the detrimental effects of 
less than ideal components. 

The loss characteristics of a particular piece of the vacuum chamber for the whole 
ring is generally expressed in terms of an impedance Z or in terms of a loss factor 
k. To illustrate the different nature of wake fields we assume a cavity like change in 
the cross section of the vacuum chamber as shown in Fig. 22.2. 
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Fig. 22.2 Longitudinal parasitic mode 
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Fig. 22.3 Transverse parasitic mode 


A bunch passing through such a structure on axis excites in lowest order a 
longitudinal electrical field and a transverse magnetic field as shown. Such a field 
pattern will not cause a transverse deflection of the whole beam since the electrical 
field is strictly longitudinal and the transverse magnetic field is zero on axis and out 
of phase. For this situation we define a longitudinal impedance Z) by 


f E(@) dz 


se oma 


(22.2) 


where E(w) is the electric field at the frequency w and /(@) the Fourier transform 
of the bunched beam current. The r.h.s. of (22.2) is the energy gained per unit 
charge and is equivalent to an accelerating voltage divided by the current, where 
the actual frequency dependence as determined by the specific physical shape of the 
“resonating” structure. 

In a similar way we can define a transverse impedance. A beam passing off axis 
through a “cavity” excites asymmetric fields, as shown in Fig. 22.3, proportional to 
the moment of the beam current J(@) Ax, where Ax is the displacement of the beam 
from the axis. 

Such an electrical field is connected through Maxwell’s equation with a finite 
transverse magnetic field on axis, as shown in Fig. 22.3, which causes a transverse 
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deflection of the beam. Consistent with the definition of the longitudinal impedance 
we define a transverse impedance by 


a (E(w) + [v x B(w)])| 1dz 
I(w) Ax 


Zi (@) = (22.3) 


where v is the velocity of the particle and B(w) the magnetic field component of the 
electromagnetic field at frequency w. In general the impedances are complex 


Z(@) = Zre(@) + iZim() . (22.4) 


The resistive part of the impedance can lead to a shift in the betatron oscillation 
frequency of the particles while the reactive or imaginary part may cause damping 
or antidamping. 

The impedance is a function of the frequency and its spectrum depends on 
the specific design of the vacuum chambers in a storage ring. The longitudinal 
impedance of vacuum chambers has been measured in SPEAR and in other existing 
storage rings and has been found to follow a general spectrum as a consequence 
of similar design concepts of storage ring components. SPEAR measurements, as 
shown in Fig. 19.9, demonstrate the general form of the frequency spectrum of the 
vacuum chamber impedance [1]. 

Characteristic for the spectrum is the cutoff frequency f. at which the linear 
impedance function reaches a maximum and above which the fields are able to 
propagate in the vacuum chamber. This cutoff frequency obviously is determined 
by the aperture of the vacuum chamber and therefore occurs at different frequencies 
for different rings with different vacuum chamber apertures. For the longitudinal 
broad band impedance at high frequencies above the cutoff frequency f, we have 
the simple power law 


Zito) SZ, (@ > a). (22:5) 


To simplify comparisons between different storage rings we define a normalized 
impedance Z/n as the impedance at the cut off frequency divided by the mode 
number 7 which is the ratio of the cutoff frequency f, to the revolution frequency 


Trev 


Z 


Ze 
Ie / rev 


F (22.6) 


n 
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This definition of the normalized impedance can be generalized to all frequencies 
and together with (22.5) the impedance spectrum becomes 


Zi w —1.68 
( ) (22.7) 
c Wc 
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Where only one is known, we can make an estimate of the other one through the 
approximate relation which is correct only for cylindrically symmetric structures 
[2, 3] 


A=>3—) (22.8) 


where 27R is the ring circumference and b the typical vacuum chamber radius. 
The longitudinal impedance of the whole storage ring vacuum system including 
rf-cavities can be determined by measuring the energy loss of particles in a high 
intensity bunch compared to the energy loss for particles in a low intensity bunch. 
Such loss measurements are performed by observing the shift in synchronous phase 
for the low and high intensity beam. The parasitic losses of rf-cavities can be 
calculated very accurately with computer programs or are known from laboratory 
measurements. From the separate knowledge of cavity and total ring losses we 
derive the vacuum chamber losses by simple subtraction. 

A bunched particle beam of high intensity represents a source of electromagnetic 
fields, called wake fields [4] in a wide range of wavelengths down to the order of the 
bunch length. The same is true for a realistic coasting beam where fluctuations in 
beam current simulate short particle bunches on top of an otherwise uniform beam. 

Introducing wake fields and higher-order mode losses, we distinguish two 
groups, the longitudinal and the transverse wake fields. The longitudinal wake fields 
being in phase with the beam current cause energy losses to the beam particles, 
while transverse wakes deflect particle trajectories. There is no field ahead of 
relativistically moving charge due to causality. From the knowledge of such wake 
fields in a particular environment we may determine the effect on a test charge 
moving behind a charge gq. 

The character of local wake fields depends greatly on the actual geometry and 
material of the vacuum chamber and we may expect a significant complication in 
the determination of wake field distributions along a vacuum enclosure of an actual 
accelerator. It is not practical to evaluate these fields in detail along the beam path 
and fortunately we do not need to. Since the effects of localized fields are small 
compared to the energy of the particles, we may integrate the wake fields over a 
full circumference. As we will see, this integral of the field can be experimentally 
determined. 


22.1.2 Longitudinal Wake Fields 


One may wonder how the existence of an obstacle in the vacuum chamber, like a 
disk which is relatively far away from the charge g, can influence a test particle 
following closely behind the charge gq. To illustrate this, we consider the situation 
shown in Fig. 22.4. 
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Long before the charge reaches the obstruction, fields start to diverge from the 
charge toward the obstruction to get scattered there. Some of the scattered fields 
move again toward the charge and catch up with it due to its slightly faster speed. 

The details of this catch up process are, however, of little interest compared to 
the integrated effect of wake fields on the test particle. Each charge at the position z 
creates a wake field for a particle at location Z < z and this wake field persists during 
the whole travel time along an accelerator segment L assuming that the distance 

= z—Z does not change appreciably along L. We define now a longitudinal wake 
function by integrating the longitudinal wake fields E, along the interaction length 
L, which might be the length of a vacuum chamber component, a linear accelerator 
or the circumference of a circular accelerator, and normalize it to a unit charge. By 
integrating, which is the same as averaging over the length L, we eliminate the need 
to calculate everywhere the complicated fields along the vacuum chambers. The 
wake field at the location of a test particle at z from a charge q at location z is then 
(Fig. 22.4) 


Fig. 22.4 Catch up of wake 
fields with test particle 


wy) = x | By(c.t~£/Be) de. (22.9) 
qd JL 


where ¢ = z— Zz > 0. The wake function is measured in V/Cb using practical units 
and is independent of the sign of the charge. To get the full wake field for a test 
particle, one would integrate the wake function over all particles ahead of the test 
particle. 

The longitudinal wake function allows us to calculate the total energy loss of 
the whole bunch by integrating over all particles. We consider a test particle with 
charge e at position Z and calculate the energy loss of this particle due to wake fields 
from charges further ahead at z > Zz. The total induced voltage from a collection of 
particles with distribution A(z) on the test charge at Z is then determined by the wake 
potential! 


Vom @) = —e / AQ)Wy(z—- 2 dz, (22.10) 


‘Expression (22.9) is sometimes called the wake potential. We do not follow this nomenclature 
because the expression (22.9) does not have the dimension of a potential but (22.10) does. 
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where a negative sign was added to indicate that the wake fields are decelerating. 
Integrating over all slices dz, the total energy loss of the bunch into HOM fields is 


AU yom = -| eA(Z) ae[ eA(z) Wi (z — 2) dz. (22.11) 


_ z 
a 
wake potential at Z 


The linear distribution A(z) of particles with charge e is normalized to the total 
number of particles M, in the bunch [ A(z) dz = Np. It is interesting to perform the 
integrations in (22.11) for a very short bunch such that the wake function seen by 
particles in this bunch is approximately constant and equal to Wo. In this case, we 
define the function w(Z) = i eA(z) dz and the double integral assumes the form 
— [°,wdw = +(eNp)? where we have used the normalization w(—oo) = eNy. 
Particles in a bunch see therefore only 50% of the wake fields produced by the 
same bunch consistent with our earlier formulation of the fundamental theorem of 
wake fields discussed in Sect. 19.3 in connection with wake fields in rf-cavities. By 
the same argument, each particle sees only half of its own wake field. 

Wake functions describe higher-order mode losses in the time domain. For 
further discussions, we determine the relationship to the concept of impedance 
in the frequency domain and replace in (22.10)the charge distribution with the 
instantaneous current passing by Z 


1(Z, 1) = Ip ll | (22,12) 


The beam current generally includes more than one mode k but for simplicity we 
consider only one in this discussion. Integrating over all parts of the beam which 
have passed the location z before, the wake potential (22.10) becomes 


di on om t—z . 

Vuom Zt) = -— j (ee ~*) Wi (z —Z) dz. (22.13) 
Consistent with a time dependent beam current, the induced voltage depends on 

location Z and time as well. The wake function vanishes due to causality for z—Z < 0 

and the integration can therefore be extended over all values of z. With (22.12), 

¢ = z—Zand applying a Fourier transform (22.13) becomes 


Vuou (ts @) = “105 / ~ e 1B WL) de. (22.14) 


=—&O 


From (22.14) we define the longitudinal coupling impedance in the frequency 
domain 


i cee 
Z\(@) = af. eo 1 OS/P WE) de (22.15) 
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which has in practical units the dimension Ohm. The impedance of the environment 
is the Fourier transform of the wake fields left behind by the beam in this 
environment. Because the wake function has been defined in (22.9) for the length 
L of the accelerator under consideration, the impedance is an integral parameter of 
the accelerator section L as well. Conversely, we may express the wake function in 
terms of the impedance spectrum 


L pe Esai 
Wy) = 5 i Z(w) eb?! ? dey. (22.16) 


The interrelations between wake functions and impedances allows us to use 
the most appropriate quantity for the problem at hand. Generally, it depends on 
whether one wants to work in the frequency or the time domain. For theoretical 
discussions, the well defined impedance concept allows quantitative predictions for 
beam stability or instability to be made. In most practical applications, however, 
the impedance is not quite convenient to use because it is not well known for 
complicated shapes of the vacuum chamber. In a linear accelerator, for example, 
we need to observe the stability of particles in the time domain to determine the 
head-tail interaction. The most convenient quantity depends greatly on the problem 
to be solved, theoretically or experimentally. 


Loss Parameter 


In a real accelerator, the beam integrates over many different vacuum chamber 
pieces with widely varying impedances. The interaction of the beam with the 
vacuum chamber impedance leads to an energy loss which has to be compensated 
by the rf-system. We are therefore not able to experimentally determine the 
impedance or wake function of a particular vacuum chamber element. Only the 
integrated impedance for the whole accelerator can sometimes be probed at specific 
frequencies by observing specific instabilities as we will discuss later. The most 
accurate quantity to measure the total resistive impedance for the whole accelerator 
integrated over all frequencies is the loss factor or loss parameter. 
We characterize this loss through the loss factor k defined by 


_ AU 


a 


é (22.17) 


where AU is the total energy deposited by the passing bunch and q is the total 
electrical charge in this bunch. This definition is a generalization of the energy loss 
of a single particle passing once through a resonator where k = —(w/4)(Rs/Q) and 
R, is the shunt impedance of this resonator. The loss factor is related to the real part 
of the impedance by 


k= af Re[Z(w)] I? (w) dw 
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Fig. 22.5 Dependence of the 
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and depends strongly on the bunch length as can be seen from measurements of the 
loss factor in SPEAR [5] shown in Fig. 22.5. Specifically, we find the loss factor to 
scale with the bunch length like 


k(or) ~ oj, ?". (22.18) 


Similar to the definitions of impedances, we also distinguish a longitudinal and 
a transverse loss factor. The loss factor can be related to the wake function and we 
get from comparison with (22.11) the relation 


Kitson a wf 2® az [ A(z) W (z—2z)dz. (22.19) 


The loss parameter can be defined for the complete circular accelerator or for 
a specific vacuum chamber component installed in a beam line or accelerator. 
Knowledge of this loss factor is important to determine possible heating effects 
which can become significant since the total higher-order mode losses are deposited 
in the form of heat in the vacuum chamber component. In a circular accelerator, the 
energy loss rate or heating power of a beam circulating with revolution frequency 


to iS 


ye 


WOE mg (22.20) 


HOM 


where np is the number of bunches in the beam and J, = np gNpfo is the average 
circulating beam current in the accelerator. As an example, we consider a circulating 
beam of 1mA in one bunch of the LEP storage ring where the revolution frequency is 
about fo = 10 kHz. The heating losses in a component with loss factor kyoy = 0.1 
V/pCb would be 10 Watts. This might not seem much if the component is large 
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and an external cooling fan might be sufficient. On the other hand, if the vacuum 
component is small and not accessible like a bellows this heating power might 
be significant and must be prevented by design. The higher-order heating losses 
scale like the average current, the bunch current and inversely proportional with 
the revolution frequency. For a given circulating beam current, the losses depend 
therefore greatly on the number of bunches and the size of the circular accelerator. 
As the bunch length becomes smaller, higher and higher modes can be excited as 
demonstrated by the steep increase in loss parameter with decreasing bunch length 
(Fig. 22.5). Although we try to apply a careful design to all accelerator components 
to minimize the impedance it is prudent to be aware of this heating effect while 
developing accelerators that involve significantly reduced bunch length like those in 
quasi-isochronous storage rings or beams accelerated by laser beams. 

The loss parameter can be measured by observing the shift in the synchronous 
phase. A bunch of particles circulating in an accelerator looses energy due to 
the resistive impedance of the vacuum chamber. This additional energy loss is 
compensated by an appropriate shift in the synchronous phase which is given by 


AU yom = eNp Viel sin (ds — dso) | : (22.21) 


where @¢,o is the synchronous phase for a very small beam current and V,, the peak 
rf-voltage. The loss factor is then with the number of particles per bunch M, 


AU, 
kuom = NE . (22.22) 


Performing this measurement as a function of rf-voltage one can establish a curve 
similar to that shown in Fig. 22.5 for the storage ring SPEAR and the dependence of 
the loss parameter on the bunch length can be used to determine the total resistive 
impedance of the accelerator as a function of frequency. To do that, we write (22.19) 
in terms of Fourier transforms 


1 ia 5 
A =e Zres(@) |[(w)|- dw (22.23) 
e-Ny, J—oo 
and recall that the bunch or current distribution in a storage ring is Gaussian 


Ip 2/992 
I(t) = ———e* 9% | (22.24) 
V210, 


The Fourier transform of a Gaussian distribution is 


I(w) = he?” , (22.25) 
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where J is the total bunch current and inserting (22.25) into (22.23), we get 


Io me —wo2 
ky HOM e2N2 i Zres(@) € tda. (22.26) 
With (22.26) and the measurement Kom (20) where oy = co;, one may solve for 
Zres(@) and determine the resistive-impedance spectrum of the ring. 

Unfortunately, it is not possible to attach a resistance meter to an accelerator to 
determine its impedance and we will have to apply a variety of wake field effects on 
the particle beams to determine the complex impedance as a function of frequency. 
No single effect, however, will allow us to measure the whole frequency spectrum 
of the impedance. 


22.1.3 Transverse Wake Fields 


Similar to the longitudinal case we also observe transverse wake fields with 
associated impedances. Such fields exert a transverse force on particles generated 
by either transverse electrical or magnetic wake fields. Generally such fields appear 
when a charged particle beam passes off center through a nonuniform but cylindrical 
or through an asymmetric vacuum chamber. Transverse wake fields can be induced 
only on structures which also exhibit a longitudinal impedance. A beam travelling 
off center through a round pipe with perfectly conducting walls will not create 
longitudinal and therefore also no transverse wake fields. 

We consider a charge g passing through a vacuum chamber structure with an 
offset Au = (Ax, Ay) in the horizontal or vertical plane as shown in Fig. 22.3. 

In analogy to the definition of the longitudinal wake function (22.9), we define a 
transverse wake function per unit transverse offset by 


pile tE(t— ¢/Bc) + clB x Bit—¢/Be)]} 1 dz 


Wit) = pha 


(097) 


which is measured in units of V/Cb/m. Consistent with the definition (22.15) of the 
longitudinal impedance, the transverse coupling impedance is the Fourier transform 
of the transverse wake functions defined by 


Zi (o) = ig / © erietle W1(¢) de (22.28) 


adding the factor i to indicate that the action of the transverse force is a mere 
deflection while the particle energy stays constant. This transverse impedance is 
measured in Ohm/m. The inverse relation is similar to the longitudinal case 


a ie er 
W.(2 =i— / Z1 (w) eb?! dav. (22.29) 
PE an ee 
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22.1.4 Panofsky-Wenzel Theorem 


The general relationship between longitudinal and transverse wake fields is 
expressed by the Panofsky-Wenzel theorem [6]. Panofsky and Wenzel studied 
the effect of transverse electromagnetic fields on a particle trajectory and applied 
general relations of electromagnetic theory to derive a relationship between 
longitudinal and transverse electromagnetic forces. We will derive the same result 
in the realm of wake fields. The Lorentz force on a test particle at z due to transverse 
wake fields from charges at location z > Z causes a deflection of the particle 
trajectory and the change in transverse momentum of the test particle is after 
integration over all charges at locations z < Z 


Pp. = Al [E + (v x B)], dz. (22.30) 


Note that the wake fields vanish because of causality for ¢ < 0. The fields can be 
expressed by the vector potential EF, = —dA,/dt and B, = (VxA)_. The particle 
velocity has only one nonvanishing component v= (0,0, v) and (22.30) becomes 
with dz/dt = v 


=d/dz 


—_—_—__ 
47a ar a 
p= -cef ——+ — JA, dz+ceV, A| dz, (22.31) 
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where we made use of the vector relation for v x (V x A)+A x (V x v) which is 
— 


=0 
equal to V, (vA) — (vV)A,—(AV) v noting that the particle velocity is a constant. 
————” 


=0 

The integrand in the first integral of (22.31) is equal to the total derivative 
dA, /dz and the integral vanishes because the fields vanish for ¢ = soo. After 
differentiation with respect to the time ¢ (22.31) becomes 


d [o.@) 

PL _ ey, / E\ dz (22.32) 
dt 6 

which is in terms of forces 


") 
—F, =-V\F\. (22.33) 
dz 


The longitudinal gradient of the transverse force or electromagnetic field is propor- 
tional to the transverse gradient of the longitudinal force or electromagnetic field 
and knowledge of one allows us to calculate the other. 
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22.2 Impedances in an Accelerator Environment 


The vacuum chamber of an accelerator is too complicated in geometry to allow an 
analytical expression for its impedance. In principle each section of the chamber 
must be treated individually. By employing two or three-dimensional numerical 
codes it may be possible to determine the impedance for a particular component 
and during a careful design process for a new accelerator, this should be done 
to avoid later surprises. In [7] expressions for many geometries are compiled. 
Yet, every accelerator is somewhat different from another and will have its own 
particular overall impedance characteristics. For this reason, we focus in these 
discussions specifically on such instabilities which can be studied experimentally 
revealing impedance characteristics of the ring. However, depending on the fre- 
quency involved, there are a few classes of impedances which are common to 
all accelerators and may help understand the appearance and strength of certain 
instabilities. In this section, we will discuss such impedances to be used in later 
discussions on stability conditions and growth rate of instabilities. 

Consistent with (21.88), (22.10) the longitudinal impedance for a circular 
accelerator is defined as the ratio of the induced voltage at frequency w to the Fourier 
transform of the beam current at the same frequency 


_ SE \() dz 
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Similarly the transverse impedance is from (22.27), (22.28) the ratio of induced 
transverse voltage to the transverse moment of the beam current 
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Z1(o) = T(w) Au 


, (22.35) 


where Au is the horizontal or vertical offset of the beam from the axis. 


22.2.1 Space-Charge Impedance 


In (21.78) we found an induced voltage leading to an energy gain or loss due 
to a collection of charged particles. It is customary to express (21.78) in a form 
exhibiting the impedance of the vacuum chamber. In case of a perfectly conducting 
vacuum chamber Ew = 0 and (21.78) becomes 


Vz = —Zisc In elf? -O™ , (22.36) 
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where the longitudinal space-charge impedance Z),, is defined by [8, Sect. BA 5)> 


Ajl@) tt) (igo ™ (22.37) 
neg 2By? ro)” , 


where n = w/o and @p is the revolution frequency. This expression is correct for 
long wavelength below cut off of the vacuum chamber or for w < c/ry. The space- 
charge impedance is purely reactive and, as we will see, capacitive. For a round 
beam of radius ro and offset from the axis of a round beam pipe with diameter 27, 
a transverse space-charge impedance can be derived [7] 


i R 1 1 
Z1se(@) = —— =P (22.38) 
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where R is the average ring radius. The transverse space-charge impedance is 
inversely proportional to 6? and is therefore especially strong for low energy particle 
beams. 


22.2.2. Resistive- Wall Impedance 


The particle beam induces an image current in the vacuum chamber wall in a thin 
layer with a depth equal to the skin depth. For less than perfect conductivity of the 
wall material, we observe resistive losses which exert a pull or decelerating field 
on the particles. This pull is proportional to the beam current and integrating the 
fields around a full circumference 27R of the accelerator we get the longitudinal 
resistive wall impedance in a uniform tube of radius r,, at frequency w, for lowest 
order monopole oscillations [9] 
R 


Z| (@ , 
uC y lwo Oskin 


=(1—i) (22.39) 
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where the skin depth is defined by [10] Sskin(@n) = / ae The longitudinal 


resistive wall impedance decays with increasing frequency and therefore plays an 
important role only for lower frequencies up to tens of GHz[9]. The transverse 
resistive wall impedance for a round beam pipe is from the Panofsky-Wenzel 
theorem [6] 
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2Note: the factor 1/(eoc) = V Lo/€o = Zo = 376.73 @ is often called the free space impedance. 
We will not use it because it is not a physical quantity but only a convenient unit scaling factor. A 
current passing through vacuum will not loose energy into this impedance. 
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22.2.3. Cavity-Like Structure Impedance 


The impedance of accelerating cavities or cavity like objects of the vacuum chamber 
can be described by the equivalent of a parallel resonant circuit for which the 
impedance is from (19.11) 


1 Dg, gid 
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where Q is the quality factor and R, the cavity impedance at the resonance frequency 
@, or cavity shunt impedance. Taking the inverse, we get for the normalized 
impedance 
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where an = R, is purely resistive andn = w/wo. 

Vacuum chamber impedances occur, for example, due to sudden changes of cross 
section, flanges, beam position monitors, etc., and are collectively described by a 
cavity like impedance with a quality factor Q ~ 1. This is justified because fields 
are induced in these impedances at any frequency. From (22.42) the longitudinal 
broad-band impedance is therefore 
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This broad-band impedance spectrum is shown in Fig. 22.6 and we note that the 
resistive and reactive part exhibit different spectra. 

The resistive broad-band impedance has a symmetric spectrum and scales like 
w* for low frequencies decaying again for very high frequencies like 1/@*. At low 
frequencies, the broad-band impedance (22.43) is almost purely inductive scaling 
linear with frequency 
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At high frequencies the impedance becomes capacitive 
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Fig. 22.6 Resistive and 
reactive broad-band 
impedance spectrum 


Z/n , 


Re(Z/n) 


@ @ 


Q=1 Im(Z,/n) 


and decaying slower with frequency than the resistive impedance. We note, however, 
that the reactive broad-band impedance spectrum changes sign and beam stability or 
instability depend therefore greatly on the actual coupling frequency. At resonance, 
the broad-band impedance is purely resistive as would be expected. 

Sometimes it is convenient to have a simple approximate correlation between 
longitudinal and transverse impedance in a circular accelerator as shown in (22.40). 
Although this correlation is valid only for the resistive wall impedance in a round 
beam pipe, it is often used for approximate estimates utilizing the broad-band 
impedance. 


22.2.4 Overall Accelerator Impedance 


At this point, we have identified all significant types of impedances we generally 
encounter in an accelerator which are the space charge, resistive wall, narrow-band 
impedances in high Q cavities, and broad-band impedance. In Fig. 22.7 we show 
qualitatively these resistive as well as reactive impedance components as a function 
of frequency. 

At low frequency the reactive as well as the resistive component of the resistive 
wall impedance dominates while the space charge impedance is independent of 
frequency. The narrow-band cavity spectrum includes the high impedances at the 
fundamental and higher mode frequencies. 

Generally, it is not possible to use a uniform vacuum chamber in circular 
accelerators. Deviations from a uniform chamber occur at flanges, bellows, rf- 
cavities, injection/ejection elements, electrostatic plates, etc. It is not convenient 
to consider the special impedance characteristics of every vacuum chamber piece 
and we may therefore look for an average impedance as seen by the beam. The 
broad-band impedance spectrum created by chamber components in a ring reaches 
a maximum at some frequency and then diminishes again like 1/@. This turn over 
of the broad-band impedance function depends on the general dimensions of all 
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Fig. 22.7 Qualitative spectra of resistive and reactive coupling impedances in a circular accelera- 
tor 


vacuum chamber components of a circular accelerator and has to do with the cut off 
frequency for travelling waves in tubes. 

In Fig. 19.9 the measured impedance spectrum of a storage ring was shown and is 
typical for complex storage ring vacuum chambers which are generally composed of 
similar components exhibiting at low frequencies an inductive impedance increasing 
linearly with frequency and diminishing again at high frequencies. This is also 
the characteristics of broad-band cavity impedance and therefore expressions for 
broad-band impedance are useful tools in developing theories for beam instabilities 
and predicting conditions for beam stability. The induced voltage for the total ring 
circumference scales like Ly Ts where Ly, is the wall inductance and iy the time 
derivative of the image current in the wall. The induced voltage is 


di 
AV.» = ly = i@Ly (@)Iy b9—&) (22.46) 
where the inductive impedance is defined by 


Zhina(@) = —ioLw(o). (22.47) 


The total induced voltage due to space charge, resistive and inductive wall 
impedance is finally 


Vow = —Zy In ebm, (22.48) 


where the total longitudinal normalized impedance at frequency , is from (22.37), 
(22.39), (22.47) 


R : 
——_ - 1 
Ty O8gkin 4a 


w 


Lg tin: 
€0 
(22.49) 


n 2eoBcy? 


142i) 40-9 


WwW 
ro 


756 22 Wake Fields and Instabilities* 


From the frequency dependence we note that space charge and inductive wall 
impedance becomes more important at high frequencies while the resistive wall 
impedance is dominant at low frequencies. The inductive wall impedance derives 
mostly from vacuum chamber discontinuities like sudden change in the vacuum 
chamber cross section, bellows, electrostatic plates, cavities, etc. In older acceler- 
ators, little effort was made to minimize the impedance and total ring impedances 
of |Z\/n| ~ 20 to 3082 were not uncommon. Modern vacuum design have been 
developed to greatly reduce the impedance mostly by avoiding changes of vacuum 
chamber cross section or by introducing gentle transitions and impedances of the 
order of |Z) /n| < 1 2 can be achieved whereby most of this remaining impedance 
comes from accelerating rf-cavities. 

From (22.49), we note that the space-charge impedance has the opposite sign 
of the inductive impedance and is therefore capacitive in nature. In general, we 
encounter in a realistic vacuum chamber resistive as well as reactive impedances 
causing both real frequency shifts or imaginary shifts manifesting themselves in the 
form of damping or instability. In subsequent sections, we will discuss a variety of 
such phenomena and derive stability criteria, beam-current limits or rise times for 
instability. At this point, it is noteworthy to mention that we have not made any 
assumption as to the nature of the particles involved and we may therefore apply the 
results of this discussion to electron as well as proton and ion beams. 


22.2.5 Broad-Band Wake Fields in a Linear Accelerator 


The structure of a linear accelerator constitutes a large impedance for a charged 
particle beam, specifically, since particle bunches are very short compared to the 
periodicity of the accelerator lattice. Every single cell resembles a big sudden 
change of the vacuum chamber cross section and we expect therefore a large 
accumulation of wake fields or impedance along the accelerator. The wake fields 
can be calculated numerically [4] and results for both the longitudinal and transverse 
wakes from a point charge are shown in Fig. 22.8 as a function of the distance behind 
this point charge. 

Broad-band wake fields for other structures look basically similar to those shown 
in Fig. 22.8. Specifically, we note the longitudinal wake to be strongest just behind 
the head of the bunch while the transverse wake builds up over some distance. For 
an arbitrary particle distribution, one would fold the particle distribution with these 
wake functions to obtain the wake potential at the location of the test particle. 


22.3. Coasting-Beam Instabilities 


The space-charge impedance as well as resistive and reactive wall impedances 
extract energy from a circulating particle beam. As long as the particle distribution 
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Fig. 22.8 Time dependence of transverse (left) and longitudinal (right) wake fields from a point 
charge moving through one 3.3 cm long cell of a SLAC type 3 GHz linear accelerator structure [4] 


is uniform, this energy loss is the same for all particles and requires simple 
replacement in acceleration cavities. In reality, however, some modulation of the 
longitudinal particle distribution cannot be avoided and we encounter therefore 
an uneven energy loss along the coasting particle beam. This can have serious 
consequences on beam stability and we therefore need to discuss stability criteria 
for coasting beams. 


22.3.1 Negative-Mass Instability 


Consider a beam in a ring below transition energy. The repulsive electrostatic 
field from a lump in the charge distribution causes particles ahead of the lump to 
be accelerated and particles behind the lump to be decelerated. Since accelerated 
particles will circulate faster and decelerated particles circulate slower, we observe 
a stabilizing situation and the lumpy particle density becomes smoothed out. Nature 
demonstrates this in the stability of Saturn’s rings.which is equi8valent to this case 
below transition energy. 

At energies above transition energy the situation changes drastically. Now the 
acceleration of a particle ahead of a lump leads to a slower revolution frequency and 
it will actually move closer to the lump with every turn. Similarly a particle behind 
the lump becomes decelerated and circulates therefore faster, again catching up with 
the lump. We observe an instability leading to a growing concentration of particles 
wherever a small perturbation started to occur. We call this instability the negative- 
mass instability [11] because acceleration causes particles to circulate slower similar 
to the acceleration of a negative mass. The same mechanism can lead to stabilization 
of oscillations if the forces are attractive rather than repulsive. 

We will derive conditions of stability for this effect in a more quantitative 
way. The stability condition depends on the variation of the revolution frequency 
for particles close to the small perturbation of an otherwise uniform longitudinal 
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particle distribution and we therefore investigate the time derivative of the revolution 
frequency 


daw dm d0w06 


dt oF 00 oF eo 


which can also be expressed in the form 


dw _ daw dE _ Neo dE 


= = ; 22.51 
dt dE dt B?Eo dt ( ) 


where 7, is the momentum compaction. The energy change per unit time is for a 
longitudinal impedance Z, and nth harmonic of the beam current 
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where g = eZ > 0 is the electrical charge of the particle and Z the charge 


multiplicity. Collecting all terms for (22.51) we get with 


@ = wy + w,el9- 29 (22.53) 
the relation 
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This can be further simplified with the continuity equation 
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and we get with (21.77), (22.53) 
(2 — nwo), = Oynlo. (22.55) 


Replacing @, in (22.54) by the expression (22.55,) we finally get for the 
perturbation frequency 2 with Ip = BcdAo 
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Equation (22.56) determines the evolution of the charge or current perturbation 
An or I, respectively. With AQ = AQ, + iAQ;, the current perturbation is 


E TO ai ec a a) = i; eAvit ei(29—nwot—AQ,t) (22.57) 
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exhibiting an exponential factor which can cause instability or damping since 
there is a positive as well as negative solution from (22.56) for the frequency 
shift AS2;. The situation in a particular case will depend on initial conditions 
describing the actual perturbation of the density distribution, however, different 
initial perturbations must be expected to be present along a real particle distribution 
including at least one leading to instability. 

Beam stability occurs only if the imaginary part of the frequency shift vanishes. 
This is never the case if the impedance has a resistive component causing a resistive- 
wall instability [12]. From (22.56) and the resistive wall impedance (22.39) we may 
derive a growth rate for the instability 


1 2—1n2gnew2IbR [2 
Z = ImfAQ} = V2 = 1 ngnewilo falda a (22.58) 
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This result requires some more discussion since we know that circular accelera- 
tors exist, work, and have metallic vacuum chambers with a resistive surface. The 
apparent discrepancy is due to the fact that we have assumed a monochromatic beam 
which indeed is unstable but also unrealistic. In the following sections, we include a 
finite momentum spread, find a stabilizing mechanism called Landau damping and 
derive new stability criteria. 

Below transition energy, 7. > O will assure stability of a coasting beam as 
long as we consider only a purely capacitive impedance like the space-charge 
impedance (22.37) in which case A{2; = 0. Above transition energy 7. < 0 and the 
negative-mass instability appears as long as the impedance is capacitive or Z; > 0. 
For an inductive impedance, the stability conditions are exchanged below and above 
transition energy. In summary, we have the following longitudinal coasting beam 
stability conditions: 


always stable 


if ZA0—> Aa, #0 pigs : - 22.59 
; 7 ea ee resistive-wall instability ( ) 
Zev stable for y > Yr or Ne < 0 
if Z=0 (inductive) unstable for y < Yr or nce > O (22.60) 
Z, > 0 stable for y < Wy or Ne > O 
(capacitive) unstable for y > Yur or Nc < 0. 


It is customary to plot the stability condition (22.56) in a (Z,, Zj)-diagram with 
ASQ, as a parameter. We solve (22.56) for the imaginary impedance Z; and get 


i= Z,_\* _ (AR 22.61 
; = sen(n.)a (55) +( 7 ) : (22.61) 


760 22 Wake Fields and Instabilities* 


Fig. 22.9 Stability diagram Z 
for a coasting monochromatic i Aw,=0 
particle beam 
Ao=1 
Aw,=3 
Aw =5 
Z. 
where 
2 
n wil 0 
= ng|nel@olo (22.62) 
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and plot the result in Fig. 22.9. Only the case 7. > 0 is shown in Fig. 22.9 noting 
that the case 7. < 0 is obtained by a 180° rotation of Fig. 22.9 about the Z;-axis. 
Figure 22.9 demonstrates that beam stability occurs only if Z, = 0 and Z, > 0. 
Knowing the complex impedance for a particular accelerator, Fig. 22.9 can be used 
to determine the rise time 1/t = A&2; of the instability. 

The rise time or growth rate of the negative-mass instability above transition is 
for a beam circulating within a perfectly conducting vacuum chamber from (22.37) 
and (22.56) 
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In this section, it was implicitly assumed that all particles have the same 
momentum and therefore, the same revolution frequency wo allowing a change 
of the revolution frequency only for those particles close to a particle density 
perturbation. This restriction to a monochromatic beam is not realistic and provides 
little beam stability for particle beams in a circular accelerator. In the following 
section, we will discuss the more general case of a beam with a finite momentum 
spread and review beam stability conditions under more realistic beam parameters. 


22.3.2 Dispersion Relation 


In the previous section, conditions for beam stability were derived based on a 
monochromatic particle beam. The rise time of the instability depends critically 
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on the revolution frequency and we may assume that the conditions for beam 
stability may change if we introduce the more realistic case of a beam with a 
finite momentum spread and therefore a finite spread of revolution frequencies. In 
Chap. 15, we discussed the mathematical tool of the Vlasov equation to describe 
collectively the dynamics of a distribution of particles in phase space. We will apply 
this tool to the collective interaction of a particle beam with its environment. 

The canonical variables describing longitudinal motion of particles are the 
azimuth @ and relative momentum error 6 = Ap/po. Neglecting radiation damping, 
the Vlasov equation is 


+4@—+58—=0, (22.64) 


where W(6,0,f) is the particle distribution. For a coasting beam with a small 
perturbation 


Y= H+ wy, leon (22.65) 
we get after insertion in (22.64) and sorting terms the relation 
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Making use of the correlation between particle momentum and revolution 


frequency, we get from (22.66) with om = aM Sea = NcWo Mo 
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Integrating the 1.h.s. of (22.67) over all momenta, we get for the perturbation current 


an 
r* W,,(5) d5 = I, . 
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At this point, it is convenient to transform from the variable 6 to the frequency w 
and obtain the particle distribution in these new variables 


W(6,0) = newoP(w, 0). (22.68) 


Performing the same integration on the r.h.s. of (22.67), we get with (22.52) and 
6 = (dE/dt)/(B?Eo) the dispersion relation [13] 
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The integration is to be taken over the upper or lower complex plane where we 
assume that the distribution function ® vanishes sufficiently fast at infinity. Trying to 
establish beam stability for a particular particle distribution, we solve the dispersion 
relation for the frequency w, or frequency shift Aw, = @, —nq@ which is in general 
complex. The real part causes a shift in the frequency while the imaginary part 
determines the state of stability or instability for the collective motion. 

For example, it is interesting to apply this result to the case of a coasting beam 
of monochromatic particles as discussed in the previous section. Let the particle 
distribution be uniform in @ and a delta function in energy. In the dispersion relation, 
we need to take the derivative with respect to the revolution frequency and set 
therefore 


— = ——§(w— ap). (22.70) 
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Insertion into (22.69) and integration by parts results in 
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which is identical to the earlier result (22.56) in the previous section. Application 
of the Vlasov equation therefore gives the same result as the direct derivation of the 
negative-mass instability conditions as it should be. 

We may now apply this formalism to a beam with finite momentum spread. In 
preparation to do that, we note that the integrand in (22.69) has a singularity at 
@ = @,/n which we take care of by applying Cauchy’s residue theorem for 
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The dispersion relation (22.69) then assumes the form 
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where P.V. indicates that only the principal value of the integral be taken. 

The solutions of the dispersion function depend greatly on the particle distri- 
bution in momentum or revolution-frequency space. To simplify the expressions, 
we replace the revolution frequency by its deviation from the reference value [14]. 
With 2S being the full-width half maximum of the particle momentum distribution 
(Fig. 22.10), we define the new variables 
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Fig. 22.10 Particle 


distribution f (x) 
fwhm = 2S 
x 
-1 1 

In these variables the particle distribution becomes 

21S 

fQ@) = =?) (22.75) 
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which is normalized to f(+1) = 5f(0) and f f(x)dx = 1. The full momentum 
spread at half maximum intensity is 


A 2 
ca (22.76) 
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It is customary to define parameters U, V by 
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and the dispersion relation becomes finally with this 
1=-(V+i10U)/, (22.79) 
where the integral 
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For a particular accelerator all parameters in (22.79) are known, at least in prin- 
ciple, and we may determine the status of stability or instability for a desired beam 
current Jy by solving for the generally complex frequency shift Aw. The specific 
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Fig. 22.11 Particle l/r 
distribution in momentum f(x) 
space 


boundary of stability depends on the actual particle distribution in momentum. 
Unfortunately, (22.79) cannot be solved analytically for an arbitrary momentum 
distribution and we will have to either restrict our analytical discussion to simple 
solvable distributions or to numerical evaluation. 

For reasonable representations of real particle distributions in an accelerator a 
central region of stability can be identified for small complex impedances and finite 
spread in momentum. Regions of stability have been determined for a number 
of particle distributions and the interested reader is referred for more detailed 
information on such calculations to references [15-19]. 

As an example, we use a simple particle distribution (Fig. 22.11) 
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and evaluate the dispersion relation (22.79). The integral in (22.80) becomes now 
after integration by parts 
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exhibiting a new singularity at x = i while the integration path still excludes the 
other singularity at x = x,;. Applying the residue theorem 
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The second term in (22.80) is 
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and the dispersion relation (22.79) becomes 


jf = 4 nh (22.86) 
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We solve this for (x, — i)? and get 


2x1 
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For a small beam current ig, we get x, ~ i and the second term in the square 
bracket becomes approximately 1/2. Recalling the definition (22.74) for x1, we get 


from (22.87) 
AQ = inS + ,/3n*S?(U -iV), (22.88) 


where from (22.76) § = 5|ncl@oAp/Po- The significant result in (22.88) is the fact 
that the first term on the right-hand side has a definite positive sign and provides 
therefore damping which is called Landau damping [20]. 

Recalling the conditions for the negative-mass instability of a monochromatic 
beam, we did not obtain beam stability for any beam current if Z, « V = 0 and 
the reactive impedance was inductive or Z; « U < 0. Now with a finite momentum 
spread in the beam we get in the same case 


AQ gia, =i1N8 Eig) areS 10), (22.89) 


where S?|U| is independent of the momentum spread. We note that it takes a 
finite beam current (U « Jy) to overcome Landau damping and cause instability. 
Of course Landau damping is proportional to the momentum spread S and does not 
occur for a monochromatic beam. Equation (22.88) serves as a stability criterion 
for longitudinal coasting-beam instabilities and we will try to derive a general 
expression by writing (22.88) in the form 


AX =inS+ Va-—ib (22.90) 


and get after evaluating the square root 


aa =inss (F454), (22.91) 


where r = Ja’ + b?. Beam stability occurs for Im{A@} > 0 or 


Ws? = 1 5 . (22.92) 
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which is in more practical quantities recalling the definition (22.76) for S 


Ap\? 3 I Z\. & 
Po 2n BEo|ne| \ non 


We may solve (22.93) for the impedance and get an equation of the form 


1 
2 
Z, = AZ; aA (22.94) 
which is shown in Fig. 22.12. 

Any combination of actual resistive and reactive impedances below this curve 
cause beam instability for the particle distribution (22.81). We note the significant 
difference to Fig. 22.9 where the impedance had to be purely positive and reactive 
to obtain beam stability. 

Other momentum distributions like f(x) o« (1 — x*)” lead to similar results 
[15] although the stability curves allow less resistive impedance than the distribu- 
tion (22.81). As a safe stability criterion which is true for many such distributions 
including a Gaussian distribution we define the area of stability by a circle with a 
radius R = Zi|z=0 = 1/(4A). With this assumption, the stability criterion for the 
longitudinal microwave instability is 


Z 2Eolnel ( Ap\* 
|Z. age olNe| ( 2) (22.95) 
n qo \ po 
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where the form factor F = z/3 for the distribution (22.81) and is generally of 
the order of unity for other bell shaped distributions. The criterion (22.95) has been 
derived by Keil and Schnell [21] and is known as the Keil-Schnell stability criterion. 
For a desired beam current and an allowable momentum spread an upper limit for 
the normalized impedance can be derived. 

The impedance seen by the particle beam obviously should be minimized 
to achieve the highest beam-beam currents. A large value of the momentum 
compaction is desirable here to increase the mixing of the revolution frequencies as 
well as a large momentum spread to achieve high beam currents. A finite momentum 
spread increases beam stability where there was none for a monochromatic coasting 
beam as discussed earlier. This stabilization effect of a finite momentum spread is 
called Landau damping. 


22.3.3 Landau Damping 


In previous sections, we repeatedly encountered a damping phenomenon associated 
with the effect of collective fields on individual particle stability. Common to the 
situations encountered is the existence of a set of oscillators or particles, under the 
influence of an external driving force. Particularly, we are interested in the dynamics 
when the external excitation is caused by the beam itself. Landau [20] studied this 
effect first and later Hereward [22] formulated the theory for application to particle 
accelerators. 

We consider a bunch of particles where each particle oscillates at a different 
frequency 2, albeit within a small range of frequencies. The equation of motion for 


each oscillator under the influence of the force F e~' is 


i+ 2?u= Fe (22.96) 
and the solution 
eet 1 1 
u=F _ : (22.97) 
2w 2Q-wo LR+o 


Folding this solution with the distribution function of particles in frequency space 


y(o) = —— (22.98) 
b 


one obtains the center of mass amplitude of the bunch 


_ em POT (2) ¥(@) 
u os [| a | a2 (22.99) 
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or with y(2) = w(-2) and f°, 5 we) dQ = os ee se die 


—iwt [oe] Q 
=e AE) ae (22.100) 
oO Jo 2-w 


Here we apply again Cauchy’s residue theorem and get 


WO) so], 


—-W 


—iwt 


u=F 
o 


[rinv(o) Pv. f (22.101) 


The derivation up to here appears quite abstract and we pause a moment to reflect 
on the physics involved here. We know that driving an oscillator at resonance leads 
to infinitely large amplitudes and that is what the mathematical formulation above 
expresses. However, we also know that infinite amplitudes take an infinite time to 
build up and the solutions gained above describe only the state after a long time. 
The same result can be obtained in a physical more realistic way if we apply the 
excitation at time t¢ = 0 and look for the solution at t — oo as has been shown 
by Hofmann [23]. As an added result of this time evolution derivation, we obtain 
the correct sign for the residue which we have tacitly assumed to be negative, but 
mathematically could be of either sign. 

To understand the damping effect, we calculate the velocity iu and get 
from (22.101) 


u= —iwu 
= Fei eax —iP.V. [3 4 a2 : (22.102) 


The bunch velocity is in phase with the external force for the residue term 
allowing extraction of energy from the external force. The principal value term, 
on the other hand, is out of phase and no work is done. If, for example, the external 
force is due to a transverse wake field generated by a bunch performing coherent 
betatron oscillations, the described mechanism would extract energy from the wake 
field thus damping the coherent betatron oscillation. The question is where does the 
energy go? 

For this, we study the time evolution of the solution for the inhomogeneous 
differential equation (22.96) in the form 


F 
u= asin 22t + auf. (22.103) 

At time t = O we require that the amplitude and velocity of the bunch motion be 
zero u(t = 0) = O and u(t = 0) = 0. The oscillation amplitude 


a) F 
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and the final expression for the solution to (22.96) is for 2 4 w 
F ; oO. 
ugdze(t) = Roo? (sin ot — re sin 21) . (22.105) 
Close to or at resonance §2 = w + A and (22.105) becomes 


F in wt 
Ma~olt) = — 5 (sos or - ne (22.106) 


The oscillation amplitude of particles at resonance grows continuously with time 
while the width of the resonance shrinks like 1/t thus absorbing energy linear 
in time. This Landau damping depends critically on the resistive interaction with 
the wake fields or external forces and is mathematically expressed by the residue 
term. This residue term, however, depends on the particle density at the excitation 
frequency w and is zero if the particle distribution in frequency space does not 
overlap with the frequency w. For effective Landau damping to occur such an 
overlap is essential. 


22.3.4 Transverse Coasting-Beam Instability 


Particle beams travelling off center through vacuum chamber sections can induce 
transverse fields which act back on the beam. We express the strength of this 
interaction by the transverse coupling impedance. In terms of the transverse 
coupling impedance, the force is 


qZIou 


Fi =i — 
2nR 


(22.107) 


where Jp is the beam current, u the transverse beam displacement, Z, / (27R) 
the average transverse coupling impedance and 27R the ring circumference. The 
equation of motion is then 


as qZ1 1p 7 


t+ V405 u= (u+ i) (22.108) 


2nRmy 


with u the betatron oscillation amplitude of an individual particle and wu the 
amplitude of the coherent bunch oscillation. Since the perturbation is linear in the 
amplitudes, we expect tune shifts from the perturbations. The incoherent tune shift 
due to individual particle motion will be incorporated on the |.h.s. as a small tune 
shift 


. cqZiIo 
6vo = i——_.. 22.109 
me : An Vo@oEo ( ) 
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The transverse impedance is generally complex and we get therefore from the 
real part of the coupling impedance a real tune shift while the imaginary part leads 
to damping or antidamping depending on the sign of the impedance. The imaginary 
frequency shift is equal to the growth rate of the instability and is given by 


qRe{Z_} Io 


= : (22.110) 
4a Rmy po 


u = Im{w} = 
T 


For a resistive transverse impedance, we observe therefore always instability 
known as the transverse resistive-wall instability. 

Similar to the case of a longitudinal coasting beam, we find instability for any 
finite beam current just due to the reactive space-charge impedance alone, and 
again we have to rely on Landau damping to obtain beam stability for a finite 
beam intensity. To derive transverse stability criteria including Landau damping, 
we consider the coherent tune shift caused by the coherent motion of the whole 
bunch for which the equation of motion is 


it + Who U = 2vpm[U + (1+iV]i, (22.111) 


where 


ZL 
pinsay a4 


(22.112) 
4m vo ko 


The coherent beam oscillation must be periodic with the circumference of the 
ring and is of the form # = ie'“"®-®9, As can be verified by back insertion the 
solution of (22.111) is 


2V0Wo 


=(|U 1+i)V 5 
y [ ail + i) SP a 


(22.113) 

Now we must fold (22.113) with the distribution in the spread of the betatron 
oscillation frequency. This spread is mainly related to a momentum spread via 
the chromaticity and the momentum compaction. The distribution (5) where 
5 = Ap/po, is normalized to unity { y(5)dé = 1 and the average particle position 
is u = [ uy(5)d6. The dispersion relation is then with this from (22.113) 


2vowow (5)déd 


v2@a — (nw — @)? 


1=([U+( ivf (22.114) 


or simplified by setting vj ~ vo and ignoring the fast wave (n + v)a@o [24] 


l= w+atomf rs vie 


——— 22.115 
— (n= vo) @o 
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This is the dispersion relation for transverse motion and can be evaluated for 
stability based on a particular particle distribution in momentum. As mentioned 
before, the momentum spread transforms to a betatron oscillation frequency spread 
by virtue of the momentum compaction 


Avg = vpoA@o = Vvgoncd@o (22.116) 
and by virtue of the chromaticity 
Avg = &,6. (22.117) 


Landau damping provides again beam stability for a finite beam current and 
finite coupling impedances, and the region of stability depends on the actual particle 
distribution in momentum. 


22.4 Longitudinal Single-Bunch Effects 


The dynamics in bunched particle beams is similar to that of a coasting beam with 
the addition of synchrotron oscillations. The frequency spectrum of the circulating 
beam current contains now many harmonics of the revolution frequency with 
sidebands due to betatron and synchrotron oscillations. The bunch length depends 
greatly on the interaction with the rf-field in the accelerating cavities but also with 
any other field encountered within the ring. It is therefore reasonable to expect that 
wake fields may have an influence on the bunch length which is know as potential 
well distortion. 


22.4.1 Potential-Well Distortion 


From the discussions on longitudinal phase space motion in circular accelerators, it 
is known that the particle distribution or bunch length depends on the variation in 
time of the rf-field interacting with the beam in the accelerating cavities. Analogous, 
we would expect that wake fields may have an impact on the longitudinal particle 
distribution. Pellegrini and Sessler [25] For a particular wake field, we have studied 
this effect in Chap. 15 recognizing that a bunch passing through an rf-cavity causes 
beam loading by exciting fields at the fundamental frequency in the cavity. These 
fields then cause a modification of the bunch length. In this section, we will expand 
on this thought and study the effect due to higher-order mode wake fields. 

To study this in more detail, we ignore the transverse particle distribution. The 
rate of change in the particle momentum can be derived from the integral of all 
longitudinal forces encountered along the circumference and we set with 6 = dp/po 


dé qF(t) 
dt  BEoTo’ 


(22.118) 
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where qgF (rt) is the sum total of all acceleration and energy losses of a particle at a 
position z = Bct from the bunch center or reference point over the course of one 
revolution and Tp is the revolution time. The change of t per unit time depends on 
the momentum compaction of the lattice and the momentum deviation 


ar (22.119) 
a oe ; 


Both equations can be derived from the Hamiltonian 


_ f' Fo ,. 
0 B°EoTo 


H = —5n.8° (22.120) 


For an electron ring and small oscillation amplitudes, we have 


OV 


gF(t) = V(t; + t) — U(E) + qVw(t) = a7 


t+qVw(t), (22.121) 


Ts 


where we ignored radiation damping and where V,,(t) describes the wake field. In 
the last form, the equation is also true for protons and ions if we set the synchronous 
time t; = O. Inserting (22.121) into (22.120) and using the definition of the 
synchrotron oscillation frequency (9.35) we get the new Hamiltonian 


Q2 * gV4(2) 
ee ee Ww ae 22.122 
2m * Ne . 0 B?EoTo ‘ 


Synchrotron Oscillation Tune Shift 


First we use the Hamiltonian to formulate the equation of motion and determine the 
effect of wake fields on the dynamics of the synchrotron motion. The equation of 
motion is from (22.122) 


27 22,Vy 


—__* (22.123) 
WohV,¢| Cos Ws0| 


t+ Q2,t = sign(n) 


where we have made use of the definition of the unperturbed synchrotron oscillation 
frequency (2,9. We express the wake field in terms of impedance and beam spectrum 


V(t) =— / “ Z)(@) I(t, @) e do , (22.124) 


and use (21.85) for 


co 


Vu(t) = —Io >> Zy(p) ¥(p) oP, (22.125) 


p=—0o 
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where /, is the bunch current and 
+00 
W(p) -| Jo(pamot) P(t, t) dz. 
—Co 


The maximum excursion t during phase oscillation is much smaller than the 
revolution time and the exponential factor 


Pr we 1 + ipwot — tp?wot? + O(3) (22.126) 


can be expanded. After insertion of (22.120), (22.121) into (22.123) the equation of 
motion is 


T+ Q40% (22.127) 
lee) 


» Z\(p)¥(p) (1 — ipwot — 4p?@5t”) . 


=—oo 


2nlyQ2, 


—sign(7) aohV,r| cos Wso| PD 


The first term in the factor (1 — ipWot — 5p°wat) is independent of t and causes a 
synchronous phase shift due to resistive losses 


Aws = sgn( te)” YE Relzy(P)} HP): (22.128) 


Vie| COS We0 ae 

For a resistive positive impedance, for example, the phase shift is negative above 
transition indicating that the beam requires more energy from the rf-cavity. By 
measuring the shift in the synchronous phase of a circulating bunch as a function 
of bunch current, it is possible to determine the resistive part of the longitudinal 
impedance of the accelerator. To do this one may fill a small amount of beam in the 
bucket upstream from the high intensity bunch and use the signal from the small 
bunch as the time reference against which the big bunch will shift with increasing 
current. 

The second term in (22.127) is proportional to t and therefore acts like a focusing 
force shifting the incoherent synchrotron oscillation frequency by 


i 1, 82s o 
en OEE | cosh Im{Z W(p). (22.129) 
ahi RV cos Wo pe tZ(P)} PY (P) 


=—0o0 

The real part of the impedance is symmetric in p and therefore cancels in 
the summation over p which leaves only the imaginary part consistent with the 
expectation that the tune shift be real. At this point, it becomes necessary to 
introduce a particular particle distribution and an assumption for the impedance 
spectrum. For long bunches, the frequencies involved are low and one might use for 
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the impedance the space charge and broad-band impedance which both are constant 
for low frequencies. In this case, the impedance can be extracted from the sum 
in (22.129) and the remaining arguments in the sum depend only on the particle 
distribution. 

For a parabolic particle distribution, for example, (22.129) reduces to [26] 


161, im § ZI?) 


AQ, = — 6 
; sen( Bh Vielcoswol | p 


(22.130) 


where B is the bunching factor B = £/(27R) with ¢ the effective bunch length. 

A measurement of the incoherent synchrotron tune shift as a function of bunch 
current allows the determination of the reactive impedance of the accelerator for 
a given particle distribution. This tune shift is derived from a measurement of the 
unperturbed synchrotron frequency {2,9 for a very small beam current combined 
with the observation of the quadrupole mode frequency £22, as a function of bunch 
current. The incoherent tune shift is then 


AQs incoh =e ($23. = 250) , (22.131) 


where jz is a distribution dependent form factor of order 2 for a parabolic distribution 
[27]. 

The third and subsequent terms in (22.127) contribute nonlinear effects making 
the synchrotron oscillation frequency amplitude dependent similar to the effects of 
nonlinear fields in transverse beam dynamics. 


Bunch Lengthening 


A synchrotron frequency shift is the consequence of a change in the longitudinal 
focusing and one might expect therefore also a change in the bunch length. In first 
approximation, one could derive expressions for the new bunch length by scaling 
with the synchrotron tune shift. Keeping the phase space area constant in the proton 
and ion case or keeping only the energy spread constant in the electron case, a rough 
estimate for bunch lengthening can be obtained for a specific particle distribution. 
Since the electron bunch length scales inversely proportional to the synchrotron 
frequency, we have 


ov 225 AQ; 


enya oe eae (22.132) 
Ovo 2 29 


From (22.132), one can determine for an electron beam the potential-well bunch 
lengthening or shortening, depending on the sign of the reactive impedance. For a 
proton or ion beam, the scaling is somewhat different because of the preservation of 
phase space. 
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This approach to understanding potential-well bunch lengthening assumes that 
the particle distribution does not change which is an approximate but not correct 
assumption. The deformation of the potential well is nonlinear and can create 
significant variations of the particle distribution specifically, for large amplitudes. 

In this discussion, we determine the stationary particle distribution w(t, 5) under 
the influence of wake fields by solving the Vlasov equation 


OW OW OW 
of ae Oe 


0. (22.133) 


For a stationary solution, ae = 0 and therefore any function of the Hamiltonian 
is a solution of the Vlasov equation. Since the Hamiltonian does not exhibit 
explicitly the time variable, any such function could be the stationary solution which 
we are looking for and we set therefore w(t, 5) = w(H). The local particle density 


is then after integrating over all momenta 


Mx) = a w(H) db, (22.134) 


where Np is the number of particles per bunch or with (22.122) 


“ Qs * Val) 4-\ 43 
(t) ms fv 506 aca Ses at) a (22.135) 


Without wake fields, the distribution of an electron beam is Gaussian and the 
introduction of wake fields does not change that for the energy distribution. We 
make therefore the ansatz 


H ie 
w(t, 6) =A exp x ) = As exp (5 Ay A(t), (22.136) 
cO on 


cvs 5 


where As and A, are normalization factors for the respective distributions. Integrat- 
ing over all momenta, the longitudinal particle distribution is finally 


is q ne as 
A(t) = Ny Ay exp (-15 - 7 BPEsToo? i V(t) az) : (22.137) 


where we used 05 = 2,90,/|n-| from (13.26). A self-consistent solution of this 
equation will determine the longitudinal particle distribution under the influence of 
wake fields. Obviously, this distribution is consistent with our earlier results for an 
electron beam in a storage ring, in the limit of no wake fields. The nature of the 
wake fields will then determine the distortion from the Gaussian distribution. 
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As an example, we assume a combination of an inductive (L) and a resistive (R) 
wake field 


di 
Vy = rr —Ri. (22.138) 


Such a combination actually resembles rather well the real average impedance 
in circular accelerators at least at lower frequencies as evidenced in the impedance 
spectrum of the SPEAR storage ring shown in Fig. 19.9. Inserting (22.138) 
into (22.137) while setting for a moment the resistance to zero (R = 0) we get 
after integration the transcendental equation 


2 oh 
q a | (22.139) 


it 
A(t) = Np Aa exp | 262 1 B2EyTooe 
which must be solved numerically to get the particle distribution A(t). We note 
that the inductive wake does not change the symmetry of the particle distribution 
in t. For large values of t, the particle distribution must approach zero to meet 
the normalization requirement (lim;-+99 A(t) = 0) and the particle distribution is 
always Gaussian for large amplitudes. The effect of the inductive wake field is 
mainly concentrated to the core of the particle bunch. 
Evaluating numerically (22.139), we distinguish between an electron beam and 
a proton or ion beam. The momentum spread o¢ in case of an electron beam is 
determined by quantum effects related to the emission of synchrotron radiation and 
is thereby for this discussion a constant. Not so for proton and ion beams which are 
subject to Liouville’s theorem demanding a strong correlation between bunch length 
and momentum spread such that the longitudinal phase space of the beam remains 
constant. Equation (22.139) has the form 


f(t) = K exp[—3° —f(0] (22.140) 
or after differentiation with respect to ¢ 


df tf 


- ; (22.141) 
dt 1+f(@ 
For strong wake fields f (t) >> 1 and (22.141) can be integrated for 
f@O=fo-4$r. (22.142) 


The particle distribution in the bunch center assumes more and more the shape of 
a parabolic distribution as the wake fields increase. Figure 22.13 shows the particle 
distribution for different strengths of the wake field. 

Now we add the resistive wake field component. This field actually extracts 
energy from the bunch and therefore one expects that the whole bunch is shifted 
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Fig. 22.13 Potential-well distortion of Gaussian particle distributions (a) for an inductive wake 
field, and (b) for a combination of an inductive and a resistive wake field 


such as to compensate this extra loss by moving to a higher field in the accelerating 
cavities. Inserting the full wake field (22.138) into (22.137) results in the distribution 


2 T 
A(x) = Ny Ay exp |-45 — aLN,A(t) — aRN, / ue) at (22.143) 
0 


2 
T 


where 


Ge 


a= ———_... (22.144) 
Ne BEoToos 

Looking for a shift of the tip of the particle distribution, we get from dA /dt = 0 the 

location of the distribution maximum 


Tmax & NoA (Tmax) : (22.145) 


The maximum of the particle distribution is therefore shifted proportional to the 
bunch intensity and the general distortion is shown in Fig.22.13b for a resistive 
wake much larger than generally encountered in an accelerator. The distortion of the 
particle distribution leads to a deviation from a Gaussian distribution and a variation 
of the bunch length. In the limit of a strong and inductive wake field, for example, 
the full-width half maximum value of the bunch length increases like 


go, |BLN»A(t) 
Bos NcEoTo 


Tfwhm = or fo = (22.146) 


The bunch length changes as the bunch intensity is increased while the sign and 
rate of change is dependent on the actual ring impedance spectrum on hand. We have 
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used an induction as an example for the reactive impedance in a ring because it most 
appropriately represents the real impedance for lower frequencies or longer bunch 
length. In general, this potential-well bunch lengthening may be used to determine 
experimentally the nature and quantity of the ring impedance by measuring the 
bunch length as a function of bunch current. 


Turbulent Bunch Lengthening 


At higher bunch currents the bunch lengthening deviates significantly from the 
scaling of potential well distortion and actually proceeds in the direction of true 
lengthening. Associated with this lengthening is also an increase in the particle 
momentum spread. The nature of this instability is similar to the microwave 
instability for coasting beams. 

Considering long bunches, a strong instability with a rise time shorter than 
the synchrotron oscillation period and high frequencies with wavelength short 
compared to the bunch length, we expect basically the same dynamics as was 
discussed earlier for a coasting beam. This was recognized by Boussard [28] who 
suggested a modification of the Keil-Schnell criterion by replacing the coasting- 
beam particle density by the bunch density. For a Gaussian particle distribution, the 
peak bunch current is 


2nR 
V 20% j 


where Jp is the average circulating beam current per bunch, and the bunch length is 
related to the energy spread by 


1=Ip (22.147) 
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With these modifications, the Boussard criterion is 


Z *Folnel? [2 \° 
| <p B Folnel (+) (22.149) 
n GloV 21 Vs0 Eo 


where the form factor F is still of the order unity. 

As a consequence of this turbulent bunch lengthening we observe an increase of 
the energy spread as well as an increase of the bunch length. The instability does 
not necessarily lead to a beam loss but rather to an adjustment of energy spread and 
bunch length such that the Boussard criterion is met. For very low beam currents 
the stability criterion is always met up to a threshold where the r.h.s. of (22.149) 
becomes smaller than the I.h.s. Upon further increase of the beam current beyond 
the threshold current the energy spread and consequently the bunch length increases 
to avoid the bunched beam microwave instability. 
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22.5 Transverse Single-Bunch Instabilities 


Transverse wake fields can also greatly modify the stability of a single bunch. 
Specifically at high frequencies, we note an effect of transverse wake fields 
generated by the head of a particle bunch on particles in the tail of the same bunch. 
Such interaction occurs for broad-band impedances where the bunch generates a 
short wake including a broad spectrum of frequencies. In the first moment all these 
fields add up being able to act back coherently on particles in the tail but they 
quickly decoher and vanish before the next bunch arrives. This effect is therefore a 
true single-bunch effect. In order to affect other bunches passing by later, the fields 
would have to persist a longer time which implies a higher Q value of the impedance 
structure which we ignore here. 


22.5.1 Beam Break-Up in Linear Accelerators 


A simple example of a transverse microwave instability is the phenomenon of beam 
break-up in linear accelerators. We noted repeatedly that the impedance of vacuum 
chambers originates mainly from sudden changes in cross section which therefore 
must be avoided to minimize impedance and microwave instabilities. This, however, 
is not possible in accelerating cavities of which there are particularly many in a 
linear accelerator. Whatever single-pass microwave instabilities exist they should 
become apparent in a linear accelerator. We have already discussed the effect of 
longitudinal wake fields whereby the fields from the head of a bunch act back as a 
decelerating field on particles in the tail. In the absence of corrective measures we 
therefore expect the particles in the tail to gain less energy than particles in the head 
of an intense bunch. 

Transverse motion of particles is confined to the vicinity of the linac axis by 
quadrupole focusing in the form of betatron oscillations while travelling along the 
linear accelerator. However, coherent transverse betatron oscillations can create 
strong transverse wake fields at high bunch intensities. Such fields may act back 
on subsequent bunches causing bunch to bunch instabilities if the fields persist 
long enough. Here we are more interested in the effect on the same bunch. For 
example, the wake fields of the head of a bunch can act back on particles in the 
tail of the bunch. This interaction is effected by broad-band impedances like sudden 
discontinuities in the vacuum chamber which are abundant in a linear accelerator 
structure. The interaction between particles in the head of a bunch on particles in the 
tail of the same bunch can be described by a two macro particle model resembling 
the head and the tail. 

Transverse wake fields are proportional to the transverse oscillation amplitude 
of the head and we describe the dynamics of the head and tail of a bunch in a two 
particle model where each particle represents half the charge of the whole bunch as 
shown in Fig. 22.14. 


780 22 Wake Fields and Instabilities* 


particle bunch 


Fig. 22.14 Head-tail dynamics of a particle bunch represented by two macroparticles 


The head particle with charge SqNp performs free betatron oscillations while the 
tail particle responds like a driven oscillator. Since all particles travel practically 
at the speed of light, the longitudinal distribution of particles remains fixed along 
the whole length of the linear accelerator. The equations of motion in smooth 
approximation where kg = 1/(vo B,) and B, is the average value of the betatron 
function in the plane u, are for both macroparticles 


4+ ks x =0, 
7 42 mn * z roNyW (22.150) 
x + kp xe = re™ fe A@)Wie—Zdz= =P, 
where we use the indices h and t for the head and tail particles respectively and 
introduce the average wake field per unit length 
2 WL 


Ww. = ‘ 22.151 
a Lox ( ) 


For simplicity, it was assumed in (22.150) that the beam is just coasting along 
the linear accelerator to demonstrate the dynamics of the instability. If the beam is 
accelerated the adiabatic damping effect through the increase of the energy must be 
included. 

Because of causality only the tail particle is under the influence of a wake field. 
The transverse wake field W ,(20,), for example, which is shown in Fig. 22.8, 
is to be taken at a distance 20, behind the head particle. Inserting the solution 
Xn(z) = Xp cos kgz into the second equation, we obtain the solution for the betatron 
oscillation of the tail particle in the form 


reNpW1. 


4ykp 


Xt(Z) = X_ cos kgz + Xn zsinkpz. (22.152) 

The second term in this expression increases without bound leading to particle 
loss or beam break-up as soon as the amplitude reaches the edge of the aperture. If 
the bunch does reach the end of the linear accelerator of length Lacc, the betatron 
oscillation amplitude of the tail has grown by a factor 


% — reNyW1 Lace 
jercg pac Le (22.153) 
Xh Aykp 
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One consequence of this instability is an apparent increase in beam emittance 
long before beam loss occurs. A straight bunch with a small cross section becomes 
bent first like a banana and later like a snake and the transverse distribution 
of all particles in the bunch occupies a larger cross-sectional area than before. 
This increase in apparent beam size has a big detrimental effect on the attainable 
luminosity in linear colliders and therefore must be minimized as much as possible. 
The two particle model adopted here is insufficient to determine a more detailed 
structure than that of a banana. However, setting up equations similar to (22.150) 
for more than two macroparticles will start to reveal the oscillatory nature of the 
transverse bunch perturbation. 

One scheme to greatly reduce the beam break-up effect is called BNS damping in 
reference to its inventors Balakin et al. [29] and has been successfully implemented 
into the Stanford Linear Collider [30]. The technique utilizes the fact that the 
betatron oscillation frequency depends by virtue of the chromaticity on the energy 
of the particles. By accelerating the bunch behind the crest of the accelerating field 
the tail gains less energy than the head. Therefore the tail is focused more by the 
quadrupoles than the head. Since the transverse wake field introduces defocusing 
this additional chromatic focusing can be used for compensation. 

Of course this method of damping the beam break-up by accelerating ahead of the 
crest is counter productive to compensating for the energy loss of tail particles due to 
longitudinal wake fields. In practice, BNS damping is applied only at lower energies 
where the instability is strongest and in that regime the energy reducing effect of the 
longitudinal wake field actually helps to maximize BNS damping. Toward the end 
of the linear accelerator at high beam energies, the beam break up effect becomes 
small (« 1/y) and the bunch is now moved ahead of the crest to reduce the energy 
spread in the beam. 


22.5.2 Fast Head-Tail Effect 


Transverse bunch perturbations due to broad-band impedances are not restricted 
to linear accelerators but occur also in circular accelerators. In a circular proton 
accelerator, for example, the “length” is for all practical purposes infinite, there 
is no radiation damping and therefore even weak transverse wake fields can in 
principle lead to transverse bunch blow up and beam loss. This instability is known 
as the fast head-tail instability or strong head-tail instability and has been first 
discussed and analyzed by Kohaupt [31]. The dynamics in a circular accelerator 
is, however, different from that in a linear accelerator because particles in the head 
of a bunch will not stay there but rather oscillate between head and tail in the course 
of synchrotron oscillations. These synchrotron oscillations disturb the coherence 
between head and tail and the instability becomes much weaker. 

On the other hand, particles in circular accelerators and especially in storage 
rings are expected to circulate for a long time and even a much reduced growth rate 
of the transverse bunch blow up may still be too strong. The dynamics of interaction 
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is similar to that in a linear accelerator at least during about half a synchrotron 
oscillation period (3%), but during the next half period the roles are interchanged 
for individual particles. Particles travelling for one half period in the head of the 
bunch find themselves close to the tail for the next half period only to reach the 
head again and so forth. To understand the dynamics over many oscillations, we 
set up equations of motion for two macroparticles resembling the head and tail of 
a particle bunch similar to (22.150), but we now use the time as the independent 
variable. The distance ¢ between head and tail particle varies between 0 and the 
maximum distance of the two macro particles 2 during the course of a synchrotron 
oscillation period and since the transverse wake field increases linearly with ¢, we 
set W,(¢) = W_ (202) sin 2,t. With this the equations of motion are for 0 < t < 
t,/2 


oe ih 

x + @px =0, 

rc B2c2Nb W, (207) sin Qet - 
2y I> 


(22.154) 


Xo + WH x2 = 


where W, = W. '/(2R) is the wake function per unit length. For the next half 
period t,/2<t<t, 


rcB2c2NpW | (20) sin Bt», 


‘ Oe 
pele 2y (22.155) 


Xy + w% X2 =0. 


For further discussions we consider solutions to (22.154), (22.155) in the form 
of phasors defined by 


x(t) = x(0) el" = x-i—. (22.156) 
wp 


The first Eq. (22.154) can be solved immediately for 
x1(t) = x1(0) el" (22.157) 
and the second Eq. (22.154) becomes with (22.157) 
Xo+ OF X. = Asin Qt eF" x, (0) : (22.158) 
where 


reB2c2NpW 1 (22) 
2y ; 


A= (22.159) 
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The synchrotron oscillation frequency is generally much smaller than the 
betatron oscillation frequency (2 < wp) and the solution of (22.159) becomes 
with this approximation 


: 1 Saal 
X2(f) = x2(0) cP! + me / [A x; (0) sin 2.1 e'6"] sin wg(t — 1) dr’ 
B JO 


or after some manipulation 
xX2(t) = x2(0) e'* — ix1(0)4a(1 — cos 2,1) e”?", (22.160) 


where a = A/(wg 92). During the second half synchrotron oscillation period, the 
roles of both macroparticles are exchanged. We may formulate the transformation 
through half a synchrotron oscillation period in matrix form and get with | — 
cos (Q.5ts) = 2 since 2,41, ~ x for the first half period 


x1(t;/2) iwpts/2( 1 0) (x10) 

= 8 22.161 

(BG ey a ~ia 1) \x2(0) ee 

and for the second half period 

X1 (ts) iwpts/2 1 —ia X1(ts/2) 

_ ; 22.162 

Cyaan tre) Gas ee 

Combining both half periods one gets finally for a full synchrotron oscillation period 
X1 (ts) iopt, { 1—- a’ —ia\ ( x\(0) 

= 8 ; 22.163 

Go) ee aa 


The stability of the motion after many periods can be extracted from (22.163) by 
solving the eigenvalue equation 


2_3 
es ‘) ()=-4(2) (22.164) 
—ia 1 x2 x2 
The characteristic equation 
M-QG=-P)A4+1=0 (22.165) 


has the solution 


eee eee fd 40)?=1 (22.166) 
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and the eigenvalues can be expressed by 


A= etl? (22.167) 
where (1 — 5a’) = cos @ for |a| < 2 or 


reB?C7Np WL (20) 29 


= 22.168 
la| Byes ( ) 


The motion remains stable since no element of the transformation matrix 
increases unbounded as the number of periods increases to infinity. In the form of a 
stability criterion, the single-bunch current J, = gNp frey must not exceed the limit 


4 qy@avpvs 


bx 7 BeW (20) ; (22.169) 


where gq is the charge of the particles and (vg, v,) the betatron and synchrotron tune, 
respectively. In a storage ring, it is more convenient to use impedance rather than 
wake fields. Had we set up the equations of motion (22.150), (22.151) expressing the 
perturbing force in terms of impedance we would get the same results but replacing 
the wake field by 


W1 (20) = 1m{Z} (22.170) 


and the threshold beam current for the fast head-tail instability becomes 


2 so (22.171) 
le Bclm{ +} 

The bunch current /, is a threshold current which prevents us from filling more 
current into a single bunch. Exceeding this limit during the process of filling a bunch 
in a circular accelerator leads to an almost immediate loss of the excess current. 
This microwave instability is presently the most severe limitation on single-bunch 
currents in storage rings and special care must be employed during the design to 
minimize as much as possible the transverse impedance of the vacuum chamber 
system. 

The strength of the instability becomes more evident when we calculate the 
growth time for a beam current just by an increment € above the threshold. For 
|a| > 2 we have (1 — 5a’) = —coshy and the eigenvalue is A = e*". The phase 
jt = Oat threshold and cosh w + 1 + 5p for a = 2 + € and we get 


“=e. (22.172) 
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In each synchrotron oscillation period the eigenvalues increase by the factor e” 
or at a growth rate of 1 = 3 = dole . If, for example, the beam current exceeds 
the threshold by 10 %, we have « = 0.2 and the rise time would be t/t, = 0.89 
or the oscillation amplitudes increase by more than a factor of two during a single 
synchrotron oscillation period. This is technically very difficult to counteract by a 
feedback system. 

We have assumed that transverse wake fields are evenly distributed around 
the accelerator circumference. In a well designed accelerator vacuum chamber, 
however, most of the transverse wake field occur in the accelerating cavities and 
therefore only the transverse betatron oscillation amplitude in the cavities are 
relevant. In this case, one recalls the relation vg ~ R/B, and we replace in (22.171) 
the average value of the betatron function by the value in the cavities for 


Agqy Qs 


i < ————_. . 
reBuy W1cy(2€) 


(22,173) 


This result suggest that the betatron function in the plane uv = x or u = y at the 
location of cavities should be kept small and the synchrotron oscillation frequency 
should be large. The exchange of head and tail during synchrotron oscillation slows 
down considerably the growth rate of the instability. The result (22.173) is the same 
as the amplification factor (22.153) if we consider that in a linear accelerator the 
synchrotron oscillation period is infinite. 

As we approach the threshold current, the beam signals the appearance of the 
head-tail instability on a spectrum analyzer with a satellite below the betatron 
frequency. The threshold for instability is reached when the satellite frequency 


reaches a value @at = Wg — 5 Qs. This becomes apparent when replacing the 
transformation matrix in (22.163) by the eigenvalue 
oo) = el@pis el? ey . (22.174) 
X2(ts) x2(0) 


The phase reaches a value of ® = z at the stability limit and (22.174) becomes at 
this limit 


ea — pi(oe-3:2) 6 bey (22.175) 


¥2(ts) x2(0) 


At this point, it should be noted, however, that the shift of the betatron frequency 
to 52s is a feature of the two macro particle model. In reality there is a distribution 
of particles along the bunch and while increasing the beam current the betatron 
frequency decreases and the satellite vg — vs moves until both frequencies merge 
and become imaginary. This is the point of onset for the instability. It is this feature 
of merging frequencies which is sometimes called mode mixing or mode coupling. 
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Measurement of the Broad-Band Impedance 


As shown by Chao [9] the combined motion of head and tail represents a coherent 
transverse betatron motion which can be picked up by beam position monitors. The 
betatron frequency changes with increasing beam intensity and the initial slope can 
be used to determine the broad band impedance. From the lowest order instability 
mode we take the derivative of the betatron oscillation frequency and get 


dwg _ Os dd 
dM, 2 \dM, 
where N, is the number of electrons per bunch. Measuring the slope dwg /dN, at 


low beam intensities allows to determine the wake function Wo. From this wake the 
transverse impedance is [9] 


rec2Wo 
= —-—— (22.176) 
Np=0 l6ryRog 


R b 
ZL= —Wo (22.177) 
B.vg c 
and from (22.8) the relation to the longitudinal impedance is Z| = a al, where R 


is the average radius of the accelerator, b the typical vacuum chamber radius, vg the 
betatron frequency and f, the average value of the betatron function around the ring. 
the mode number z is the frequency in units of the revolution frequency n = w/w. 


22.5.3 Head-Tail Instability 


Discussing the fast head-tail instability we considered the effect of transverse wake 
fields generated by the head of a particle bunch on the transverse betatron motion 
of the tail. We assumed a constant betatron oscillation frequency which is only 
an approximation since the betatron frequency depends on the particle energy. On 
the other hand, there is a distinct relationship between particle energy and particle 
motion within the bunch, and it is therefore likely that the dynamics of the head-tail 
instability becomes modified by considering the energy dependence of the betatron 
oscillation frequency. 

Like in the previous section, we represent the particle bunch by two macropar- 
ticles which perform synchrotron oscillations being 180° apart in phase. The wake 
fields of the head particle act on the tail particle while the reverse is not true due to 
causality. However, during each half synchrotron oscillation period the roles become 
reversed. 

In (22.160), we obtained an expression which includes the perturbation term 
and consider the variation of this term due to chromatic oscillations of the betatron 
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frequency. The perturbation term is proportional to e'”6‘ and we set therefore with 
5 = Ap/po 


3 
i, tah) tong rs + O(82). (22.178) 


The chromaticity is defined by the betatron tune shift per unit relative momentum 
deviation 


and (22.178) becomes with wg = vg wo 
op = oo + &gda. (22.180) 


The momentum deviation is oscillating at the synchrotron frequency and is corre- 
lated with the longitudinal motion by 


Ss 


QL 
§=— sin Qgt, (22.181) 
Belne| 


where 2¢ is the maximum longitudinal distance between the two macroparticles. 
Combining (22.180), (22.181) we get 


2, £ Ep 
vpR| nel 


wg = p60 — sin (2st , (22.182) 


where the second term is much smaller than unity so that we may expand the 
exponential function of this term to get 


iwgt ty ia@got a 2 LEp . 
eh we 1-—i1—~ tsin(2,t) | . (22.183) 
vpR\ nel 


The expression in the square bracket is the variation of the scaling factor a 
in (22.160) and we note specifically, the appearance of the imaginary term which 
gives rise to an instability. The phase @ in the eigenvalue equation (22.167) becomes 
for small beam currents ® ~ a and with (22.183) 


. 2 Lg 
® =a | |-—i—=—+, |, (22.184) 
mv~gR\|nc| 
where we have set t = 4f, and (sin Q,t) ~ 2/2. The first term represents the 


fast head tail instability with its threshold characteristics discussed in the previous 
section. The second term is an outright damping or antidamping effect being 
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effective at any beam current. This instability is called the head-tail effect discovered 
and analyzed by Pellegrini [32] and Sands [33] at the storage ring ADONE. 

Considering only the imaginary term in (22.184), we note an exponential growth 
of the head-tail instability with a growth rate of 


1 Qual& _ lEgreBcN,W1 (20) 


(22.185) 


tT wvgR|ne| - 2ry|\nelvZ 


Instability may occur either in the vertical or the horizontal plane depending 
on the magnitude of the transverse wake function in both planes. There are two 
modes, one stable and one unstable depending on the sign of the chromaticity 
and momentum compaction. Above transition 7, < 0 and the beam is unstable 
for negative chromaticity. This instability is the main reason for the insertion 
of sextupole magnets into circular accelerators to compensate for the naturally 
negative chromaticity. Below transition, the situation is reversed and no correction 
of chromaticity by sextupoles is required. From (22.185), we would conclude that 
we need to correct the chromaticity exactly to zero to avoid instability by one or the 
other mode. In reality, this is not the case because a two particle model overestimates 
the strength of the negative mode. Following a more detailed discussion including 
Vlasov’s equation [9] it becomes apparent that the negative mode is much weaker 
to the point where, at least in electron accelerators, it can be ignored in the presence 
of radiation damping. 

Observation of the head-tail damping for positive chromaticities or measuring 
the risetime as a function of chromaticity can be used to determine the transverse 
wake function or impedance of the accelerator [34, 35]. Measurements of head-tail 
damping rates have been performed in SPEAR [34] as a function of chromaticity 
and are reproduced in Figs. 22.15 and 22.16. 
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Fig. 22.15 Measurement of the head-tail damping rate in SPEAR as a function of chromaticity 
(a) and beam energy (b) [34] 
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Fig. 22.16 Measurement of 1.0 
the head-tail damping rate in 
SPEAR as a function of beam | 
current [34] 0.8 + a 
0.6 — 
0.4 7 
0.2 
0.0 


We clearly note the linear increase of the damping rate with chromaticity. The 
scaling with energy and beam current is less linear due to a simultaneous change in 
bunch length. Specifically the bunch length increases with beam intensity causing 
the wake fields to drop for a smaller damping rate. 


22.6 Multi-Bunch Instabilities 


Single-bunch dynamics is susceptible to all kinds of impedances or wake fields 
whether it be narrow or broad-band impedances. This is different for multi-bunch 
instabilities or coupled-bunch instabilities [27-37]. In order for wake fields to 
generate an effect on more than one bunch it must persist at least until the next 
bunch comes by the location of the impedance. We expect therefore multi-bunch 
instabilities only due to high Q or narrow-band impedances like those encountered 
in accelerating cavities. Higher-order modes in such cavities persist some time after 
excitation and actually reach a finite amplitude in a circular accelerator where the 
orbiting beam may periodically excite one or the other mode. Because these modes 
have a high quality factor they are also confined to a narrow frequency spread. The 
impedance spectrum we need to be concerned with in the study of multi-bunch 
instabilities is therefore a line spectrum composed of many cavity modes. 

To study the effect of these modes on the circulating beam, we must fold the 
beam current spectrum with the mode spectrum and derive from this interaction 
conditions for beam stability. We will do this for the case of the two lowest order 
mode oscillations only where all bunches oscillate in synchronism at the same phase 
or are 90° out of phase from bunch to bunch respectively. Of course in a real 
accelerator higher-order modes can be present too and must be taken into account. 
Here we must limit ourself, however, to the discussion of the physical effect only and 
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direct the interested reader to more detailed discussions on this subject in references 
(3,9, 19,26]. 

We consider the dynamics of rigid coupled bunches ignoring the internal motion 
of particles within a single bunch. The beam spectrum is then from (21.84) with g 
the bunch charge and observing at g = 0 for simplicity 


iced +oo0 +00 
(0.9) = YY YE a(pnveo?) 5(@ — 2y), (22.186) 


p=—00 n=—0o 


where now 2, = (pnp +nm-+nvs) @o and where we have replaced the synchrotron 
frequency by the synchrotron tune and the phase ¢; for individual particles by the 
mode of the bunch distribution setting ¢; = mwot with 0 < m < np. A beam of ny 
equidistant bunches can oscillate in mn, different modes. Two bunches, for example, 
can oscillate in phase or 180° out of phase; four bunches can oscillate with a phase 
difference of 0°, 90°, 180°, and 270° between consecutive bunches. In general the 
order of the mode m defines the phase difference of consecutive bunches by 


° 


360 
Ag =m . (22.187) 
Np 


To determine the multi-bunch dynamics we calculate first the induced voltage 
V(t) by the beam current in the impedance Z(w) and then fold the voltage 
function with the beam function to calculate the energy loss per turn by each 
particle. Knowing this, we will be able to formulate the equation of motion for the 
synchrotron oscillation. Specifically, we will be able to formulate frequency shifts 
and damping or antidamping due to the interaction of the multi-bunch beam with its 
environment to identify conditions for beam stability. 

For simplicity we assume small phase oscillations (t < 1) and consider only 
the fundamental beam frequency and the first satellite n = 0, 1. With this (22.186) 
becomes 


+00 


» Jo(prywot) 6(@ — 20) —i Ji (prpwoTt) 6(@ — 21), 


p=—0o 


GnpWo 
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(22.188) 


where 29 = pmp@o, 2; = (pny + m+ Vs) Wo, and J; are Bessel’s functions. The 
induced voltage spectrum is V(w) = Z(q@) J(@) and its Fourier transform V(t) = 
[ V(@)ei@" dew or 


qnpWo 


+00 
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The energy loss per particle is then defined by integrating in time the product of 
voltage function and single-bunch current function 


= i; V\ (2) HEF) ay (22.190) 
Np Nb 


Np is the number of particles per bunch and T, = 7o/np the time between passage 
of consecutive bunches. The bunch current can be expanded fort < 1 


T(t + Am) +e <H(0). (22.191) 


The Fourier transforms of both current and its derivative with respect to time are 
correlated by 


d 
all (w) = ia l\(@) (22.192) 
and (22.191) becomes in frequency domain with (22.188) 


qnp®o 


It+t)= Qny 


i a (1 + ime) (yp 8p — is 5,) e™ at, (22.193) 
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where we have used some abbreviations which become obvious by comparison 
with (22.188). Inserting (22.193) and (22.189) into (22.190), we get 


v= tore / / dX (Jo Zoe" — i Ze!) en 
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For abbreviation we have set 6; = 6(92;), Z; = Z(Q;), Jo = Jo(t2o), and Jy = 
J) (4). An additional index has been added to indicate whether the quantity is part 
of the summation over p or r. Before we perform the time integration we reverse the 
first summation by replacing p > —p and get terms like [ e!(%0—®)' dt = 2718) ete. 
and (22.194) becomes 


2 
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The integration over w will eliminate many components. Specifically, we note 
that all cross terms 595; vanish after integration. We also note that the terms 59,40, 
vanish unless r = p. With this in mind we get from (22.195) 


(qoo)’n» ; 
i= “TaN, 210004 REI AF Zy Sieh Zi: (22.196) 


Finally the summation over p leads to a number of cancellations considering that 
the resistive impedance is an even and the reactive impedance an odd function. With 


Zo = Z0 + iZio, Zo(@) = Zo(—@), and Zjn(@) = —Zi9(—w) (22.196) becomes 
U= (q.20)’ny 3 [Jo (t 20) Z-(2o) + JZ (F Qo) Z(21) (22.197) 
Qn Np p=—00 
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The first and second term are the resistive energy losses of the circulating beam and 
synchrotron oscillations respectively while the third and fourth term are responsible 
for the stability of the multi bunch beam. 

The equation of motion for synchrotron oscillations has been derived in Chap. 9 
and we found that frequency and damping is determined by the accelerating rf-field 
and energy losses. We expect therefore that the energy loss derived for coupled 
bunch oscillations will also lead to a frequency shift and damping or anti damping. 
Specifically, we have for the equation of motion from (9.25) 


hne dV 1 dU 


av =U 22.198 
InBecpy dw|,” To dE ( ) 


Eo 


G+ 


where we notice the phase proportional term which determines the unperturbed 
synchrotron frequency 


hn. dV he eVo cos Ws 
Cap |) eee (22.199) 
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The term proportional to @ gave rise to the damping decrement 


pee (22.200) 
a9 =-~— =| .- : 
0 279 dE |e, 
The modification of the synchrotron frequency is with = t = @/hao 


from (22.93)—(22.95) similar to the derivation of the unperturbed frequency 
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where fo = @o/2z is the revolution frequency. Note that n. < 0 above transition 
and the additional damping or energy loss due to narrow-band impedances reduces 
the frequency as one would expect. 

Similarly we derive the modification of the damping decrement from the 
imaginary term in (22.197) noting that the solution of the synchrotron oscillation 


gives tT = —i§2,t with g = hawot and the damping decrement for a multi-bunch 
beam is 
WoNcI 2 
0c! b 1 A 2 
a, = Aso — J (4821) |°Z,(Q)) . 22.202 
‘ rea ey Q, [a fo Ji (T21))°Z-(21) ( ) 


For proton and ion beams we would set a, = 0 because there is no radiation 
damping and the interaction of a multi-bunch beam with narrow-band impedances 
would provide damping or antidamping depending on the sign of the damping 
decrement for each term. If, however, only one term is antidamped the beam would 
be unstable and get lost as was observed first at the storage ring DORIS [38]. It is 
therefore important to avoid the overlap of any line of the beam spectrum with a 
narrow-band impedance in the ring. 

Since this is very difficult to achieve and to control, it is more convenient to 
minimize higher-order narrow-band impedances in the ring by design as much as 
possible to increase the rise time of the instabilities. In electron storage rings the 
situation is similar, but now the instability rise time must exceed the radiation 
damping time. Even though, modern storage rings are designed for high beam 
currents and great efforts are being undertaken to reduce the impedance of higher 
cavity modes by designing monochromatic cavities where the higher-order modes 
are greatly suppressed [39-42]. 

We have discussed here only the dipole mode of the longitudinal coupled-bunch 
instability. Of course, there are more modes and a similar set of instabilities in the 
transverse dimensions. A more detailed discussion of all aspects of multi-bunch 
instabilities would exceed the scope of this text and the interested reader is referred 
to the specific literature, specifically to [3, 9, 19, 26]. 


Problems 


22.1 (S). Consider a storage ring with 250m circumference and a stored beam 
current of 50 mA in | bunch. Assume that the bunch length is about 5 % of the bunch 
spacing. A typical loss parameter for a BPM assembly is ky = 3.35- 10~? V/pC and 
for bellows its kj = 6.12 - 10~? V/pC. Calculate the induced power in both BPM 
with 50 Q termination and bellows. What is the power at the 50 Q termination? 


22.2 (S). Show that (22.94) is the same as (22.93) and show that the constant A 


in (22.94) is given by A = i Pe where 5 = Ap/po. 
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22.3. Specify a damping ring at an energy of 1.5GeV and an emittance of 10~!° 
m-rad. The rf-frequency be 500 MHz and 10!! electrons are to be stored into a 
single bunch at full coupling. Calculate the Touschek lifetime and the coherent and 
incoherent space-charge tune shift. Would the beam survive in case a tune shift of 
AQ, < 0.05 were permissible? 


22.4. Use the wake field for the SLAC linear accelerator structure (Fig. 22.8) and 
calculate the energy loss of a particle in the tail of a 1mm long bunch of 10!! 
electrons for the whole SLAC linear accelerator of 3 km length. This energy droop 
along a bunch is mostly compensated by accelerating the bunch ahead of the crest 
of the accelerating wave. This way the particles in the head of the bunch gain less 
energy than the particles in the tail of the bunch. The extra energy gain of the tail 
particles is then lost again due to wake field losses. How far off the crest must the 
bunch be accelerated for this compensation? 


22.5. Consider the phenomenon of beam break-up in a linear accelerator and split 
the bunch into a head, center and tail part with a particle distribution M,/4 to Np /2 
to N,/4. Set up the equations of motion for all three particles including wake fields 
and solve the equations. Show the exponential build up of oscillation amplitudes of 
the tail particle. Perform the same derivation including BNS damping where each 
macroparticle has a different betatron oscillation frequency. Determine the condition 
for optimum BNS damping. 


22.6. Determine the perturbation of a Gaussian particle distribution under the 
influence of a capacitive wake field. In particular, derive expressions for the 
perturbation of the distribution (if any) and the change in the fwhm bunch width as 
a function of 0, in the limit of small wakes. If there is a shift in the distribution what 
physical effects cause it? Hint: think of a loss mechanism for a purely capacitive 
wake field? 


22.7. During the discussion of the dispersion relation we observed the stabilizing 
effect of Landau damping and found the stability criterion (22.95) stating that the 
threshold current can be increased proportional to the square of the momentum 
spread in the beam. How does this stability criterion in terms of a momentum spread 
relate to the conclusion in the section on Landau damping that the beam should 
have a frequency overlap with the excitation frequency? Why is a larger momentum 
spread better than a smaller spread? 


22.8. Determine stability conditions for the fast head-tail instability in a storage 
ring of your choice assuming that all transverse wake fields come from accelerating 
cavities. Use realistic parameters for the rf-system and the number of cells appro- 
priate for your ring. What is the maximum permissible transverse impedance for a 
bunch current of 100 mA? Is this consistent with the transverse impedance of pill 
box cavities? If not how would you increase the current limit? 


22.9. Calculate the real and imaginary impedance for the first longitudinal and 
transverse higher-order mode in a pill box cavity and apply these to determine the 
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multi-bunch beam limit for a storage ring of your choice assuming that the beam 
spectrum includes the HOM frequency. Calculate also the frequency shift at the 
limit. 
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Part IX 
Synchrotron Radiation 


Chapter 23 
Fundamental Processes 


Ever since J.C. Maxwell formulated his unifying electromagnetic theory in 1873, 
the phenomenon of electromagnetic radiation has fascinated the minds of theorists 
as well as experimentalists. The idea of displacement currents was as radical as it 
was important to describe electromagnetic waves. It was only 14 years later when 
G. Hertz in 1887 succeeded to generate, emit and receive again electromagnetic 
waves, thus, proving experimentally the existence of such waves as predicted by 
Maxwell’s equations. The sources of the radiation are oscillating electric charges 
and currents in a system of metallic wires. In this text, we discuss the generation of 
electromagnetic radiation emitted by free electrons from first principles involving 
energy and momentum conservation as well as Maxwell’s equations. 


23.1 Radiation from Moving Charges 


Analytical formulation of the emission of electromagnetic radiation posed a con- 
siderable challenge. Due to the finite speed of light one cannot make a snapshot to 
correlate the radiation field at the observer with the position of radiating charges. 
Rather, the radiation field depends on the position of the radiating charges some 
time earlier, at the retarded time, when the radiation was emitted. Already 1867 L. 
Lorenz included this situation into his formulation of the theory of electromagnetic 
fields and introduced the concept of retarded potentials. He did, however, not 
offer a solution to the retarded potentials of a point charge. Liénard [1] in 1898 
and independently in 1900 Wiechert [2] derived for the first time expressions 
for retarded potentials of point charges like electrons. These potentials are now 
called the Liénard-Wiechert potentials relating the scalar and vector potential of 
electromagnetic fields at the observation point to the location of the emitting charges 
and currents at the time of emission. Using these potentials, Liénard was able to 
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calculate the energy lost by electrons while circulating in a homogenous magnetic 
field. 

In 1907 [3, 4] and 1912 [5] Schott formulated and published his classical theory 
of radiation from an orbiting electron. He was primarily interested in the spectral 
distribution of radiation and hoped to find an explanation for atomic radiation 
spectra. Verifying Liénard’s conclusion on the energy loss, he derived the angular 
and spectral distribution and the polarization of radiation. Since this classical 
approach to explain atomic spectra was destined to fail, his paper was forgotten 
and only 40 years later were many of his findings rediscovered. 


23.1.1 Why Do Charged Particles Radiate? 


Before we dive into the theory of electromagnetic radiation in more detail we 
may first ask ourselves why do charged particles radiate at all? Emission of 
electromagnetic radiation from charged particle beams (microwaves or synchrotron 
radiation) is a direct consequence of the finite velocity of light. A charged particle in 
uniform motion through vacuum is the source of electric field lines emanating from 
the charge radially out to infinity. While the charged particle is at rest or moving 
uniformly these field lines also are at rest or in uniform motion together with the 
particle. Now, we consider a particle being suddenly accelerated for a short time. 
That means the field lines should also be accelerated. The fact that the particle has 
been accelerated is, however, still known only within the event horizon in a limited 
area close to the particle. The signal of acceleration travels away from the source 
(particle) only at the finite speed of light. Field lines close to the charged particle 
are directed radially toward the particle, but far away, the field lines still point to 
the location where the particle would be had it not been accelerated. Somewhere 
between those two regimes the field lines are distorted and it is this distortion 
travelling away from the particle at the speed of light what we call electromagnetic 
radiation. The magnitude of these field distortions is proportional to the acceleration. 
In a linear accelerator, for example, electrons are accelerated along the linac 
axis and therefore radiate. The degree of actual acceleration, however, is very low 
because electrons in a linear accelerator travel close to the velocity of light. The 
closer the particle velocity is to the velocity of light the smaller is the actual 
acceleration gained from a given force, and the radiation intensity is very small. 
In a circular accelerator like a synchrotron, on the other hand, particles are deflected 
transversely to their direction of motion by magnetic fields. Orthogonal acceleration 
or the rate of change in transverse velocity is very large because the transverse 
particle velocity can increase from zero to very large values in a very short time 
while passing through the magnetic field. Consequently, the emitted radiation 
intensity is very large. Synchrotron radiation sources come therefore generally in 
form of circular synchrotrons. Linear accelerators can be the source of intense 
synchrotron radiation in conjunction with a transversely deflecting magnet. 
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23.1.2 Spontaneous Synchrotron Radiation 


Charged particles do not radiate while in uniform motion, but during acceleration a 
rearrangement of its electric fields is required and this field perturbation, traveling 
away from the charge at the velocity of light, is what we observe as electromagnetic 
radiation. Free accelerated electrons radiate similarly to those in a radio antenna, 
although now the source (antenna) is moving. Radiation from a fast moving particle 
source appears to the observer in the laboratory as being all emitted in the general 
direction of motion of the particle. This forward collimation is particularly effective 
for highly relativistic electrons where most of the radiation is concentrated into a 
small cone around the forward direction with an opening angle of 1/y, typically 0.1 
to | mrad, where y is the particle energy in units of its rest mass. 

Radiation can be produced by magnetic deflection in a variety of ways. Whether 
it be a single kick-like deflection or a periodic right-left deflection, the radiation 
characteristics reflect the particular mode of deflection. Specific radiation charac- 
teristics can be gained through specific modes of deflections. Here, we will only 
shortly address the main processes of radiation generation and come back later for 
a much more detailed discussion of the physical dynamics. 

In an undulator the electron beam is periodically deflected transversely to 
its direction of motion by weak sinusoidally varying magnetic fields, generating 
periodic perturbations of the electric field lines. A receiving electric field detector 
recognizes a periodic variation of the transverse electromagnetic field components 
and interprets this as quasi monochromatic radiation. In everyday life periodic 
acceleration of electrons occurs in radio and TV antennas and we may receive these 
periodic field perturbations with a radio or TV receiver tuned to the frequency of 
the periodic electron motion in the emitting antenna. The fact that we consider 
relativistic electrons is not fundamental, but we restrict ourselves in this text to high 
energy electrons only. 

To the particle the wavelength of the emitted radiation is equal to the undulator 
period length (A,) divided by y due to relativistic Lorentz contraction. In a stationary 
laboratory system, this wavelength appears to the observer further reduced by 
another factor 2y due to the Doppler effect. The undulator period length of the order 
of centimeters is thus reduced by a factor y? (10°-108) to yield short wavelength 
radiation in the VUV and x-ray regime. The spectral resolution of the radiation is 
proportional to the number of undulator periods N, and its wavelength can be shifted 
by varying the magnetic field. Most radiation is emitted within the small angle of 
(y /Np)'. 

Increasing the magnetic field strength causes the pure sinusoidal transverse 
motion of electrons in an undulator to become distorted due to relativistic effects 
generating higher harmonic perturbations of the electron trajectory. Consequently, 
the monochromatic undulator spectrum exhibits higher harmonics and changes 
into a line spectrum. For very strong fields, many harmonics are generated which 
eventually merge into a continuous spectrum from IR to hard x-rays. In this extreme, 
we call the source magnet a wiggler magnet. The spectral intensity varies little over 
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a broad wavelength range and drops off exponentially at photon energies higher than 
the critical photon energy, écrit « By”. Changing the magnetic field, one may vary 
the critical photon energy to suit experimental requirements. Compared to bending 
magnet radiation, wiggler radiation is enhanced by the number of magnet poles Npoi 
and is well collimated within an angle of 1/y to say 10/y, or a few mrad. 

A bending magnet is technically the most simple radiation source. Radiation is 
emitted tangentially to the orbit similar to a search light while well collimated in 
the non-deflecting, or vertical plane. The observer at the experimental station sees 
radiation from only a small fraction of the circular path which can be described as a 
piece of a distorted sinusoidal motion. The radiation spectrum is therefore similar to 
that of a wiggler magnet while the intensity is due to only one pole. Because bending 
magnets define the geometry of the electron beam transport system or accelerator, 
it is not possible to freely choose the field strength and the critical photon energy is 
therefore fixed. Sometimes, specially in lower energy storage rings, it is desirable 
to extend the radiation spectrum to higher photon energies into the x-ray regime. 
This can be accomplished by replacing one or more conventional bending magnet 
with a superconducting magnet or superbends at much higher field strength. To 
preserve the ring geometry the length of these superbends must be chosen such 
that the deflection angle is the same as it was for the conventional magnet that has 
been replaced. Again, superbends are part of the ring geometry and therefore the 
field cannot be changed. 

A more flexible version of a radiation hardening magnet is the wavelength shifter. 
This is a magnet which consists of a high field central pole and two weaker outside 
poles to compensate the deflection by the central pole. The total deflection angle 
is zero and therefore the field strength can be chosen freely to adjust the critical 
photon energy. It’s design is mostly based on superconducting magnet technology, 
particularly in low energy accelerators, to extend (shift) the critical photon energy 
available from bending magnets to higher values. 

A variety of more complicated magnetic field arrangements have been developed 
to primarily generate circularly or elliptically polarized radiation. In such magnets 
horizontal as well as vertical magnetic fields are sequentially employed to deflect 
electrons into some sort of helical motion giving raise to the desired polarization 
effect. 


23.1.3 Stimulated Radiation 


The well defined time structure and frequency of undulator radiation can be used to 
stimulate the emission of even more radiation. In an optical klystron [6] coherent 
radiation with a wavelength equal to the fundamental undulator wavelength enters 
an undulator together with the electron beam. Since the electron bunch length 
is much longer than the radiation wavelength, some electrons loose energy to 
the radiation field and some electrons gain energy from the radiation field while 
interacting with the radiation field. This energy modulation can be transformed into 
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a density modulation by passing the modulated electron beam through a dispersive 
section. This section consists of deflecting magnetic fields arranged in such a way 
that the total path length through the dispersive section depends on the electron 
energy. The periodic energy modulation of the electron bunch then converts into 
a periodic density modulation. Now we have microbunches at a distance of the 
undulator radiation wavelength. This microbunched beam travels through a second 
undulator where again particles can loose or gain energy from the radiation field. 
Due to the microbunching, however, most particles are concentrated at phases where 
there is only energy transfer from the particle to the radiation field, thus providing a 
high gain of radiation intensity. 

In a more efficient variation of this principle, radiation emitted by electrons 
passing through an undulator is recycled by optical mirrors in such a way that it 
passes through the same undulator again together with another electron bunch. The 
external field stimulates more emission of radiation from the electrons, and is again 
recycled to stimulate a subsequent electron bunch until there are no more bunches 
in the electron pulse. Generating from a linear accelerator a train of thousands of 
electron bunches one can generate a large number of interactions, leading to an 
exponential growth of electromagnetic radiation. Such a devise is called a free 
electron laser or short FEL. 


23.1.4 Electron Beam 


In this text we consider radiation from relativistic electron beams only. Such beams 
can be generated efficiently by acceleration in microwave fields. The oscillatory 
nature of microwaves makes it impossible to produce a uniform stream of particles, 
and the electron beam is modulated into bunches at the distance of the microwave 
wavelength. The bunched nature of the electron beam and the fact that these bunches 
circulate in a storage ring determines the time structure and spectrum of the emitted 
radiation. Typically, the bunch length in storage rings is 30-100 ps at a distance of 
2-3 ns depending on the rf-frequency. 

During the storage time of the particle beam, the electrons radiate and it is this 
radiation that is extracted and used in experiments of basic and applied research. 
Considering, for example, only one bunch rotating in the storage ring, the experi- 
menter would observe a light flash at a frequency equal to the revolution frequency 
Srev. Because of the extremely short duration of the light flash many harmonics of the 
revolution frequency appear in the light spectrum. At the low frequency end of this 
spectrum, however, no radiation can be emitted for wavelength longer than about 
the dimensions of the metallic vacuum chamber surrounding the electron beam. 
For long wavelengths the metallic boundary conditions for electromagnetic fields 
cannot be met prohibiting the emission of radiation. Practically, useful radiation is 
observed from storage rings only for wavelengths below the microwave regime, or 
forA < 1mm. 
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23.2 Conservation Laws and Radiation 


The emission of electromagnetic radiation from free electrons is a classical phe- 
nomenon. We may therefore use a visual approach to gain some insight into 
conditions and mechanisms of radiation emission. First, we will discuss necessary 
conditions that must be met to allow an electron to emit or absorb a photon. 
Once such conditions are met, we derive from energy conservation a quantity, the 
Poynting vector, relating energy transport or radiation to electromagnetic fields. This 
will give us the basis for further theoretical definitions and discussions of radiation 
phenomena. 

The emission of electromagnetic radiation involves two components, the electron 
and the radiation field. For the combined system energy-momentum conservation 
must be fulfilled. These conservation laws impose very specific selection rules on 
the kind of emission processes possible. To demonstrate this, we plot the energy 
versus momentum for both electron and photon. In relativistic terms, we have the 


relation y = 1+ (By) between energy y and momentum fy. For consistency 
in quantities used we normalize the photon energy to the electron rest energy, 


Yp = &p/mc?, where &) = hw is the photon energy and mc” the electron rest 
mass while the normalized photon momentum is yy = hk,/mc*. Similarly, we 
express the speed of light by By = cp/c = 1/n where n > | is the refractive 
index of the medium surrounding the photon. With these definitions and assuming, 
for now, vacuum as the medium (n = 1) the location of a particle or photon in 
energy-momentum space is shown in Fig. 23.1 (left). 


/ : 
y photon creation 
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Fig. 23.1 Energy-momentum relationship for particles and photons (/eft). Violation of energy or 
momentum conservation during emission and absorption of electromagnetic radiation by a free 
electron travelling in perfect vacuum (6, = 1) (right) 
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Energy and momentum of a particle are related such that it must be located on the 
“particle”-line in Fig. 23.1(left) while a photon is always located on the “photon”- 
line. Transfer of energy between particle and photon must obey energy-momentum 
conservation. In Fig. 23.1 (right) we apply this principle to a free electron in vacuum 
emitting (absorbing) a photon. To create a photon the electron would have to loose 
(gain) an amount of momentum which is numerically equal to the energy gained 
(lost) by the photon. Clearly, in this case the electron would end up at a location 
off the “particle’-line, thus violating momentum conservation. That cannot be, and 
such a process is therefore not permitted. A free electron in vacuum cannot emit or 
absorb a photon without violating energy-momentum conservation. 


23.2.1 Cherenkov Radiation 


We have been careful to assume an electron in perfect vacuum. What happens in 
a material environment is shown in Fig. 23.2. Because the refractive index n > 1, 
the phase velocity of radiation is less than the velocity of light in vacuum and with 
f = 1/n, the “photon’-line is tilted towards the momentum axis. 

Formally, we obtain this for a photon from the derivative dy/d(By) which we 


expand to G@ = wv de a and get with y = haw/mc?,k = n®, and the 


Fig. 23.2. Energy and 
momentum conservation in a 
refractive environment with 
n>1 
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momentum By = 2, the derivative 


dyp 1 
a2 eg: 23.1 
d(py), 72 ai 


where we have added the subscript , to differentiate between photon and electron 
parameters. 

The dispersion function for a photon in a material environment has a slope less 
than unity as shown in Fig. 23.2. In this case, the numerical value of the photon 
momentum is less than the photon energy, analogous to the particle case. To create 
a photon of energy Yp we set yp = —Ay = —BABy from (1.30), where from (23.1) 
the photon energy yp = 1 (By), and get from both relations (By), = —nBABy. 
Because of symmetry, no momentum transverse to the particle trajectory can be 
exchanged, which means radiation is emitted uniformly in azimuth. The change in 
longitudinal momentum along the trajectory is —ABy = (By), r = (By), cos @. In 
a dielectric environment, free electrons can indeed emit or absorb a photon although, 
only in a direction given by the angle 6 with respect to the electron trajectory. This 
radiation is called Cherenkov radiation, and the Cherenkov angle 6 is given by the 
Cherenkov condition 


np cos@ = 1. (23.2) 


Note, that this condition is not the same as saying whenever an electron passes 
though a refractive medium with n > | there is Cherenkov radiation. The Cherenkov 
condition requires that nB > 1 which is, for example, not the case for an electron 
beam of less than 20 MeV traveling through air. 


23.2.2 Compton Radiation 


To generate electromagnetic radiation from free electrons in vacuum without 
violating energy-momentum conservation, we may employ the Compton effect 
which is the scattering of an incoming photon by the electron. In energy-momentum 
space this process is shown in Fig. 23.3. The electron, colliding head-on with an 
incoming photon absorbs this photon and emits again a photon of different energy. 
In this process it gains energy but looses momentum bringing the electron in the 
energy-momentum space to an intermediate point, P;, from where it can reach its 
final state on the “particle”-line by emitting a photon as shown in Fig. 23.3. This 
is the process involved in the generation of synchrotron radiation. Static magnetic 
fields in the laboratory system appear as electromagnetic fields like an incoming 
(virtual) photon in the electron system with which the electron can collide. Energy- 
momentum conservation give us the fundamental and necessary conditions under 
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Fig. 23.3 Energy and 
momentum conservation for 
Compton scattering process 


photon emission 


photon creation 


photon absorption 


By 


which a free charged particle can emit or absorb a photon. We turn our attention 
now to the actual interaction of charged particles with an electromagnetic field. 


23.3 Electromagnetic Radiation 


Phenomenologically, synchrotron radiation is the consequence of a finite value 
for the velocity of light. Electric fields extend infinitely into space from charged 
particles in uniform motion. When charged particles become accelerated, however, 
parts of these fields cannot catch up with the particle anymore and give rise to 
synchrotron radiation. This happens more so as the particle velocity approaches 
the velocity of light. 

The emission of light can be described by applying Maxwell’s equations to 
moving charged particles. The mathematical derivation of the theory of radiation 
from Maxwell’s equations is straightforward although mathematically elaborate and 
we will postpone this to Chap.25. Here we follow a more intuitive discussion! 
which displays visually the physics of synchrotron radiation from basic physical 
principles. 

Electromagnetic radiation occurs wherever electric and magnetic fields exist with 
components orthogonal to each other such that the Poynting vector 


a2 We Le: (23.3) 
Ho 


'The author would like to thank Prof. M. Eriksson, Lund, Sweden for introducing him to this 
approach into the theory of synchrotron radiation. 


808 23 Fundamental Processes 


It is interesting to ask what happens if we have a static electric and magnetic 
field such that [E x B] ~ 0. We know there is no radiation but the Poynting vector is 
nonzero. Applying energy conservation (1.87) we find the first two terms to be zero 
which renders the third term zero as well. For a static electric and magnetic field the 
integral defining the radiation loss or absorption is equal to zero and therefore no 
radiation or energy transport occurs. 

Similarly, in case of a stationary electrostatic charge, we note that the electrostatic 
fields extend radially from the charge to infinity which violates the requirement that 
the field be orthogonal to the direction of observation or energy flow. Furthermore, 
the charge is stationary and therefore there is no magnetic field. 


23.3.1 Coulomb Regime 


Next, we consider a charge in uniform motion. In the rest frame of the moving 
charge we have no radiation since the charge is at rest as just discussed. In the 
laboratory system, however, the field components are different. Since the charge 
is moving, it constitutes an electric current which generates a magnetic field. 
Formulating the Poynting vector in the laboratory system we express the fields by 
the pure electric field in the particle rest frame £*. We accomplish that by an inverse 
Lorentz transformations to (1.9), where the laboratory system £ now moves with the 
velocity —6, with respect to £* and f, in (1.9) must be replaced by —6, for 


E, y 00 0 fy0\(E 
Ey 0 y O-By 0 O}] E 
E, 0 0 1 0 00]] & 

= = 23.4 
cBy 0 -B.y0 y 00 cB* ven 
cBy By 0 0 0 y O cB 
cB, 0 00 0 O11 cB* 


In the laboratory system £ the components of the Poynting vector (23.3) become 
then with B* = 0 


CHS = = pF E, , 

ChLoSy = —/ BE, EF , (23.5) 

cpoS: = +y°B: (Ey? + Ey’) . 
where * indicates quantities in the moving system £* and B, = v- /c. The Poynting 
vector is nonzero and describes the flow of field energy in the environment of 
a moving charged particle. The fields drop off rapidly with distance from the 


particle and the “radiation” is therefore confined close to the location of the particle. 
Specifically, the fields are attached to the charge and travel in the vicinity and with 
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the charge. This part of electromagnetic radiation is called the Coulomb regime in 
contrast to the radiation regime and is, for example, responsible for the transport of 
electric energy along electrical wires and transmission lines. 

We will ignore this regime in our further discussion of synchrotron radiation. 
It should be noted, however, that measurements of radiation parameters close to 
radiating charges may be affected by the presence of the Coulomb radiation regime. 
Such situations occur, for example, when radiation is observed close to the source 
point. Related theories deal with this mixing by specifying a formation length 
defining the minimum distance from the source required to sufficiently separate the 
Coulomb regime from the radiation regime. 


23.3.2 Radiation Regime 


In this text we are only interested in the radiation regime and therefore ignore from 
now on the Coulomb regime. To describe the physics of emission of radiation, we 
consider a coordinate system moving with a constant velocity equal to that of the 
charged particle and associated electric fields. The charge is at rest in the moving 
reference system, the electric field lines extend radially out to infinity, and there 
is no radiation as discussed before. Acceleration of the charge causes it to move 
with respect to this reference system generating a distortion of the purely radial 
electric fields of a uniformly moving charge (Fig. 23.4). This distortion, resulting 
in a rearrangement of field lines to the new charge position, travels outward at the 
velocity of light giving rise to what we call radiation. 


Fig. 23.4 Distortion of fields 
due to longitudinal 
acceleration 


charge > 0 


iii vector: § 
long. electric field 
a component E__ 


magnetic field B, 


acceleration 
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To be more specific, we consider a positive charge in uniform motion for t < 0, 
the we apply an accelerating force at time tf = 0 for a time AT and observe the 
charged particle and its fields in the uniformly moving frame of reference. Due to 
acceleration the charge moves in this reference system during the time AT from 
point A to point B and as a consequence the field lines become distorted within a 
radius cAT from the original location A of the particle. It is this distortion, travelling 
away from the source at the speed of light, that we call radiation. 

The effects on the fields are shown schematically in Fig. 23.4 for an acceleration 
of a positive charge along its direction of motion. At time ¢ = 0 all electric field lines 
extend radially from the charge located at point A to infinity. During acceleration 
fieldlines emerge from the charge now at locations between A and B. The new 
field lines must join the old field lines which, due to the finite velocity of light, 
are still unperturbed at distances larger than cAT. As long as the acceleration lasts, 
a nonradial field component, parallel and opposite to the acceleration, is created. 
Furthermore, the moving charge creates an azimuthal magnetic field Bo(t) and 
the Poynting vector becomes nonzero causing the emission of radiation from an 
accelerated electrical charge. 

Obviously, acceleration would not result in any radiation if the velocity of 
propagation for electromagnetic fields were infinite (c — 00). In this case the radial 
fields at all distances from the charge would instantly move in synchrony with the 
movement of the charge. Only the Coulomb regime would exist. 

The electrical field perturbation is proportional to the electrical charge g and the 
acceleration a*. Acceleration along the z-axis generates an electric field E> # 0 and 
its component normal to the direction of observation scales like sin 0*, where ©* is 
the angle between the line of observation and the direction of particle acceleration. 
During the acceleration a fixed amount of field energy is created which propagates 
radially outward from the source. Since the total radiation energy must stay constant 
and the volume of the expanding spherical sheath of field perturbation increases like 
R?, the field strength decays linear with distance R. With this, we make the ansatz 


ea* 


——— —— sin 0* 23.6 
Amey c2R au ( ) 


x 

Ey = 
for the electric field, where we have added a factor c* in the denominator to be 
dimensionally correct. For an electron (e <0) the field perturbation would be 
positive pointing in the direction of the acceleration. As expected from the definition 
of the Poynting vector, the radiation is emitted predominantly orthogonal to the 
direction of acceleration and is highly polarized in the direction of acceleration. 
From (1.89) 


1 
s=— Ejn* , (23.7) 
clo 


where n* is the unit vector in the direction of observation from the observer toward 
the radiation source. The result is consistent with our earlier finding that no free 
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Fig. 23.5 Spatial radiation 
distribution in the rest frame 
of the radiating charge 


Fig. 23.6 Distortion of field 
lines due to transverse 
acceleration 
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radiation is emitted from a charge at rest or uniform motion (a* —> 0). The 
spatial radiation distribution is from (23.6) and (23.7) characterized by a sin? O*- 
distribution resembling the shape of a doughnut as shown in Fig. 23.5, where the 


acceleration occurs along the x-axis. 


Acceleration may not only occur in the longitudinal direction but also in 
the direction transverse to the velocity of the particle as shown in Fig. 23.6. 
The distortion of field lines in this case creates primarily transverse or radial 
field components. The radiation field component transverse to the direction of 


observation is 


(23.8) 
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This case of transverse acceleration describes the appearance of synchrotron 
radiation created by charged particles being deflected in magnetic fields. Similar 
to (23.7) the Poynting vector for transverse acceleration is 


1 2 ok 
S = —EVn". (23.9) 


Problems 


23.1 (S). Consider a relativistic electron traveling along the z-axis. In its own 
system, the electrical field lines extend radially from the charge. Considering only 
the xz-plane, derive an expressions for the electrical field lines in the laboratory 
frame of reference. Sketch the field pattern in the electron rest frame and in the 
laboratory system of reference. 


23.2 (S). Use a 10MeV electron beam passing through atmospheric air. Can 
you observe Cherenkov radiation and if so at what angle? Answer the same 
questions also for a 50 MeV electron beam. Describe and explain with Fig. 23.2 
the fundamental difference of your results (i, = 1.0002769 for A = 5, 600 A). 


23.3 (S). A 10 MeV electron beam passes with normal incidence through a plate of 
polystyren scintilator (1 = 1.58). Is there any Cherenkov radiation and if so at what 
angle? Where does this radiation escape the plate? 


23.4 (S). An electron beam orbits in a circular accelerator with a circumference of 
300 m at an average current of 250 mA and the beam consists of 500 equally spaced 
bunches each | cm long. How many particles are orbiting? How many particles are 
in each bunch? Assuming the time structure of synchrotron radiation is the same as 
the particle beam time-structure specify and plot the radiation time-structure in the 
photon beam line. 


23.5 (S). From Heisenberg’s uncertainty relation construct a “characteristic vol- 
ume” of a photon with energy €p, = fiw. What is the average electric field in this 
volume for a | eV photon and an X-ray photon of 10 keV? 


23.6 (S). Derive from (1.38) the formula for the classical Doppler effect valid for 
sound waves emitted at a frequency f, from a source moving with velocity v and 
received at an angle v. 


23.7 (S). Consider an electron storage ring at an energy of 800 MeV, a circulating 
current of 1 amp and a bending radius of p = 1.784m. Calculate the energy loss 
per turn, and the total synchrotron radiation power from all bending magnets. What 
would the radiation power be if the particles were 800 MeV muons. 


23.8 (S). For the electron beam of exercise 23.7 calculate the critical energy and 
plot the radiation spectrum. What is the useful frequency range for experimentation 
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assuming that the spectral intensity should be within 1 % of the maximum value? 
Express the maximum useful photon energy in terms of the critical photon energy 
(only one significant digit!). 


23.9 (S). What beam energy would be required to produce x-rays at a critical 
photon energy of 10keV from the storage ring of exercise 23.7? Is that energy 
feasible from a conventional magnet point of view or would the ring have to be 
larger? What would the new bending radius and energy have to be? 


23.10. Verify that for a 10 MeV electron colliding head-on with a Ti-Saphire laser 
(A = 0.8 .m) the wavelength in it’s own system is A* = 40.88 nm. Also show that 
the wavelength of the backscattered photon in the laboratory system is A, = 10.4 A. 
What electron beam energy do you need to produce | A radiation? What is the 
maximum acceptance angle allowable to still get a photon beam with a band width 
of 10 % or less? Show that the acceptance angle is £18.15 mrad. 


23.11. Consider a ray of 123.8meV and 10 keV photons, both at a power density 
of 100 Watt/mm*. How many photons occupy their respective “characteristic vol- 
umes”? Show that the photon flux density is 1.875 x 10!° photons (100 meV)/mm°> 
and 1.875 x 10° photons(10 keV)/mm? . Verify that, 61.07 photons (123.8 meV) and 
1.44 x 107!8 photons (10 keV) occupy, on average, its own characteristic volume 
in a 100 W/mm? beam. The X-ray photon distribution is indeed sparse among it’s 
characteristic volume. What are the respective characteristic volumes? 


23.12. Show that Eqs. (1.88) and (1.89) are the same for electromagnetic waves. 
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Chapter 24 
Overview of Synchrotron Radiation 


After Schott’s [1] unsuccessful attempt to explain atomic radiation with his electro- 
magnetic theory no further progress was made for some 40 years mainly because of 
lack of interest. Only in the mid 1940s did the theory of electromagnetic radiation 
from free electrons become interesting again with the successful development of 
circular high-energy electron accelerators. At this time powerful betatrons [2] have 
been put into operation and it was Ivanenko and Pomeranchouk [3], who first in 
1944 pointed out a possible limit to the betatron principle and maximum energy due 
to energy loss from emission of electromagnetic radiation. This prediction was used 
by Blewett [4] to calculate the radiation energy loss per turn in a newly constructed 
100 MeV betatron at General Electric. In 1946 he measured the shrinkage of the 
orbit due to radiation losses and the results agreed with predictions. On April 24, 
1947 visible radiation was observed for the first time at the 70 MeV synchrotron 
built at General Electric [5—7] with a transparent glass vacuum chamber. Since then, 
this radiation is called synchrotron radiation. 

The energy loss of particles to synchrotron radiation causes technical and 
economic limits for circular electron or positron accelerators. As the particle energy 
is driven higher and higher, more and more rf-power must be supplied to the 
beam not only to accelerate particles but also to overcome energy losses due to 
synchrotron radiation. The limit is reached when the radiation power grows to 
high enough levels exceeding technical cooling capabilities or exceeding the funds 
available to pay for the high cost of electrical power. To somewhat ameliorate this 
limit, high-energy electron accelerators have been constructed with ever increasing 
circumference to allow a more gentle bending of the particle beam. Since the 
synchrotron radiation power scales like the square of the particle energy (assuming 
constant magnetic fields) the circumference must scale similar for a constant amount 
of rf-power. Usually, a compromise is reached by increasing the circumference 
less and adding more rf-power in spaces along the ring lattice made available 
by the increased circumference. In general the maximum energy in large circular 
electron accelerators is limited by the available rf-power while the maximum energy 
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of proton or ion accelerators is more likely limited by the maximum achievable 
magnetic fields in bending magnets. 

What is a nuisance for researchers in one field can provide tremendous oppor- 
tunities for others. Synchrotron radiation is emitted tangentially from the particle 
orbit and within a highly collimated angle of +1/y. The spectrum reaches from 
the far infrared up to hard x-rays, the radiation is polarized and the intensities 
greatly exceed other sources specifically in the vacuum ultra violet to x-ray region. 
With these properties synchrotron radiation was soon recognized to be a powerful 
research tool for material sciences, crystallography, surface physics, chemistry, 
biophysics, and medicine to name only a few areas of research. While in the past 
most of this research was done parasitically on accelerators built and optimized for 
high-energy physics the usefulness of synchrotron radiation for research has become 
important in its own right to justify the construction and operation of dedicated 
synchrotron radiation sources all over the world. 


24.1 Radiation Sources 


Deflection of a relativistic particle beam causes the emission of electromagnetic 
radiation which can be observed in the laboratory system as broadband radiation, 
highly collimated in the forward direction. The emission is related to the deflection 
of a charged particle beam and therefore sweeps like a search light across the 
detection apparatus of the observer. It is this shortness of the observable radiation 
pulse which implies that the radiation is detected as synchrotron radiation with a 
broad spectrum. The width of the spectrum is characterized by the critical photon 
energy (24.49) and depends only on the particle energy and the bending radius of 
the magnet. Generally, the radiation is produced in bending magnets of a storage 
ring, where an electron beam is circulating for hours. 

In order to adjust the radiation characteristics to special experimental needs, other 
magnetic devices are being used as synchrotron radiation sources. Such magnets are 
known as insertion devices since they do not contribute to the overall deflection of 
the particle beam in the circular accelerator. Their effect is localized and the total 
deflection in an insertion device is zero. In this chapter, we give a short overview of 
all radiation sources and their characteristics and postpone more detailed discussions 
of insertion device radiation to Chap. 26. 


24.1.1 Bending Magnet Radiation 


The radiation from bending magnets is emitted tangentially from any point along the 
curved path like that of a searchlight and appears therefore as a swath of radiation 
around the storage ring as shown in Fig. 24.1. In the vertical, nondeflecting plane, 
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Fig. 24.1 Radiation swath | 
from bending magnets in an 
electron storage ring 


however, the radiation is very much collimated with a typical opening angle of 
+1/y. 

The temporal structure of synchrotron radiation reflects that of the electron beam. 
Electrons circulating in the storage ring are concentrated into equidistant bunches 
equal to an integer multiple (usually equal to unity) of the rf-wavelength (60cm 
for 500 MHz) while the bunch length itself is of the order of 1 to 3cm or 30 to 
100 ps depending on beam energy and rf-voltage. As a consequence, the photon 
beam consists of a series of short 30-100 ps flashes every 2 ns (500 MHz) or integer 
multiples thereof. 

Radiation is emitted in a broad spectrum reaching, in principal, from mircowaves 
up to the critically photon energy (24.49) and beyond with fast declining intensities. 
The long wavelength limit of the radiation spectrum is actually limited by the 
vacuum chamber, which causes the suppression of radiation at wavelength longer 
than its dimensions. The strength of bending magnets, being a part of the geometry 
of the storage ring cannot be freely varied to optimize for desired photon beam 
characteristics. This is specifically limiting in the choice of the critical photon 
energy. While the lower photon energy spectrum is well covered even for rather 
low energy storage rings, the x-ray region requires high beam energies and/or high 
magnetic fields. Often, the requirements for x-rays cannot be met with existing 
bending magnet and storage ring parameters. 


24.1.2  Superbends 


The critical photon energy from bending magnet radiation (24.51) is determined by 
the magnet field and the particle energy. The combination of both quantities may 
not be sufficient to extend the synchrotron radiation spectrum into the hard x-ray 
regime, especially in low energy storage rings. In this case, it is possible to replace 
some or all original bending magnets by much stronger but shorter magnets, called 
superbends. To be more specific, conventional bending magnets are replaced by high 
field, shorter superconducting magnets deflecting the electron beam by the same 
angle to preserve the storage ring geometry. Since conventional bending magnet 
fields rarely exceed 1.5 T, but superconducting magnets can be operated at 5—6 T or 
higher, one can gain a factor of 3 to 4 in the critical photon energy and extend the 
photon spectrum towards or even into the hard x-ray regime and beyond. 
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The installation of superbends is not always feasible or desirable. To still meet the 
need for harder x-ray radiation in a low energy storage ring, it is customary to use 
a wavelength shifter. Such a device may consist of three or five superconducting 
dipole magnets with alternating magnetic field directions. For this latter reason, a 
wavelength shifter is a true insertion device. Figure 24.2 shows schematically a 
three-pole wavelength shifter. 

The particle beam passing through this wavelength shifter is deflected up and 
down or left and right in such a way that no net deflection remains. To meet this 
condition, the longitudinal field distribution of a horizontally deflecting wavelength 
shifter must obey the condition 


[- £O=Odd= 0. (24.1) 


CO 


A wavelength shifter with such field properties is neutral on the geometry of the 
particle beam path through a storage ring and therefore can be made in principle as 
strong as necessary or technically feasible. 

Only the central high field pole is used as the radiation source, while the two 
side poles compensate the beam deflection from the central pole. In a five-pole 
wavelength shifter the three central poles would be used as radiators, while both end 
poles again act as compensators. Mostly, the end poles are longer than the central 
poles and operate at a lower field. As their name implies, the primary objective 
in wavelength shifters is to extend the photon spectrum while the enhancement of 
intensity through radiation accumulation from many poles, while desirable, is of 
secondary importance. To maximize the desired effect, wavelength shifters are often 
constructed as high field superconducting magnets to maximize the critical photon 
energy for the given particle beam energy. Some limitations apply for such devices 
as well as for any other insertion device. The end fields of magnets can introduce 
particle focusing and nonlinear field components may introduce aberrations and 
cause beam instability. Both effects must either be kept below a critical level or 
be compensated. 


particle trajectory 


central pole 
compensating poles 


Fig. 24.2 Magnetic field distribution along the beam path for a wave length shifter 
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24.1.4 Wiggler Magnet Radiation 


The principle of a wavelength shifter is extended in the case of a wiggler-magnet. 
Such a magnet consists of a series of equal dipole magnets with alternating magnetic 
field direction. Again, the end poles must be configured to make the total device 
neutral to the geometry of the particle beam path such that the conditions / Bdz = 0 
are met in both planes. 

The main advantage of using many magnet poles is to increase the photon flux. 
Like a single bending magnet, each of the N,.; magnet poles produces a fan of 
radiation in the forward direction and the total photon flux is N,)-times larger than 
that from a single pole. Wiggler-magnets may be constructed as electromagnets 
with fields up to 2T to function both as a flux enhancer and as a more modest 
wavelength shifter compared to the superconducting type. An example of an 8-pole, 
1.8 T electromagnetic wiggler-magnet [8] is shown in Fig. 24.3. 

In this picture, the magnet gap is wide open, to display the flat vacuum chamber 
running through the magnet between the poles. The pole pieces in the lower row are 
visible surrounded by water cooled excitation coils. During operation, both rows of 
wiggler poles are closed to almost touch the flat vacuum chamber. When the magnet 
is closed, a maximum magnetic field of 1.8 T can be obtained. Strong fields can be 
obtained from electromagnets, but the space requirement for the excitation coils 
limits the number of poles that can be installed within a given length. 

Progress in the manufacturing of high field permanent magnet material permits 
the installation of many more poles into the same space compared to an electromag- 
net. An example of a modern 26 pole, 2.0T permanent magnet wiggler magnet is 
shown in Fig. 24.4.! 


Fig. 24.3 Electromagnetic 
wiggler magnet with eight 
1.8 T poles 


'The author would like to thank T. Rabedau, Stanford for this picture. 
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BL11: 26 poles, 2.0T, 175mm penod 


gap for beam | 
vacuum chamber | 


Fig. 24.4 Permanent magnet wiggler magnet with 26 poles, a 175mm period length and a 
maximum field of 2.0T 


Figure 24.4 shows the wiggler magnet during magnetic measurement with the rail 
in front of the magnet holding and guiding the Hall probe. The increased number 
of poles and simplified design compared to the electromagnetic wiggler in Fig. 24.3 
are Clearly visible. 

For short wiggler poles, we express the magnetic field by 


2 
By (x,y = 0,z) = Bosin = (24.2) 
Pp 


and the maximum beam deflection from the axis is equal to the deflection angle per 
half pole 


v= — sin ——dz= ——, (24.3) 


where Bp is the beam rigidity. Multiplying this with the beam energy y, we define 
the wiggler strength parameter 


By A 
K = yd = 2°? = 0.9348  (T) Ay (cm) . (24.4) 
mec* 27 


For longer magnet poles (24.2) must be replaced by a sum of harmonics. Most 
wiggler magnets, though, are designed for the lowest harmonic only. This wiggler 
strength parameter is generally much larger than unity. Conversely, a series of 


alternating magnet poles is called a wiggler magnet if the strength parameter K > 1 
and condition (24.1) is met. As we will see later, a weak wiggler magnet with 


24.1 Radiation Sources 821 


Fig. 24.5 On-axis field q 
strength in a vanadium 3 4 
Permendur hybrid wiggler B, (T) 4 
magnet as a function of gap y _ 
aperture (24.5) 27 qi 
14 : 
0 TT TTTTT T_T TTTT1 
0.01 0.11 g/ r, 1 


K < 1 is called an undulator and produces radiation with significant different 
characteristics. The magnetic field strength can be varied in both electromagnetic 
wigglers as well as in permanent magnet wigglers. While this is obvious for 
electromagnets, the magnetic field strength in permanent magnets depends on the 
distance between magnet poles or on the gap height g. By varying mechanically 
the gap height of a permanent magnet wiggler, the magnetic field strength can be 
varied as well. The field strength also depends on the period length and on the design 
and magnet materials used. For a wiggler magnet constructed as a hybrid magnet 
with Vanadium Permendur poles, the field strength along the midplane axis scales 
approximately like [9] 


B,(T) ~ 3.33 exp |-% (sa7- al . for O.1ApS gS 10A,, (24.5) 


Pp Pp 


where g is the gap aperture between magnet poles. This dependency is also shown 
in Fig. 24.5 and we note immediately that the field strength drops off dramatically 
for magnet gaps of the order of a period length or greater. 

On the other hand, significant field strengths can be obtained for small gap 
apertures and it is therefore important to install the insertion device at a location, 
where the beam dimension normal to the deflection plane is small. 

The total radiation power can be derived by integrating (24.32) through the 
wiggler magnet. The result of this integration is 


An? 
(Pa) = iremc” cy K > ; (24.6) 
P 
or in practical units 
(P, (W)) = 632.7 Bj I Ly, (24.7) 


where J is the circulating beam current, and Ly = NpAp the length of the wiggler 
magnet. 
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For a sinusoidal field distribution Bo sin 2, the desired wavelength shifting 
property of a strong wiggler magnet can be obtained only in the forward direction. 
Radiation emitted at a finite angle with respect to the wiggler axis is softer because 
it is generated at a source point where the field is lower. The hardest radiation 
is emitted in the forward direction from the crest of the magnetic field. For a 
distance Az away from the crest, the emission angle in the deflection plane is 


2 
Ji i wo. 
= + gin ( 2 Az) and the curvature at the source point is 1o1 /j;_( 5 
po 21 Ap p Po K 


where we have made use of (24.4). Consequently, the critical photon energy for 
radiation in the direction y with respect to the wiggler axis varies with the emission 
angle w like 


(24.8) 


At the maximum deflection angle Wnax = K/y the critical photon energy has 
dropped to zero, reflecting a zero magnetic field at the source point. 

This property is undesirable if more than one experimental station is supposed to 
receive hard radiation from the same wiggler magnet. The strength of the wiggler 
magnet sweeps the electron beam over a considerable angle, a feature which can be 
exploited to direct radiation not only to one experimental station along the axis but 
also to two or more side-stations on either side of the wiggler axis. However, these 
side beam lines at an angle y ¥ 0 receive softer radiation than the main beam line. 
This can be avoided if the poles of the wiggler magnet are lengthened thus flattening 
the sinusoidal field crest. As the flat part of the field crest is increased, hard radiation 
is emitted into an increasing angular cone. 


24.1.5 Undulator Radiation 


So far, we discussed insertion devices designed specifically to harden the radiation 
spectrum or to increase the radiation intensity. Equally common is the implemen- 
tation of insertion devices to optimize photon beam quality by maximizing its 
brightness or to provide specific characteristics like elliptically polarized radiation. 
This is done with the use of undulator magnets, which are constructed similar to 
wiggler magnets, but are operated at a reduced field strength. 

Fundamentally, an undulator magnet causes particles to be only very weakly 
deflected with an angle of less than +1/y and consequently the transverse motion of 
particles is nonrelativistic. In this picture, the electron motion viewed from far away 
along the beam axis appears as a purely sinusoidal transverse oscillation similar to 
the electron motion in a linear radio antenna driven by a transmitter and oscillating 
at the station’s carrier frequency. The radiation emitted is therefore monochromatic 
with a period equal to the oscillation period. 
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To be more precise, viewed from far away the particle appears to be at rest or 
uniform motion as long as the electron has not yet reached the undulator magnet. 
Upon entering the magnet the electron performs sinusoidal transverse oscillations 
and returns to its original motion again after it exits the undulator. As a consequence 
of this motion and in light of earlier discussions, we observe emission of radiation 
at the frequency of the transverse oscillating beam motion. If Nper is the number 
of undulator periods, the electric field lines have been perturbed periodically Nper- 
times and the radiation pulse is composed of Nper oscillations. In the particle rest 
frame £* the undulator period length is Lorentz contracted to AY = A,/y which is 
the wavelength of the emitted radiation. Because the radiation includes only a finite 
number of Nper oscillations, the radiation is not quite monochromatic but rather quasi 
monochromatic with a band width of 1/Nper as illustrated in Fig. 24.6 (top). 

In Fig. 24.6 (bottom) the radiation lobe and spectrum is shown in the laboratory 
system. The monochromatic nature of the radiation is somewhat lost because 
radiation emitted at different angles experiences different Doppler shifts. Of course, 
the radiation is again quasi monochromatic even in the laboratory system when 
observed through a narrow pin hole along the axis. This monochromatic radiation is 
called the fundamental undulator radiation and has for K < | a wavelength of 


Ap 
dy & By?" (24.9) 
a) electron rest system : 
I(@*) 
acceleration 
> 
ck” y i 
b) laboratory system 
I(@) 
l/y 
acceleration o > 
2y’ck,, 


Fig. 24.6 Beam dynamics and radiation lobes in the particle rest system (a) and the laboratory 
system (b) for a weak undulator (K < 1) 
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Fig. 24.7 Distortion of 
sinusoidal motion due to 
relativistic perturbation of 
transverse motion 


The situation becomes more complicated as the undulator strength is increased. 
Two new phenomena appear, an oscillatory forward motion and a transverse 
relativistic effect. The first phenomenon that we need to discuss is the fact that the 
transverse motion becomes relativistic. As a consequence of this, the pure sinusoidal 
transverse motion becomes distorted. There is a periodic Lorentz contraction of the 
longitudinal coordinate, which is larger when the particle travels almost parallel 
to the axis in the vicinity of the oscillation crests and is smaller when in between 
crests. The cusps and valleys of the sinusoidal motion become Lorentz-contracted 
in the particle system thus perturbing the sinusoidal motion as shown in Fig. 24.7. 
In addition with increasing undulator strength the transverse motion becomes 
relativistic and the transverse Lorentz contraction enhances the distortion of the 
sine-like motion. 

This perturbation is symmetric about the cusps and valleys causing the appear- 
ance of odd and only odd (34, sth 7th .) harmonics of the fundamental oscillation 
period. From an undulator of medium strength (K = 1) we observe therefore along 
the axis a line spectrum of odd harmonics in addition to the fundamental undulator 
radiation. 

The second phenomenon to be discussed is the periodic modulation of the lon- 
gitudinal motion. The longitudinal component of the particle velocity is maximum 
when the particle travels close to the crest of the oscillations and at a minimum 
when it is close to the axis crossings. In a reference system which moves uniformly 
with the average longitudinal particle velocity along the axis, the particle performs 
periodic longitudinal oscillations in addition to the transverse oscillations. For each 
transverse period, the particle performs two longitudinal oscillations and its path 
looks therefore like a figure of eight. This situation is shown in Fig. 24.8. 

We have now two orthogonal accelerations, one transverse and one longitudinal, 
and two radiation lobes as indicated in Fig. 24.8. Since the longitudinal motion 
occurs at twice the frequency of the transverse motion, we observe now radiation 
also at twice the fundamental frequency. Of course, the relativistic perturbation 
applies here too and we have therefore a line spectrum which includes two series, 
one with all odd harmonics and one with only even harmonics. Even and odd 
harmonic radiation is emitted in the particle system in orthogonal directions and 
therefore we find both radiation lobes in the laboratory system spatially separated 
as well. The odd harmonics all have their highest intensities along the undulator 
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Fig. 24.8 Beam dynamics (/eft) and radiation lobes (middle) in the particle rest system together 
with the harmonics spectrum (right) for a stronger undulator (K = 1) 


axis, while the even harmonic radiation is emitted preferentially into an angle 1/y 
with respect to the axis and has zero intensity along the axis. 

In another equally valid view of undulator radiation, the static and periodic 
magnetic undulator field appears in the rest frame of the electron as a Lorentz 
contracted electromagnetic field or as monochromatic photon of wavelength A* = 
Ap/y. The emission of photons can therefore be described as Thomson scattering of 
virtual photons by free electrons [10] resulting in monochromatic radiation in the 
direction of the particle path. Viewed from the laboratory system, the radiation is 
Doppler shifted and applying (1.38) the wavelength of the backscattered photons is 


Xr 
a (24.10) 
" y2(1+ Bre) 
Viewing the radiation parallel to the forward direction (0 ~ 0), (1.39) becomes 
with n, = cos}* ~ 1— 40", and B ~ 1 

+nt yr 
cpa x =i, (24.11) 

: Nn, n- 


i A 
Lie ew ay? (1+ f0*?), (24.12) 
Pp} 


With (1.40) the angle 3* of the particle trajectory with respect to the observation 
is transformed into the laboratory system like 0* = 2y%. We distinguish two 
configurations. One where 3} = K/y = const. describing the particle motion in 
a helical undulator, where the magnetic field, being normal to the undulator axis, 
rotates about this axis. The other more common case is that of a flat undulator, 
where the particle motion follows a sinusoidal path in which case } = Vana + Vobs- 
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Here Ying = * sin kyz is the observation angle due to the periodic motion of the 
electrons in the undulator and v,», is the actual observation angle. With these 
definitions and taking the average (02,,) we get y°0* = $K?+y703,,. Depending on 
the type of undulator, the wavelength of radiation from an undulator with a strength 


parameter K is 


Ap 
a P(1+K?+y?02,..) fora helical undulator 
A= i (24.13) 
ae (1+ 4K? + y703,,) fora flat undulator. 
4 


From now on only flat undulators will be considered in this text and readers 
interested in more detail of helical undulators are referred to [11]. No special 
assumptions have been made here which would prevent us to apply this derivation 
also to higher harmonic radiation and we get the general expression for the 
wavelength of the kth harmonic 


Xp 
2y7k 


a= (1 + 1K? + y?92,) . (24.14) 


The additional terms $K> + 7? cps COMpared to (24.9) comes from the correct 
application of the applet effect. Since the particles are deflected periodically in 
the undulator, we view even the on-axis radiation at a periodically varying angle 
which accounts for the iK ?-term. Of course, observation of the radiation at a finite 
angle Uops generates an addin red-shift expressed by the term y~ aD 

In more practical units, the undulator wavelengths for the kth harmonic are 
expressed from (24.14) by 


Ap (cm) 


A, (A) = 13.056—?~———_ 
(A) k E2 (GeV?) 


(1+ 4K> + y783,.) (24.15) 


and the corresponding photon energies are 


E? (GeV’) 
Ap (cm) (1 + $K2 + y202,) | 


ex (eV) = 950 (24.16) 


Recollecting the discussion of undulator radiation, we found that the first 
harmonic or fundamental radiation is the only radiation emitted for K «< 1. As 
the undulator parameter increases, however, the oscillatory motion of the particle in 
the undulator deviates from a pure sinusoidal oscillation. For K > 1 the transverse 
motion becomes relativistic, causing a deformation of the sinusoidal motion and 
the creation of higher harmonics. These harmonics appear at integral multiples of 
the fundamental radiation energy. Only odd harmonics are emitted along the axis 
(0 = 0) while even harmonics are emitted into a small angle from the axis. As 
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Fig. 24.9 Transition from quasi-monochromatic undulator radiation to broad band wiggler radia- 
tion 


the undulator strength is further increased more and more harmonics appear, each 
of them having a finite width due to the finite number of undulator periods, and 
finally merging into the well-known broad spectrum of bending or wiggler magnet 
radiation (Fig. 24.9). 

We find no fundamental difference between undulator and wiggler magnets, one 
being just a stronger/weaker version of the other. From a practical point of view, the 
radiation characteristics are very different and users of synchrotron radiation make 
use of this difference to optimize their experimental capabilities. In Chap. 26 we 
will discuss the features of undulator radiation in much more detail. 

The electron motion through an undulator with Nye, periods includes that many 
oscillations and so does the radiation field. Applying a Fourier transformation to the 
field, we find the spectral width of the radiation to be 


ae (24.17) 
A Nowe ; 


In reality, this line width is increased due to the finite aperture of the radiation 
detection elements, and due to a finite energy spread and finite divergence of the 
electron beam. Typical experimental undulator spectra are shown in Fig. 24.10 for 
increasing undulator strength K [12]. 

Although this radiation was measured through a pin hole and on-axis, we still 
recognize even harmonic radiation since the pin hole covers a finite solid angle and 
lets some even harmonic radiation through. Furthermore, the measured intensities of 
the line spectrum does not reflect the theoretical expectation for the lower harmonics 
at higher values of K. This is an artifact of the experimental circumstances, where 
the x-rays have been extracted from the storage ring vacuum chamber through a 
Be-window. Such a window works very well for hard x-rays but absorbs heavily at 
photon energies below some 3 keV. 

The concentration of all radiation into one or few spectral lines is very desirable 
for many experiments utilizing monochromatic photon beams since radiation is 
produced only in the vicinity of the desired wavelength at high brightness. Radiation 
at other wavelengths creating undesired heating effects on optical elements and 
samples is greatly eliminated. 
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Fig. 24.10 Measured radiation spectrum from an undulator for different strength parameters K. 
The intensity at low photon energies are reduced by absorption in a Be-window [12] 


Back Scattered Photons 


The principle of Thomson backscattering or Compton scattering of the static 
undulator fields can be expanded to that of photon beams colliding head on with 
the particle beam. In the electron system of reference the electromagnetic field of 
this photon beam looks fundamentally no different than the electromagnetic field 
from the undulator magnet. We may therefore apply similar arguments to determine 
the wavelength of back scattered photons. The basic difference of both effects is 
that in the case of back scattered photons the photon beam moves with the velocity 
of light towards the electron beam and therefore the electron sees twice the Lorentz 
contracted photon frequency and we expect therefore a back scattered photon beam 
at twice the Doppler shifted frequency. That extra factor of two does not apply for 
undulator radiation since the undulator field is static and the relative velocity with 
respect to the electron beam is c. If Ay is the wavelength of the incident radiation or 
incident laser, the wavelength of the backscattered photons is 


AL 
hy = ay? (1+ y70o,5) » (24.18) 
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where Ups is the angle between the direction of observation and the particle beam 
axis. Scattering, for example, a high intensity laser beam from high-energy electrons 
produces a monochromatic beam of hard x-rays which is highly collimated within 
an angle of +1/y. If the laser wavelength is, for example, A, = 10m and the 
particle energy is 100 MeV the wavelength of the backscattered x-rays would be 
1.3A or the photon energy would be 9.5 keV which is well within the hard x-ray 
regime. 


Photon Flux 


The intensity of the backscattered photons can be calculated in a simple way 
utilizing the Thomson scattering cross section [10] 


orm = *2r2 = 6.65 x 107” cm’. (24.19) 


The total scattering event rate or the number of back scattered photons per unit 
time is then 


Nec = OTL, (24.20) 


where CL is called the luminosity. The value of the luminosity is independent of the 
nature of the physical reaction and depends only on the intensities and geometrical 
dimensions of the colliding beams. The definition of the luminosity is the product 
of the target density of one beam by the “particle’’-flux of the other beam onto this 
target. Therefore the luminosity can be determined by folding the particle density in 
one beam with the incident “particles” per unit time of the other beam. Obviously, 
only those parts of the beam cross sections count which overlap with the cross 
section of the other beam. For simplicity, we assume a Gaussian distribution in 
both beams and assume that both beam cross sections are the same. In a real setup 
one would focus the electron beam and the photon beam to the same optimum cross 
section given by the Rayleigh length (27.59). We further consider the particle beam 
as the target for the photon beam. 

With N, electrons in each bunch of the particle beam within a cross section of 
20,0, the particle density is N-/ 270.0). We consider now a photon beam with 
the same time structure as the electron beam. If this is not the case only that part of 
the photon beam which actually collides with the particle beam within the collision 
zone may be considered. For an effective photon flux Non the luminosity is 


_ NeNon 
270,0) 


(24.21) 


Although the Thomson cross-section and therefore the photon yield is very 
small, this technique can be used to produce photon beams with very specific 
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characteristics. By analyzing the scattering distribution this procedure can also be 
used to determine the degree of polarization of an electron beam in a storage ring. 

So far, it was assumed that the incident and scattered photon energies are much 
smaller than the particle energy in which case it was appropriate to use the classical 
case of Thomson scattering. However, we note from (24.18) that the backscattered 
photon energy increases quadratically with the particle energy and therefore at 
some energy the photon energy becomes larger than the particle energy which is 
nonphysical. In case of large photon energies comparable with the particle energy, 
Compton corrections [13-15] must be included. The Compton cross-section for 
head-on collision is given by [16] 


3 0Th 4 8 1 8 1 
a 1--- In(1 5 += - — 5], 24.22 
com” ( x 2) aera 2(1 +x) aes 
where x = Shon and fiw the incident photon energy. The energy spectrum of the 


scattered photons is then [16] 


4 dy? 
= . ; (24.23) 


dy 4x * l-y x+y) x2(1—y)? 


where y = hw/E is the scattered photon energy in units of the particle energy. 


24.2 Radiation Power 


Synchrotron radiation properties can be described in more detail by integrating 
the Poynting vector (23.7) over a closed surface enclosing the radiating charge. 
With (23.9) and n*dA* = R’ sin 0*d@*d®* we get the total radiation power from 
a single electron in its own rest frame 


pP* = | S*dA* = 27,—a” (24.24) 
~ Cc. 


where we have set ia = 4seyremc*. From the discussion of 4-vectors, we know 
that the square of the 4-acceleration is invariant to Lorentz transformations and get 
from (B.21) for the total radiation power in the laboratory system 


P = 2r.mey® E ox (B xB )' (24.25) 


Equation (24.25) expresses the radiation power in a simple way and allows us to 
calculate other radiation characteristics based on beam parameters in the laboratory 
system. The radiation power is greatly determined by the geometric path of the 
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particle trajectory through the quantities B and B . Specifically, if this path has strong 
oscillatory components we expect that motion to be reflected in the synchrotron 
radiation power spectrum. This aspect will be discussed later in more detail. Here 
we distinguish only between acceleration parallel B | OF perpendicular B , to the 
propagation f of the charge and set therefore 


B=B8,+8B.. (24.26) 


Insertion into (24.25) shows the total radiation power to be composed of 
separate contributions from parallel and orthogonal acceleration. Separating both 
contributions we get the synchrotron radiation power for both parallel and transverse 
acceleration respectively 


-2 
P| — =remcy°B I; (24.27) 
P, = 2rmey*B ). (24.28) 


Expressions have been derived that define the radiation power for parallel 
acceleration like in a linear accelerator or orthogonal acceleration found in circular 
accelerators or deflecting systems. We note a similarity for both contributions except 
for the energy dependence. At highly relativistic energies the same acceleration 
force leads to much less radiation if the acceleration is parallel to the motion of the 
particle compared to orthogonal acceleration. Parallel acceleration is related to the 


accelerating force by mv) = ss p| /dt and after insertion into (24.27) the radiation 
power due to parallel acceleration becomes 
or . 
ae (21) (24.29) 
3mc \ dt 


The radiation power for acceleration along the propagation of the charged particle 
is therefore independent of the energy of the particle and depends only on the 
accelerating force or with dp, /dt =dE/dz on the energy increase per unit length 
of accelerator. Different from circular electron accelerators we encounter therefore 
no practical energy limit in a linear accelerator at very high energies. In contrast very 
different radiation characteristics exist for transverse acceleration as it happens, for 
example, during the transverse deflection of a charged particle in a magnetic field. 
The transverse acceleration v is expressed by the Lorentz force 


oa. = ymb, = e[v x Bl (24.30) 
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and after insertion into (24.28) the radiation power from transversely accelerated 
particles becomes 


2 
Pp =? (+) (24.31) 
dt 

From (24.29), (24.31) we find that the same accelerating force leads to a much 
higher radiation power by a factor y? for transverse acceleration compared to lon- 
gitudinal acceleration. For all practical purposes, technical limitations prevent the 
occurrence of sufficient longitudinal acceleration to generate noticeable radiation. 
From here on we will stop considering longitudinal acceleration unless specifically 
mentioned and eliminate, therefore, the index | setting for the radiation power 
P, = P,. We also restrict from now on the discussion to singly charged particles 
and set q = e ignoring extremely high energies, where multiple charged ions may 
start to radiate. Replacing the force in (24.31) by the Lorentz force (24.30) we get 


_ 4a 2Kre 


= — B’E? = CpB’E’, (24.32) 
"fo 3 (mc?) ° 


where 6 ~ 1 and 


4 2? 1 
= A = 379355 ' (24.33) 
Ho 3 (mc?) T’GeVs 


The synchrotron radiation power scales like the square of the magnetic field and 
the square of the particle energy. Replacing the deflecting magnetic field B by the 
bending radius p, the instantaneous synchrotron radiation power becomes 


4,,4 
Py = Frames (24.34) 
or in more practical units, 
CGE (24.35) 
"On p2’ , 
where 
4m fe _5 m 
y= >a = 8.8463 x 10 a (24.36) 
3 (me?) GeV” 


The electromagnetic radiation of charged particles in transverse magnetic fields 
is proportional to the fourth power of the particle momentum fy and inversely 
proportional to the square of the bending radius p. 
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The synchrotron radiation power increases very fast for high-energy particles and 
provides the most severe limitation to the maximum energy achievable in circular 
accelerators. In storage rings with different magnets and including insertion devices 
it is important to formulate the average radiation power of an electron during the 
course of one turn. In this case we calculate the average 


(P)) = OBS) = GE = (24.37) 
i a i rs ; : 
We note, however, from also a strong dependence on the kind of particles involved 
in the process of radiation. Because of the much heavier mass of protons compared 
to the lighter electrons we find appreciable synchrotron radiation only in electron 
accelerators. 

The radiation power of protons actually is smaller compared to that for electrons 
by the fourth power of the mass ratio or by the factor 


Pe 4 1B 
5. = 18364 = 1.1367 x 10" (24.38) 


P 


In spite of this enormous difference measurable synchrotron radiation has been 
predicted by Coisson [17] and was indeed detected at the 400GeV proton syn- 
chrotron, SPS (Super Proton Synchrotron), at CERN in Geneva [18, 19]. Substantial 
synchrotron radiation is expected in multi-TeV proton colliders like the LHC (Large 
Hadron Collider) at CERN [20]. 

Knowledge of the synchrotron radiation power allows us now to calculate the 
energy loss per turn of a particle in a circular accelerator by integrating the radiation 
power along the circumference of the circular accelerator 


dz 


2 (24.39) 


Up = f Pyar = 2remc By* 


In an isomagnetic lattice, where the bending radius is the same for all bending 
magnets p = const., the integration around a circular accelerator can be performed 
and the energy loss per turn due to synchrotron radiation is 


9 4 
Uy = Pa” = ‘Frome pT. (24.40) 


In more practical units, the energy loss of relativistic electrons per revolution in a 
circular accelerator with an isomagnetic lattice and a bending radius p is given by 


4 4 
Up,iso (GeV) = reece B (24.41) 
p(m) 
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For a beam of J, particles or a circulating beam current J = ef--yN. the total 
average radiation power is 


(Ps) = Un. (24.42) 


or in more practical units 


E* (GeV) 


= 0.088463 ——__ = 
p(m) 


(P; (MW)) I(A). (24.43) 


iso 


The total synchrotron radiation power scales like the fourth power of the particle 
energy and is inversely proportional to the bending radius. The strong dependence 
of the radiation on the particle energy causes severe practical limitations on the 
maximum achievable energy in a circular accelerator. 


24.3 Spectrum 


Synchrotron radiation from relativistic charged particles is emitted over a wide 
spectrum of photon energies. The basic characteristics of this spectrum can be 
derived from simple principles as suggested in [21]. For an observer synchrotron 
light has the appearance similar to the light coming from a lighthouse. Although the 
light is emitted continuously an observer sees only a periodic flash of light as the 
aperture mechanism rotates in the lighthouse. Similarly, synchrotron light emitted 
from relativistic particles will appear to an observer as a single flash if it comes 
from a bending magnet in a transport line passed through by a particle only once 
or as a series of equidistant light flashes as bunches of particles orbit in a circular 
accelerator. 

Since the duration of the light flashes is very short the observer notes a broad 
spectrum of frequencies as his eyes or instruments Fourier analyze the pulse 
of electromagnetic energy. The spectrum of synchrotron light from a circular 
accelerator is composed of a large number of harmonics of the particle revolution 
frequency. These harmonics reach a cutoff, where the period of the radiation 
becomes comparable to the duration of the light pulse. Even though the aperture 
of the observers eyes or instruments are assumed to be infinitely narrow we still 
note a finite duration of the light flash. This is a consequence of the finite opening 
angle of the radiation as illustrated in Fig.24.11. Synchrotron light emitted by a 
particle travelling along the orbit cannot reach the observer before it has reached 
the point Po when those photons emitted on one edge of the radiation cone at an 
angle —1/y aim directly toward the observer. Similarly, the last photons to reach 
the observer are emitted from point P, at an angle of +1/y. Between point Pp and 
point P; we have therefore a deflection angle of 2/y. The duration of the light flash 
for the observer is not the time it takes the particle to travel from point Pp to point 
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Fig. 24.11 Temporal pulse formation of synchrotron radiation 


P, but must be corrected for the finite time of flight for the photon emitted at Po . If 
particle and photon would travel toward the observer with exactly the same velocity 
the light pulse would be infinitely short. However, particles move slower following a 
slight detour and therefore the duration of the light pulse equals the time difference 
between the first photons from point Po arriving at the observer and the last photons 
being emitted by the particles at point P;. Although the particle reaches point Po at 
time t = 0 the first photon can be observed at point P, only after a time 


2psin+ 
ty = ——. (24.44) 
c 
The last photon to reach the observer is emitted when the particle arrives at point 
P, at the time 


i, = ae 
“Bey” 


The duration of the light pulse 6t is therefore given by the difference of both travel 
times (24.44), (24.45) 


(24.45) 


6t = te —ty = =— (24.46) 

The sine-function can be expanded for small angles keeping linear and third order 
terms only and the duration of the light pulse at the location of the observer is after 
some manipulation 


4 
i=, 
3cy3 


(24.47) 


The total duration of the electromagnetic pulse is very short scaling inversely 
proportional to the third power of y. This short pulse translates into a broad 
spectrum. Using only half the pulse length for the effective pulse duration the 
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spectrum reaches up to a maximum frequency of about 


(24.48) 


which is called the critical photon frequency of synchrotron radiation. The critical 
photon energy &- = Awe is then given by 


EB 
fe = Co; (24.49) 
p 
with 
3h 
=. (24.50) 
2 (mc?) 
For electrons numerical expressions are 
E* (GeV*) 2 2 
&¢ (keV) = 2.2183 ——_——. = 0.66503 E* (GeV*) B(T). (24.51) 


p (m) 


The synchrotron radiation spectrum from relativistic particles in a circular acceler- 
ator is made up of harmonics of the particle revolution frequency wo with values up 
to and beyond the critical frequency (24.51). Generally, a real synchrotron radiation 
beam from say a storage ring will not display this harmonic structure. The distance 
between harmonics is extremely small compared to the extracted photon frequencies 
in the VUV and x-ray regime while the line width is finite due to the energy spread 
and beam emittance. 

For a single pass of particles through a bending magnet in a beam transport line 
we observe the same spectrum. Specifically, the maximum frequency is the same 
assuming similar parameters. Synchrotron radiation is emitted in a particular spatial 
and spectral distribution, both of which will be derived in Chap. 25, and we will 
use here only some of these results. A useful parameter to characterize the photon 
intensity is the photon flux per unit solid angle into a frequency bin Aw/q and from 
a circulating beam current J defined by 


PN, A : 
ae = Cab (2) Ki, 6) FES). (24.52) 


where vy is the angle in the deflecting plane and @ the angle normal to the deflecting 
plane, 


a aes ei photons 


° Ane (m2? s mrad2GeV2A 100%BW’ 


(24.53) 
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Fig. 24.12 Modified Bessel’s functions K1/3 (&) and K/3 (€) 


a the fine structure constant and 


(24.54) 


F (€,8) = (1+ 776)" (: ee ce 3] . 


1 + y?6? K3,, () 


The functions K1/3 (€) and K2/3 (€), displayed in Fig. 24.12, are modified Bessel’s 
functions with the argument 


g= 19 (14 y262)?, (24.55) 


Synchrotron radiation is highly polarized in the plane normal (o-mode), and parallel 
(zt-mode), to the deflecting magnetic field. The relative flux in both polarization 
directions is given by the two components in the second bracket of function F (&, @) 
in (24.54). The first component is equal to unity and determines the photon flux 
for the polarization normal to the magnetic field or o-mode, while the second term 
relates to the polarization parallel to the magnetic field which is also called the zr- 
mode. Equation (24.52) expresses both the spectral and spatial photon flux for both 
the o-mode radiation in the forward direction within an angle of about +1/y and 
for the z-mode off axis. 

For highly relativistic particles the synchrotron radiation is collimated very much 
in the forward direction and we may assume that all radiation in the nondeflecting 
plane is accepted by the experimental beam line. In this case we are interested in 
the photon flux integrated over all angles 9. This integration will be performed in 
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Chap. 26 with the result (25.140) 


dN, da IA 
ei 2s(2), (24.56) 


dy 9'ew \a 


where w is the deflection angle in the bending magnet, « the fine structure constant 
and the function S (x) is defined by 


s(=) _ v3 © / Ksy3 (&) de (24.57) 


We 


with K5/3(x) a modified Bessel’s function. The function S(@/w¢) is known as the 
universal function of synchrotron radiation and is shown in Fig. 24.13. In practical 
units, the angle integrated photon flux is 


dN ph 
dy 


A 
=(, 21-8 (=) (24.58) 
w (2) 


Cc 


with Cy defined by 


photons 


——____—— . (24.59) 
srad A GeV 


Go. eso iit 
9e mc? 

The spectral distribution depends only on the particle energy, the critical 
frequency w, and a purely mathematical function. This result has been derived 
originally by Ivanenko and Sokolov [22] and independently by Schwinger [23]. 
Specifically it should be noted that the spectral distribution, if normalized to the 
critical frequency, does not depend on the particle energy and can therefore be 
represented by a universal distribution shown in Fig. 24.13. 

The energy dependence is contained in the cubic dependence of the critical 
frequency acting as a scaling factor for the actual spectral distribution. The 
synchrotron radiation spectrum in Fig. 24.13 is rather uniform up to the critical 
frequency beyond which the intensity falls off rapidly. This synchrotron radiation 
spectrum has been verified experimentally soon after such radiation sources became 
available [24, 25]. 

Equation (24.56) is not well suited for quick calculation of the radiation intensity 
at a particular frequency. We may, however, express (24.56) in much simpler form 
for very low and very large frequencies making use of limiting expressions of 
Bessel’s functions for large and small arguments. For small arguments x = 2 < 1 
an asymptotic approximation [26] for the modified Bessel’s function may be used 
to give instead of (24.58) with AS(9.6.9) [26] 


dN, A on 
a x Cy EI— 1.333 (=) (24.60) 
oO We 
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Fig. 24.13 Universal function of the synchrotron radiation spectrum, S(w/@-) 


Similarly, for high photon frequencies x = ° >> 1 we get with AS(9.7.2) [26] 


dNon 
dy 


mid (24.61) 
ex 


Aw 


where x = ©. Both approximations are included in Fig. 24.13 and display actually 
a rather good representation of the real spectral radiation distribution over all but 
the central portion of the spectrum where S(x) ~ 0.4. Specifically, we note the slow 
increase in the radiation intensity at low frequencies and the exponential drop off 
above the critical frequency. 


24.4 Spatial Photon Distribution 


The expressions for the photon fluxes (24.52), (24.56) provide the opportunity 
to calculate the spectral distribution of the photon beam divergence. Photons are 
emitted into a narrow angle and we may represent this narrow angular distribution 
by a Gaussian distribution. The effective width of a Gaussian distribution is /27104 
and we set 


dNpn — d?Non 


ae  apag J 2209. (24.62) 
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Fig. 24.14 Scaling function f(x) = og (mrad) E(GeV) for the photon beam divergence in (24.63) 


With (24.52), (24.58) the angular divergence of the forward lobe of the photon beam 
or for a beam polarized in the o-mode is 


_ % 1 8@ _ f®@ 
aoe dal J2nCo BVK (3x) ~ E(GeV)’ ee) 


where x = @/@,. For the forward direction 6 ~ 0 the function f(x) = 
og (mrad) E (GeV) is shown in Fig. 24.14 for easy numerical calculations. 

For wavelengths w < We, (24.63) can be greatly simplified to become in more 
practical units 


06 (mrad) ~ 


1/3 
0.54626 (- ) _ : 7.124 (24.64) 


E (GeV) \ a p(m) en (eV)? 
where p is the bending radius and ¢€, the photon energy. The photon beam 
divergence for low photon energies compared to the critical photon energy is 
independent of the particle energy and scales inversely proportional to the third root 
of the bending radius and photon energy. 


24.5 Fraunhofer Diffraction 


Synchrotron radiation is emitted from a rather small area equal to the cross section 
of the electron beam. In the extreme and depending on the photon wavelength the 
radiation may be spatially coherent because the beam cross section in phase space 
is smaller than the wavelength. This possibility to create spatially coherent radiation 
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Fig. 24.15 Diffraction geometry 


is important for many experiments specifically for holography and we will discuss 
in more detail the conditions for the particle beam to emit such radiation. 

Reducing the particle beam cross section in phase space by diminishing the 
particle beam emittance reduces also the source size of the photon beam. This 
process of reducing the beam emittance is, however, effective only to some point. 
Further reduction of the particle beam emittance would have no effect on the 
photon beam emittance because of diffraction effects. A point like photon source 
appears in an optical instrument as a disk with concentric illuminated rings. For 
synchrotron radiation sources it is of great interest to maximize the photon beam 
brightness which is the photon density in phase space. On the other hand designing 
a lattice for a very small beam emittance can cause beam stability problems. It is 
therefore prudent not to push the particle beam emittance to values much less than 
the diffraction limited photon beam emittance. In the following we will therefore 
define diffraction limited photon beam emittance as a guide for low emittance lattice 
design. 

For highly collimated synchrotron radiation it is appropriate to assume Fraun- 
hofer diffraction. Radiation from an extended light source appears diffracted in the 
image plane with a radiation pattern which is characteristic for the particular source 
size and radiation distribution as well as for the geometry of the apertures involved. 
For simplicity, we will use the case of a round aperture being the boundaries of 
the beam itself although in most cases the beam cross section is more elliptical. 
In spite of this simplification, however, we will obtain all basic physical properties 
of diffraction which are of interest to us. We consider a circular light source with 
diameter 2a. The radiation field at point P in the image plane is then determined by 
the Fraunhofer diffraction integral [27] 


a 20 
UP) =C / i e Keweos-Wd@ odo, (24.65) 
0 0 


where k is the wave number of the radiation and w is the sine of the angle between 
the light ray and the optical axis as shown in Fig. 24.15. 
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Fig. 24.16 Fraunhofer diffraction for a circular uniform light source 


With a = ©—vw and the definition of the lowest order Bessel’s function Jo (x) = 
_ te e108 dey, (24.65) can be expressed by the integral 


a 


U(P) = 22nC / ‘To (kpw) pdp. (24.66) 
0 


This integral can be solved analytically as well with the identity ie Jo(y) ydy = 
xJ; (x). The radiation intensity is proportional to the square of the radiation field and 
we get finally for the radiation intensity in the image plane at the point P 


AJ : (kaw) 


I(P) = 1S awe 


(24.67) 


where /(P) = |U (P)|? and Jy) = I(w — 0) is the radiation intensity at the image 
center. This result has been derived first by Airy [28]. The radiation intensity from a 
light source of small circular cross section is distributed in the image plane due 
to diffraction into a central circle and concentric rings illuminated as shown in 
Fig. 24.16. 

Tacitly, we have assumed that the distribution of emission at the source is uniform 
which is generally not correct for a particle beam. A Gaussian distribution is more 
realistic resembling the distribution of independently radiating particles. We must 
be careful in the choice of the scaling parameter. The relevant quantity for the 
Fraunhofer integral is not the actual particle beam size at the source point but rather 
the apparent beam size and distribution. By folding the particle density distribution 
with the argument of the Fraunhofer diffraction integral we get the radiation field 
from a round, Gaussian particle beam, 


oo p 
Ucg(P) « i exp (-5) Jo (kpw) pdp, (24.68) 
0 0; 
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where o, is the apparent standard source radius. Introducing the variable x = 
p//20, and replacing kow = /2xko,w = 2x,/z we get from (24.68) 
ca 2 
Ug(P) « / e* xJo (2x,/z) dx (24.69) 
0 
and after integration 
Uc(P) « exp |-3 (ko, wy] (24.70) 


The diffraction pattern from a Gaussian light source (Fig. 24.17) does not exhibit 
the ring structure of a uniform source. The radiation field assumes rather the form of 


a Gaussian distribution in the emission angles w with a standard width of on = (w?) 
or 


(24.71) 


24.6 Spatial Coherence 


Synchrotron radiation is emitted into a broad spectrum with the lowest frequency 
equal to the revolution frequency and the highest frequency not far above the critical 
photon energy. Detailed observation of the whole radiation spectrum, however, may 
reveal significant differences to these theoretical spectra at the low frequency end. 
At low photon frequencies we may observe an enhancement of the synchrotron 
radiation beyond intensities predicted by the theory of synchrotron radiation as 
discussed so far. We note from the definition of the Poynting vector that the 
radiation power is a quadratic effect with respect to the electric charge. For photon 
wavelengths equal and longer than the bunch length, we expect therefore all particles 
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within a bunch to radiate coherently and the intensity to be proportional to the square 
of the number JV, of particles rather than linearly proportional as is the case for high 
frequencies. This quadratic effect can greatly enhance the radiation since the bunch 
population can be 10° — 10!! electrons. 

Generally such radiation is not emitted from a storage ring beam because 
radiation with wavelengths longer than the vacuum chamber dimensions are 
shielded and will not propagate along a metallic beam pipe [29]. This radiation 
shielding is fortunate for storage ring operation since it eliminates an otherwise 
significant energy loss mechanism. Actually, since this shielding affects all radiation 
of sufficient wavelength both the ordinary synchrotron radiation and the coherent 
radiation is suppressed. New developments in storage ring physics, however, may 
make it possible to reduce the bunch length by as much as an order of magnitude 
below presently achieved short bunches of the order of 5-10mm. Such bunches 
would then be much shorter than vacuum chamber dimensions and the emission of 
coherent radiation in some limited frequency range would be possible. Much shorter 
electron bunches down to a few fs can be produced in linear accelerators [30, 31], 
and specifically with bunch compression [32] a significant fraction of synchrotron 
radiation is emitted spontaneously as coherent radiation [33]. 

In this section we will discuss the physics of spontaneous coherent synchrotron 
radiation while distinguishing two kinds of coherence in synchrotron radiation, the 
temporal coherence and the spatial coherence. Temporal coherence occurs when 
all radiating electrons are located within a short bunch length of the order of the 
wavelength of the radiation. In this case the radiation from all electrons is emitted 
with about the same phase. For spatial coherence the electrons may be contained in 
a long bunch but the transverse beam emittance must be smaller than the radiation 
wavelength. In either case there is a smooth transition from incoherent radiation 
to coherent radiation as determined by a formfactor which depends on the bunch 
length or transverse emittance. 

Similar to the particle beam characterization through its emittance we may do 
the same for the photon beam and doing so for the horizontal or vertical plane we 
have with oxy = o,/ /2 and Ov yt = Oo, / /2 the photon beam emittance 


€phaxy = 50,0, = as (24.72) 

This is the diffraction limited photon emittance and reducing the electron beam 
emittance below this value would not lead to an additional reduction in the photon 
beam emittance. To produce a spatially coherent or diffraction limited radiation 
source the particle beam emittance must be less than the diffraction limited photon 
emittance 


Xr 
Exy S ae (24.73) 
; I 
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Fig. 24.18 Apparent photon source size 


Obviously, this condition is easier to achieve for long wavelengths. For visible 
light, for example, the electron beam emittance must be smaller than about 5 x 
10~8 rad-m to be a spatially coherent radiation source. After having determined the 
diffraction limited photon emittance we may also determine the apparent photon 
beam size and divergence. The photon source extends over some finite length L 
along the particle path which could be either the path length required for a deflection 
angle of 2/y or a much longer length in the case of an undulator to be discussed in 
the next chapter. With o,, the diffraction limited beam divergence the photons seem 
to come from a disc with diameter (Fig. 24.18) 


D=oyL. (24.74) 
On the other hand, we know from interference theory the correlation 
Dsino, + Dow = Xr (24.75) 


and eliminating D from both equations gives the diffraction limited photon beam 
divergence 


me 24.76) 
OY = - ( . 


With this we get finally from (24.71) also the diffraction limited source size 
1 
0, = — VAL. (24.77) 
20 


The apparent diffraction limited, radial photon beam size and divergence depend 
both on the photon wavelength of interest and the length of the source. 
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24.7 Temporal Coherence 


To discuss the appearance of temporal coherent synchrotron radiation, we consider 
the radiation emitted from each particle within a bunch. The radiation field at a 
frequency w from a single electron is 


E; x ellortei) (24.78) 


where ; describes the position of the jth electron with respect to the bunch center. 
With z; the distance from the bunch center, the phase is 


oj = 5. (24.79) 


Here we assume that the cross section of the particle beam is small compared to 
the distance to the observer such that the path length differences from any point of 
the beam cross section to observer are small compared to the shortest wavelength 
involved. The radiation power is proportional to the square of the radiation field and 
summing over all electrons we get 


Ne Ne 
Pw) «> GE « >> elorte)) ort) 
il jd 


Ne Ne 
= ye exp i(gj — 1) = Ne + > expi (9; = (1) : (24.80) 
jl JAl 


The first term N. on the r.h.s. of (24.80) represents the ordinary incoherent 
synchrotron radiation with a power proportional to the number of radiating particles. 
The second term describes the coherent power averaging to zero for all but 
long wavelengths. The actual coherent radiation power spectrum depends on the 
particular particle distribution in the bunch. For a storage ring bunch it is safe to 
assume a Gaussian particle distribution and we use therefore the density distribution 


Ne z 
ae =e exp aes (24.81) 


where o is the standard value of the Gaussian bunch length. Instead of summing over 
all electrons we integrate over all phases and folding the density distribution (24.81) 
with the radiation power (24.80) we get with (24.79) 


Noa I 
P(@) « Ne + Ne——> hh, (24.82) 
2no02 
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where the integrals J; and J, are defined by 


=e a z 

i [. exp (-5 + i2n;) dz, (24.83a) 
FOO w w 

h= [. exp (- +127 i) dw, (24.83b) 


and z = 5 Ag; and w = SHAQ). The factor NV. — | reflects the fact that we integrate 
only over different particles. Both integrals are equal to the Fourier transform for a 
Gaussian particle distribution. With 


+00 2 2 
ee Nie al ee (al 
[. exp ( 702 +i2n}) dz = 210 exp | 20 (5) (24.84) 


we get from (24.82) for the total radiation power at the frequency w = 21c/A 
P(w) = p(w) Ne[1 + (Ne— 1) 8 (0.A)] . (24.85) 


where p (@) is the radiation power from one electron and the Fourier transform 


g’ (0, A) = exp |-2"? (5) | (24.86) 


is called the formfactor. With the effective bunch length 


L=<J/210 (24.87) 
this formfactor becomes finally 
2 e 
g (€,A) = exp |-= =| : (24.88) 


The coherent radiation power falls off rapidly for wavelengths as short or even 
shorter than the effective bunch length @. In Fig. 24.19 the relative coherent radiation 
power is shown as a function of the effective bunch length in units of the radiation 
wavelength. The fast drop off is evident and for an effective bunch length of about 
£ = 0.6A the radiation power is reduced to only about 10 % of the maximum power 
for very short bunches. Particle beams from a linear accelerator have often a more 
compressed particle distribution of a form between a Gaussian and a rectangular 
distribution. If we take the extreme of a rectangular distribution 


1 for —j£<z< te 


Ww, = : 
@) 0 otherwise 


(24.89) 


848 24 Overview of Synchrotron Radiation 


13 
200.2) 4 Gaussian 
oS : 4 distribution 
0.01 
a rectangular ~ 
0.001 4 distribution 
0.0001 T i TE UTTT T T UVTI T T PTETET 
0.01 0.1 1 n/a, 10 


Fig. 24.19 Formfactor g*(€, A) for a Gaussian and rectangular particle distribution 


we expect to extend the radiation spectrum since the corners and sharp changes of 
the particle density require a broader spectrum in the Fourier transform. Following 
the procedure for the Gaussian beam we get for a rectangular particle distribution 
the Fourier transform 
sin x 
g(f) = —, (24.90) 
x 

where x = a/A. Figure 24.19 also shows the relative coherent radiation power 
for this distribution and we note a significant but scalloping extension to higher 
radiation frequencies. Experiments have been performed with picosecond electron 
bunches from linear accelerators both at Tohoku University [30] and at Cornell 
University [31] which confirm the appearance of this coherent part of synchrotron 
radiation. 


24.8 Spectral Brightness 


The optical quality of a photon beam is characterized by the spectral brightness 

defined as the six-dimensional volume occupied by the photon beam in phase space 
N, 

ne (24.91) 


~ dw ’ 
41070,0y Oyoy 


where Non is the photon flux defined in (24.58). In the laser community, this 
quantity is called the radiance while the term spectral brightness is common in the 
synchrotron radiation community.” 


Sometimes the term brilliance is used. Since there is no common definition for brilliance and the 
dictionary does not connect brilliance with say luminescence of a source we do not use this term 
in this book. 
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For bending magnet radiation there is a uniform angular distribution in the 
deflecting plane and we must therefore replace the Gaussian divergence o, by the 
total acceptance angle Ay of the photon beam line or experiment. The particle beam 
emittance must be minimized to achieve maximum spectral photon beam brightness. 
However, unlimited reduction of the particle beam emittance will, at some point, 
seize to further increase the brightness. Because of diffraction effects the electron 
beam emittance need not be reduced significantly below the limit (24.72) discussed 
in the previous section. 

For a negligible particle beam emittance the maximum diffraction limited 
spectral brightness is from (24.72), (24.91) 


4 


Big jaa Nom. (24.92) 


For a realistic synchrotron light source the finite beam emittance of the particle 
beam must be taken into account as well which is often even the dominant emittance 
being larger than the diffraction limited photon beam emittance. We may add both 
contributions in quadrature and have for the total source parameters 


= 2 1 =, 2 1-2 

Oto.x = Ox 1 507 , Orot,x’ = yf % x + 50,,, (24.93) 
—_ 2 1 —_ 2, 1 2 

ory = V My at 50; ; Story’ = Vf %My + zy > (24.94) 


where oy refers to the respective particle beam parameters. 


24.8.1 Matching 


A finite particle beam emittance does reduce the photon beam brightness from it’s 
ideal maximum. The amount of reduction, however, depends on the matching to 
the photon beam. The photon beam size and divergence are the result of folding 
the diffraction limited source emittance with the electron beam size and divergence. 
In cases where the electron beam emittance becomes comparable to the diffraction 
limited emittance the effective photon beam brightness can be greatly affected by 
the mutual orientation of both emittances. Matching both orientations will maximize 
the photon beam brightness. 
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Fig. 24.20 Matching of the electron beam emittance to the diffraction limited emittance to gain 
maximum photon beam brightness 


This matching process is demonstrated in Fig. 24.20. The left side shows a 
situation of poor matching in 2-dimensional x — x’-phase space. In this case the 
electron beam width is much larger than the diffraction limited source size while 
its divergence is small compared to the diffraction limit. The effective photon 
beam distribution in phase space is the folding of both electron beam parameters 
and diffraction limit and is much larger than either one of its components. The 
photon beam width is dominated by the electron beam width and the photon beam 
divergence is dominated by the diffraction limit. Consequently, the effective photon 
density in phase space and photon beam brightness is reduced. 

To improve the situation one would focus the electron beam to a smaller beam 
size at the source point at the expense of beam divergence. The reduction of the 
electron beam width increases directly the photon beam brightness while the related 
increase of the electron beam divergence is ineffective because the diffraction limit 
is the dominant term. Applying more focusing may give a situation shown on the 
right side of Fig. 24.20 where the folded photon phase space distribution is reduced 
and the brightness correspondingly increased. Of course, if the electron beam is 
focused too much we have the opposite situation as discussed. There is an optimum 
focusing for optimum matching. To find this optimum we use the particle beam 
parameters 

2 2 Exy 
Oxy = &xyBry and Oh yy = (24.95) 


sy! ’ 
By 


where fy are the betatron functions at the photon source location and €,, the beam 
emittances, in the horizontal and vertical plane respectively. Including diffraction 
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limits, the product 


Otot.xOtot.x’ = €xBx - 502 2. + $07, (24.96) 
has a minimum (GF Mrot.aMrot.’ = 0) for 
0; L 
pb. = — =. (24.97) 
Oo, 20 


A similar optimum occurs for the vertical betatron function at the source point. 
The optimum value of the betatron functions at the source point depends only on 
the length of the undulator. 

The values of the horizontal and vertical betatron functions should be adjusted 
according to (24.97) for optimum photon beam brightness. In case the particle beam 
emittance is much larger than the diffraction limited photon beam emittance, this 
minimum is very shallow and almost nonexistent in which case the importance of 
matching becomes irrelevant. As useful as matching may appear to be, it is not 
always possible to reach perfect matching because of limitations in the storage ring 
focusing system. Furthermore it is practically impossible to get a perfect matching 
for bending magnet radiation since the effective source length L is very small, L = 


2p/y. 


24.9 Photon Source Parameters 


In the previous paragraph, we have assumed that there is no dispersion at the 
source point. This is not always true and we have to modify our beam sizes to take 
the effect of energy spread and dispersion into account. Still simplifying, we use 
only the horizontal dispersion. Where this is not acceptable, the vertical dispersion 
effects have to be added in quadrature. The beam width or height is defined by the 
contribution of the betatron phase space og, and the energy phase space o,,,,, and 
is 


2 
O¢ 
nS 4 [03 vy +o; = yobs + (nz) (24.98) 


2 
xy 


7 I+a , ioe 
2 — — oe = = 1 
with Og vy = Say Byy and 0) = Ba? Vey and ay.) = SP uy Similarly, we 


get for the beam divergence 


7 5 2 / Oe ; 24.99 
Op,x! yf = OB x! y/ as on = ExyYay TM Eo ; ond 
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These beam parameters resemble in general the source parameters of the photon 
beam. Deviations occur when the beam emittance becomes very small, comparable 
to the photon wavelength of interest. First the matching conditions should be 
checked and modified if necessary. Second, the photon source parameters may be 
modified by diffraction effects which limit the apparent source size and divergence 
to some minimum values even if the electron beam cross section and divergence 
should be very small. For radiation at a wavelength A, the diffraction limited radial 
photon source parameters are* 


1 rv 
0, = — VAL and Ov = 4/-. (24.100) 
20 L 
Projection onto the horizontal or vertical plane gives 0, = o,/ J/2 etc. Due 


to diffraction, it is not useful to push the electron beam emittance to values much 
smaller than 


ny 
€xy = —. (24.101) 
: 4a 


For an arbitrary electron beam cross section the photon source parameters are the 
quadratic sums of both contributions 


Coney = Sexy + 3% > (24.102) 
Orn a! y! = Oex' y! r 50,7 . (24.103) 


The contribution from diffraction can be ignored if 


Xr 
Exy > —, (24.104) 
" 4n 


which is generally true in the x-direction but not in the y-direction because of the 
small coupling in a storage ring. 
Problems 


24.1 (S). Bending magnet radiation (9 = 2m) from a 800 MeV, 500 mA storage 
ring includes a high intensity component of infrared radiation. Calculate the photon 


3Many authors use a different definition o, = o,/ /2. The difference is mainly that the subscript , 
refers to radiation and the related beam parameters are already projected to the x or y-plane. In this 
text, we use the subscript , from the radial coordinate since we derive the diffraction effects from 
a round beam. 
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beam brightness for A = 10 \1m radiation at the experimental station which is 5m 
away from the source. The electron beam cross section is 0p,, X 0p, = 1.10.11 mm 
and its divergence 0p,’ X Opy’ = 0.11 x 0.011 mrad. What is the corresponding 
brightness for infrared radiation from a black body radiator at 2,000 K with a source 
size of x x y = 10 x 2mm? (Hint: the source length L = 026,44 where 6,24 is the 
vertical opening angle of the radiation.) 


24.2 (S). What is the probability for a 6GeV electron to emit a photon with an 
energy of € = o, per unit time travelling on a circle with radius p = 25m. How 
likely is it that this particle emits another such photon within a damping time? In 
evaluating quantum excitation and equilibrium emittances, do we need to consider 
multiple photon emissions? (use isomagnetic ring) 


24.3 (S). Derive a formula for the average number of photons emitted by an 
electron of energy E per turn. How many are these for E = 3 GeV and p = 10m. 


24.4 (S). In a 7 GeV electron ring the circulating beam current is 200 mA and the 
bending radius p = 20m. Your experiment requires a photon flux of 10° photons/sec 
at a photon energy of 8 keV, within a band width of 10~* onto a sample with a cross 
section of 10 x 10 j1m7and your experiment is 15 m away from the source point. Can 
you do your experiment on a bending magnet beam line of this ring? 


24.5 (S). How well is the electron beam phase space of exercise 24.1 at the source 
matched to the photon beam? Show the phase space ellipses of both the electron and 
the photon beam in phase space and in x and y. 


24.6 (S). Derive an expression for the total synchrotron radiation power from a 
wiggler magnet. 


24.7. Verify the numerical validity of Eqs. (24.4), (24.43), (24.51), (24.53), (24.59) 


24.8 (S). In the SLAC linear accelerator operating at 100Hz electrons can be 
accelerated to 50 GeV at a rate of 17 MeV/m. Calculate to total radiation power 
from 10° electrons per pulse at 50 GeV due to longitudinal acceleration. Compare 
with the radiation power if this bunch of 10? electrons is deflected at the same energy 
by | mrad in a 0.6 T bending magnet. 


24.9 (S). Consider an electron storage ring at an energy of 1 GeV, a circulating 
current of 200mA and a bending radius of op = 2.22m. Calculate the energy 
loss per turn, the critical energy and the total synchrotron radiation power. At what 
frequency in units of the critical frequency has the intensity dropped to | % of the 
maximum? Plot the radiation spectrum and determine the frequency range available 
for experimentation. 


24.10. What beam energy would be required to produce x-rays from the storage 
ring of problem 24.9 at a critical photon energy of 10 keV? Is that energy feasible 
from a conventional magnet point of view or would the ring have to be larger? What 
would the new beam energy and bending radius have to be? 
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24.11. Consider a storage ring with an energy of 1 GeV and a bending radius of 
p = 2.5m. Calculate the angular photon flux density dN /dw for a high photon 
energy é where the intensity is still 1 % of the maximum spectral intensity. What is 
this maximum photon energy? Installing a wavelength shifter with a field of B = 6T 
allows the spectrum to be greatly extended. By how much does the spectral intensity 
increase at the photon energy é and what is the new photon energy limit for the 
wavelength shifter? 


24.12. Consider an electromagnetic wavelength shifter in a 1 GeV storage ring with 
a central pole length of 30cm and a maximum field of 6T. The side poles are 
60cm long and for simplicity assume that the field in all poles has a sinusoidal 
distribution along the axis. Determine the focal length due to edge focusing for the 
total wavelength shifter. To be negligible, the focal length should typically be longer 
than about 30 m. Is this the case for this wavelength shifter? 


24.13. Collide a 25 MeV electron beam with a 1 kW CO>-laser beam (A = 10 wm). 
What is the energy of the backscattered photons? Assume a diffraction limited 
interaction length of twice the Rayleigh length and an electron beam cross section 
matching the photon beam. Calculate the x-ray photon flux for an electron beam 
from a 3 GHz linear accelerator with a pulse length of 1\s, a repetition rate of 10 Hz 
and a pulse current of 10OmA. 
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Chapter 25 
Theory of Synchrotron Radiation 


The phenomenon of synchrotron radiation has been introduced in a conceptual 
way and a number of basic relations have been derived. In this chapter we will 
approach the physics of synchrotron radiation in a more formal way to exhibit 
detailed characteristics. Specifically, we will derive expressions for the spatial and 
spectral distribution of photon emission in a way which is applicable later for special 
insertion devices. 

The theory of synchrotron radiation is intimately related to the electromagnetic 
fields generated by moving charged particles. Wave equations can be derived from 
Maxwell’s equations and we will find that any charged particle under the influence 
of external forces can emit radiation. We will formulate the characteristics of this 
radiation and apply the results to highly relativistic particles. 


25.1 Radiation Field 


The electromagnetic fields for a single moving point charge will be derived first 
and then applied to a large number of particles. Fields are determined by Maxwell’s 
equations (1.4) for moving charges in vacuum where €, = Ly = 1. The magnetic 
field can be derived from a vector potential A defined by 


B=VxA. (25.1) 


Inserting the vector potential into Faraday’s law (1.4) we haveV x (E + eS) = 0, 
or after integration 


dA 
E=-——V¢, (25.2) 
ot 
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where ¢ is the scalar potential. We choose the scalar potential such that cVA + 
ioe = 0, a condition known as the Lorentz gauge. With (A.21) applied to A the 
expression for the electric field together with Ampere’s law (1.4) results in the wave 
equation 


VA poe ae B (25.3) 
c2 Of 7 ae : : 


Similarly, we derive the wave equation for the scalar potential 
-sSa =P: (25.4) 


These are the well-known wave equations with the solutions 


Aga i WOE! ipance (25.5) 
4n R i 
and 
1 ae 
y(t) = pe 2 dea de. (25.6) 
Ame R t 


Because of the finite velocity of light, all quantities under the integrals must be 
evaluated at the retarded time 


eo 1 (4) (25.7) 
iC 


when the radiation was emitted by the moving charge, in contrast to the time f 
when the radiation is observed at a distant point. The quantity R is the distance 
between the observation point P(x, y,z) and the location of the charge element 
P(X; Yrs Zr) dx,-dy,dz, at the retarded time t,. The vector 


R=(, —%,¥r — Ys — 2) (25.8) 


points away from the observation point to the charge element at the retarded time as 
shown in Fig. 25.1. 

Special care must be exercised in performing the integrations. Although we 
consider only a point charge gq, the integral in (25.6) cannot be replaced by qg/R but 
must be integrated over a finite volume followed by a transition to a point charge. 
As we will see this is a consequence of the fact that the velocity of light is finite and 
therefore the movement of charge elements must be taken into account. 

To define the quantities involved in the integration we use Fig.25.1. The 
combined field at the observation point P at time t comes from all charges located at 
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Vv particle trajectory 


charge q at time t 


—=_ 
P(x! y',z! . 
(y'2) observation 
point 


p(x',y',z') 


charge q ie 


at time t' radiation emitted at 


time t' toward 
observation point 


Fig. 25.1 Retarded position of a moving charge distribution 


a distance R away from P. We consider the contribution from all charges contained 
within a spherical shell centered at P with a radius R and thickness dr to the radiation 
field at P and time f. Radiation emitted at time 4, will reach P at the time ¢. If 
do is a surface element of the spherical shell, the volume element of charge is 
dx dy dz =dodr. The retarded time for the radiation from the outer surface of the 
shell is f; and the retarded time for the radiation from the charge element on the 
inner surface of the shell is f, — x From Fig. 25.1 we find the electromagnetic field 
observed at P at time ¢ to originate from the fractional charges within the volume 
element dod, or from the charge element dq = pdodr. 

The radiation observed at point P and time ¢ is the sum of all radiation arriving 
simultaneously at P. Elements of this radiation field may have been emitted by 
different charge elements and at different times and locations. In case of only one 
electrical charge moving with velocity v, we have to include in the integration those 
charge elements that move across the inner shell surface into the volume dodr 
during the time dr/c. For a uniform charge distribution this additional charge is 
6q = pun dtdo where n is the vector normal to the surface of the shell and pointing 
away from the observer 


(25.9) 


ls 


With dt =dr/c and B = v/c, we get then for both contributions to the charge 
element 


dg = p(1+nB)drdo . (25.10) 


Depending on the direction of the velocity vector B, we find an increase or a 
reduction in the radiation field from moving charges. We solve (25.10) for p drdo 
and insert into the integrals (25.5), (25.6). Now we may use the assumption that the 
electrical charge is a point charge and get for the retarded potentials of a moving 
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point charge q at time f and observation point P 


1 q B 
A(P,t) = = 25.11 
(Pt) 4acen R1+nB , ( ) 
and 
1 gq 1 
P,t) = — = ; 25.12 
eed) 4re9 R1+nB ‘. ( ) 


These equations are known as the Liénard—Wiechert potentials and express the 
field potentials of a static or moving charge as functions of the charge parameters at 
the retarded time. To obtain the electric and magnetic fields we insert the retarded 
potentials into (25.1), (25.2) noting that the differentiation must be performed with 
respect to the time ¢ and location P of the observer while the potentials are expressed 
at the retarded time f,. 

In both equations for the vector and scalar potential we have the same denomi- 
nator 


r=R(1+nB). (25.13) 


It will become necessary to calculate the derivative of the retarded time with respect 
to the time t and since t, = t — R/c the time derivative of t, is 


dt, i 1 dR dt, (05.14) 
dt c dt, dt , 


The variation of the distance R with the retarded time depends on the velocity v of 
the moving charge and is the projection of the vector v df, onto the unity vector n. 
Therefore, 


dR = vndt, (25.15) 


and (25.14) becomes with (25.14) and (25.13) 


dt, 1 R 


dt 1+np or 


(25.16) 


The electric field (25.2) is with (25.11), (25.12) and (25.16) after a few manipula- 
tions expressed by 


_1R0B | BRar 1 


cro cro rr 


4ney— = Vir. (25.17) 
q 
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In evaluating the nabla operator and other differentials we remember that all 
parameters on the r.h.s. must be taken at the retarded time (25.7) which itself 
depends on the location of the observation point P. To distinguish between the 
ordinary nabla operator and the case where the dependence of the retarded time 
on the position P(x, y, z) must be considered, we add to the nla syne! the index 


, like V,. The components of this operator are then Z|. = a 2 + a ne and similar 
for the other components. We evaluate first 
Vir = V,R + V; (BR) (25.18) 
and with VR = —n from (25.8) 
OR 
V,R = —-n+ —Vt,. (25.19) 
Ot; 
For the gradient of the retarded time, we get 
1 1 1 OR 
Vi, = Vi t— —R(t) | = —-V.R = —- [-n + —Vt, (25.20) 
Cc Cc Cc Ot; 
and performing the differentiation we get with 2 5 = = Ux,... 
OR OR ox, OR dy, | OR OX 
= =nv. 25.21 
Bt, ax; Ot Oy, Ot, Oz Ot, aad 
Solving (25.20) for Vif, we get 
R 
Vi = — (25.22) 
cr 
and (25.19) becomes finally 
R 
V.R=-n+—(Bn). (25.23) 
r 


For the second term in (25.18) we note that the velocity v does not depend on 
the location of the observer and with V,R—1, (25.22) and 


= 25.24 
co ( ) 


we get for the second term in (25.18) 


V, (BR) = =p + “EM vy, =-6 +(R a) ate. (95.95) 
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To oe the evaluation of the electric field in (25.17), we express the 
derivative # 7, with 


ar _ aR | ABR) _ 
Ot, Oty Oty 


op 
Ot 


cnB + cB* + R— (25.26) 


where we made use of (25.21). Collecting all differential expressions required 
in (25.17) we get with (25.18), (25.23), (25.25), (25.26) 


trey = — aren + — (np 4p? 1R)| 


pe 
r 
R 
cr~ 


gp HPS * (np +p + 1BR) (25.27) 


where B =dB /dt,. After some manipulation and using (A.10), the equation for the 
electrical field of a charge g moving with velocityy becomes 


1—p? 
r 


; (25.28) 


E 
4r€)— => 
q r 


(R+RB),+— {Rx | (+ RB), * al 
cr dt, 


where we have added the index ; as a reminder that all quantities on the r.h.s. 
of (25.28) must be taken at the retarded time ¢,. 

This equation for the electric field of a moving charge has two distinct parts. The 
first part is inversely proportional to the square of the distance between radiation 
source and observer and depends only on the velocity of the charge. For a charge 
at rest B=0 this term reduces to the Coulomb field of a point charge qg. The area 
close to the radiating charge where this term is dominant is called the Coulomb 
regime. The field is directed toward the observer for a positive charge at rest and 
tilts into the direction of propagation as the velocity of the charge increases. For 
highly relativistic particles we note the Coulomb field becomes very small. 

We will not further consider this regime since we are interested only in the 
radiation field far away from the moving charge. The second term in (25.28) depends 
on the velocity as well as on the acceleration of the charge. This term scales linear 
with the distance r falling off much slower than the Coulomb term and therefore 
reaches out to large distances from the radiation source. We call this regime the 
radiation regime and the remainder of this chapter will focus on the discussion of 
the radiation from moving charges. The electrical field in the radiation regime is 


os dp 
ae [Bx | e+ 88 xo] 


The polarization of the electric field at the location of the observer is purely 
orthogonal to the direction of observation R. Similar to the derivation of the electric 
field, we can derive the expression for the magnetic field and get from (25.1) 


E(t) 


47€y —— (25.29) 
q 


rad Tr 
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with (25.11) 
Bavixd=a[vex 8] =f Iv, )- S1vrx 8). (25.30) 
r ‘i r 


where again all parameters on the r.h.s. must be evaluated at the retarded time. The 
evaluation of the “retarded” curl operation V, x B becomes obvious if we evaluate 
one component only, for example, the x component 


0 Ot 0 0. Ot dp 
(5+ Fa le-(E+ Ea Wx Bl +] Vex F] (25.31) 


In a similar way, we get the other components and find with (25.22) and the fact 
that the particle velocity B does not depend on the coordinates of the observation 
point (V x B =O), 


[Vor B]= IV xB] +| Vix F | = - [rx | ‘ 


The gradient V,r has been derived earlier in (25.18) and inserting this into (25.30) 
we find the magnetic field of an electrical charge moving with velocity v 


Anceye =~ (B x n- [E xa] 
qd r 


25.32 
di ( ) 


T 


+5 (en+6° +f aR) [p orl 


Again, there are two distinct groups of field terms. In case of the electrical field 
the terms that fall off like the square of the distance are the Coulomb fields. For 
magnetic fields such terms appear only if the charge is moving B # 0 and are 
identical to the Biot—Savart fields. Here we concentrate only on the far fields or 
radiation fields which decay inversely proportional to the distance from the source. 
The magnetic radiation field is then given by 


2 = E <n oe (Fe \ oa (25.33) 


dt cr? 


Bit 
4mcéy @ 


rad 


Comparing the magnetic field (25.33) with the electrical field (25.28) reveals a 
very simple correlation between both fields. The magnetic field can be obtained 
from the electric field, and vice versa, by mere vector multiplication with the unit 
vector n 


B=!|Ex nl, . (25.34) 


864 25. Theory of Synchrotron Radiation 


From this equation we can deduce special properties for the field directions by 
noting that the electric and magnetic fields are orthogonal to each other and both are 
orthogonal to the direction of observation n. The existence of electric and magnetic 
fields can give rise to radiation for which the Poynting vector is 


S= 2 [E x Bl, = enc |E x (E x n)], . (25.35) 
CLO 


Using again the vector relation (A.10) and noting that the electric field is normal 
to n, we get for the Poynting vector or the radiation flux in the direction to the 
observer 


S = -e9c E7n|.. (25.36) 


Equation (25.36) defines the energy flux density measured at the observation 
point P and time fin form of synchrotron radiation per unit cross section and parallel 
to the direction of observation n. All quantities expressing this energy flux are still 
to be taken at the retarded time. For practical reasons it becomes desirable to express 
the Poynting vector at the retarded time as well. The energy flux at the observation 
point in terms of the retarded time is then dW/dt, = (dW/dr) (dt/dt,) and instead 
of (25.36) we express the Poynting vector with (25.16) like 


dt 
s:.= S— = —e€9cE* [(1+ Bn)n], . (25.37) 


The Poynting vector in this form can be readily used for calculations like those 
determining the spatial distribution of the radiation power. 


25.2 Total Radiation Power and Energy Loss 


So far, no particular choice of the reference system has been assumed, but a 
particularly simple reference frame £* is the one which moves uniformly with the 
charge before acceleration. From now on, we use a single particle with a charge e. 
To an observer in this reference system, the charge moves due to acceleration and 
the electric field in the radiation regime is from (25.29) 


E*(t) = : S| nx (nx) | 
4mey cR dt 


(25.38) 
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The synchrotron radiation power per unit solid angle and at distance R from the 
source is from (25.37) with v = 0 

dp* 

dx2 


= —nS"*R? = enc E*R?|_ : (25.39) 


Introducing the classical particle radius by e? = 4z€oremc” we obtain expres- 


sions which are independent of electromagnetic units and with (25.38) 


d * 
nx|{nx B 

dt 
where ?, is the retarded angle between the direction of acceleration and the direction 
of observation n. Integration over all solid angles gives the total radiated power. 


With dQ = sin },dv,d@, where ¢ is the azimuthal angle with respect to the direction 
of acceleration, the total radiation power is in agreement with (24.24) 


: _ remc” dB * 


4ac 


2) 
sin? 0, (25.40) 


T 


dpP* _ romc> 
dQ  4nc 


T 


2 


Palme (25.41) 


3 dt 


T 


This equation has been derived first by Larmor [1] within the realm of classical 
electrodynamics. The emission of a quantized photon, however, exerts a recoil on 
the electron varying its energy slightly. Schwinger [2] investigated this effect and 
derived a correction to the radiation power like 


P* = tastes (1 = 3) ’ (25.42) 


where €, is the critical photon energy and E the electron energy. The correction is 
generally very small and we ignore therefore this quantum mechanical effect in our 
discussions. 

Equation (25.41) must be transformed from the particle system to the laboratory 
frame of reference. This has been done already in Sect. 24.2. 


25.2.1 Transition Radiation 


Digressing slightly from the discussion of synchrotron radiation we turn our 
attention to the solution of (25.39). Generally, we do not know the fields E* and 
to solve (25.40) we need to know more about the particular trajectory of the particle 
motion. In the case of transition radiation, we have, however, all information to 
formulate a solution. Transition radiation is emitted when a charged particle passes 
through the boundary of two media with different dielectric constant. We will not 
go into the detailed general theory of transition radiation but concentrate on the 
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case where a charged particle passes through a thin metallic foil in vacuum. As 
the particle passes through the foil backward transition radiation is emitted when 
the particle enters the foil and forward radiation is emitted when it appears on the 
other side. The emitted radiation energy can be derived directly from (25.39). First, 
we replace the electric radiation field by the magnetic field component and (25.39) 
becomes simply 


de) _ 


a coc B**(t) R?| dQ. (25.43) 


From Parseval’s theorem (A.42) we know that 
[o,@) 1 CO 
i B?(t) dt = — i B’(w)dow. (25.44) 
—_ 27 Joo 


The emission of transition radiation occurs in a very short time tT + Oy ' where Wp 
is the plasma frequency. For this reason, the transition radiation frequency reaches 
into the x-ray regime. We limit ourselves here to frequencies w, which are much 
lower such that t < w~!. The magnetic field is nonzero only during the emission 
process and we can therefore set 


lee) : 1/2 
B(w) -| B(t)e ax [ B(t) dt. (25.45) 
—0o —t/2 


To solve this integral we recall the definition of the vector potential B(t) = V xA, 
and keep in mind that all quantities are to be taken at the retarded time. Expressing in 


.  OAy : ; . ee 
component formV x A; = a > ua — Az | dy _ an the derivatives 
dy az az ox dz ax F 
r=1—L Ro 
Az __ dA: dy vn Ot — Lyrcy _ My dA, _ Ay _ 1 9A, 1 AAy 
are 5 = Dy ay ete: With hoe Ee = ee get By ec only ¢ oe OF 
finally 
1 rl) lo 
Bit) =VxA, = -n, xX —A; = [n x A],. (25.46) 
Cc Ot; Cc Ot; 


The magnetic field spectrum (25.45) becomes then simply 


initial * 


t/2 
B(w) = / B(t) dt = : [n x A], |" (25.47) 
—1/2 


©, 


Initially, while the electron has not yet vanished into the metallic foil, the vector 
potential is made up of the Liénard—Wiechert potentials of a free electron and its 
image charge (a positron) moving in the opposite direction. The vector potential is 
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therefore 


A =Axee, (25.48) 


Bo ye 8 
_ wCeg ———. * 
R(1 + Bn) °R(— Bn) 

rs 
electron positron 


Instead of (25.43) we use the spectral radiation energy de(w) = Ra +B e 
(t)d@ 2, where the extra factor of two comes from using only positive frequencies 
w > 0, and get with (25.48) and e? = r.-mc?47€9 


ds 1 remc* { nx B nxB )? 
dod2 422 c (|1+Bn° 1-Bn 


remc* 5 B 2 
= xe be 
°C 1 — B? (nz) 


where we used B ~ fz and where z is the unit vector along the z-axis. The 
emission angle ? is taken with respect to the z-axis. The spectral and spatial 
transition radiation distribution from a single electron is finally with nz = cos? 
andnx z = sin? 


de rome? B* sin? 3 


daw ds2 ~ 2c (1 — B2 cos? 8)” 


(25.49) 


The spatial radiation distribution of transition radiation is shown in Fig. 25.2. 
No radiation is emitted along the axis 3 = 0 while the radiation intensity reaches 
a maximum at an emission angle of 1/y. Equation (25.49) does not exhibit any 
frequency dependence, which is due to the fact that the emission process occurs in 
a very short time generating a uniform spectrum. Very high frequencies in the x-ray 
regime, where the spectral intensity is expected to drop, have been excluded in this 
derivation. 


Fig. 25.2 Intensity 
distribution aaa 2c. of 600 - 


reme? 


transition radiation 
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Integrating (25.49) over a half space, we get 


d 2 . 2 m/2 2 ss. i) 
eee i ae 5 sin ¥ dd 
dw mc Jo = (1 — B2 cos? #) 
2remc? 1 1+ 6 
= — | (1+ B’) n—— - 28], 25.50 
me gg [C+ in 6| om 
which is for relativistic particles y >> 1 
de(w) — 2remc? 
~ Iny. (25.51) 


dw IC 


The spectral energy emitted into one half space by a single electron in form of 
transition radiation is uniform for all frequencies reaching up into the x-ray regime 
and depends only logarithmically on the particle energy. 


25.3 Spatial Radiation Distribution 


Coming back to synchrotron radiation we must define the electron motion in great 
detail. It is this motion which determines many of the photon beam characteristics. 
The radiation power and spatial distribution of synchrotron radiation in the electron 
frame of reference is identical to that from a linear microwave antenna being emitted 
normal to the direction of acceleration with a sin?-distribution. 

Expressions for the radiation fields and Poynting vector exhibit strong vectorial 
dependencies on the directions of motion and acceleration of the charged particles 
and on the direction of observation. These vectorial dependencies indicate that 
the radiation may not be emitted isotropic but rather into specific directions 
forming characteristic radiation patterns. Similarly, we note a strong dependence 
on the photon frequency. In the following paragraphs, we will investigate theses 
dependencies closer. 


25.3.1 Radiation Lobes 


In this section we will derive these spatial radiation characteristics and determine 
the direction of preferred radiation emission. 

In (25.40) the radiation power per unit solid angle is expressed in the reference 
frame of the particle 


dP _ Feme ; 


aa = ae B;? sin’ © (25.52) 
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Fig. 25.3 Radiation pattern 
in the particle frame of y 


x 
reference or for 
nonrelativistic particles in the 
laboratory system 
Z 


showing a particular directionality of the radiation as shown in Fig. 25.3. The 
radiation power is mainly concentrated in the x, y-plane and is proportional to sin? © 
where @ is the angle between the direction of acceleration, in this case the z-axis, 
and the direction of observation n. The radiation pattern in Fig. 25.3 is formed by 
the end points of vectors with the length dP/d&2 and angles © with respect to the z- 
axis. Because of symmetry, the radiation is isotropic with respect to the polar angle 
gy and therefore the radiation pattern is rotation symmetric about the direction of 
acceleration or in this case about the z-axis. 

This pattern is the correct representation of the radiation for the reference 
frame of the radiating particle. We may, however, also consider this pattern as 
the radiation pattern from non relativistic particles like that from a linear radio 
antenna. For relativistic particles the radiation pattern differs significantly from the 
non relativistic case. The Poynting vector in the form of (25.37) can be used to 
calculate the radiation power per unit solid angle in the direction to the observer —n 


—_ 2 = 2, 2, 
6 =—nSR’| =ecE’ (1+ Bn)R’|.. (25.53) 


We calculate the spatial distribution of the synchrotron radiation for the case of 
acceleration orthogonal to the propagation of the particle as it happens in beam 
transport systems where the particles are deflected by a transverse magnetic fields. 
The particle is assumed to be located at the origin of a right-handed coordinate 
system as shown in Fig. 25.4 propagating in the z-direction and the orthogonal 
acceleration in this coordinate system occurs along the x-axis. 

With the expression (25.29) for the electric fields in the radiation regime the 
spatial radiation power distribution (25.53) becomes 


870 25 Theory of Synchrotron Radiation 


direction of 


acceleration particle path 
pee 


to observer 


Fig. 25.4 Radiation geometry in the laboratory frame of reference for highly relativistic particles 


io ~ nme jx [@+B) xB) (25.54) 


We will now replace all vectors by their components to obtain the directional 
dependency of the synchrotron radiation. The vector n pointing from the observation 
point to the source point of the radiation has from Fig. 25.4 the components 

n = (—sin@ cosgy,—sin @ sing, cos 8) , (25.55) 
where the angle @ is the angle between the direction of particle propagation and 


the direction of emission of the synchrotron light —n. The x-component of the 
acceleration can be derived from the Lorentz equation 


dp 
ymo, = —* =ceB.B,. (25.56) 
With v, ~ v we have 1/p = c eBy/cp = ceB,/(ymcv) and the acceleration vector 
is 


2 
v1 = (0,0,0) = (=.0.0) ; (25.57) 
p 


The velocity vector is 
v = (0,0, v) (25.58) 
and after replacing the double vector product in (25.54) by a single vector sum 


nx[(n+B)x Bl) =@+B)@B)—-BA+nB), (25.59) 


25.3. Spatial Radiation Distribution 871 


we may now square the r.h.s. of (25.54) and replace all vectors by their components. 
The denominator in (25.54) then becomes 


r=R(1+np) = R(1— Bcosé6)°’, (25.60) 


and the full expression for the radiation power exhibiting the spatial distribution is 
finally 


dP 


__ reme*c B* (1 — B cos 6)* — (1 — B”) sin” 6 cos” (25.61) 
Fie) 4x (1 — Bcos@)> 


This equation describes the instantaneous synchrotron radiation power per unit 
solid angle from charged particles moving with velocity v and being accelerated 
normal to the propagation by a magnetic field. Integration over all angles results 
again in the total synchrotron radiation power (24.34). 

In Fig. 25.5 the radiation power distribution is shown in real space as derived 
from (25.61). We note that the radiation is highly collimated in the forward direction 
along the z-axis which is also the direction of particle propagation. Synchrotron 
radiation in particle accelerators or beam lines is emitted whenever there is a 
deflecting electromagnetic field and emerges mostly tangentially from the particle 

trajectory. An estimate of the typical opening angle can be derived from (25.61). We 
set g = 0 and expand the cosine function for small angles cos @ = 1 — 5 67. With 
pel 5 y? we find the radiation power to scale like (y~? + 67)~?. The radiation 
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Fig. 25.5 Spatial synchrotron radiation distribution 
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power therefore is reduced to about one eighth the peak intensity at an emission 
angle of 6, = 1/y or virtually all synchrotron radiation is emitted within an angle 
of 


1 
6, = +- (25.62) 
y 


with respect to the direction of the particle propagation. 

From Fig. 25.5 we observe a slightly faster fall off for an azimuthal angle of 
gy = 0 which is in the plane of particle acceleration and propagation. Although the 
synchrotron radiation is emitted symmetrically within a small angle of the order 
of ay with respect to the direction of particle propagation, the radiation pattern 
from a relativistic particle as observed in the laboratory is very different in the 
deflecting plane from that in the nondeflecting plane. While the particle radiates 
from every point along its path, the direction of this path changes in the deflecting 
plane but does not in the nondeflecting plane. The synchrotron radiation pattern from 
a bending magnet therefore resembles the form of a swath where the radiation is 
emitted evenly and tangentially from every point of the particle trajectory as shown 
in Fig. 25.6. 

The extreme collimation of the synchrotron radiation and its high intensity in 
high energy electron accelerators can cause significant heating problems as well as 
desorption of gas molecules from the surface of the vacuum chamber. In addition, 
the high density of thermal energy deposition on the vacuum chamber walls can 
cause significant mechanical stresses causing cracks in the material. A careful 
design of the radiation absorbing surfaces to avoid damage to the integrity of the 
material is required. On the other hand, this same radiation is a valuable source 
of photons for a wide variety of research applications where, specifically, the 
collimation of the radiation together with the small source dimensions are highly 
desired features of the radiation. 


synchrotron radiation 


Fig. 25.6 Synchrotron radiation from a circular particle accelerator 
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Synchrotron radiation is emitted within a wide range of frequencies. As we have 
seen in the previous paragraph, a particle orbiting in a circular accelerator emits light 
flashes at the revolution frequency. We expect therefore in the radiation frequency 
spectrum all harmonics of the revolution frequency up to very high frequencies 
limited only by the very short duration of the radiation pulse being sent into a 
particular direction toward the observer. The number of harmonics increases with 
beam energy and reaches at the critical frequency the order of y?. 

The frequency spectrum of synchrotron radiation has been derived by many 
authors. In this text, we will stay closer to the derivation by Jackson [3] than others. 
The general method to derive the frequency spectrum is to transform the electric 
field from the time domain to the frequency domain by the use of Fourier transforms. 
Applying this method, we will determine the radiation characteristics of the light 
emitted by a single pass of a particle in a circular accelerator at the location of the 
observer. The electric field at the observation point has a strong time dependence 
and is given by (25.29) while the total radiation energy for one pass is from (25.38) 


dw 0° dP sa 
— =-— i. —dt= / S,n R’dt = €ocR” / E?(t)dt. (25.63) 
dQ —0o dg? —0oO —0o 


The transformation from the time domain to the frequency domain is performed 
by a Fourier transform or an expansion into Fourier harmonics. This is the point 
where the particular characteristics of the transverse acceleration depend on the 
magnetic field distribution and are, for example, different in a single bending 
magnet as compared to an oscillatory wiggler magnet. We use here the method of 
Fourier transforms to describe the electric field of a single particle passing only 
once through a homogeneous bending magnet. In case of a circular accelerator 
the particle will appear periodically with the period of the revolution time and we 
expect a correlation of the frequency spectrum with the revolution frequency. This 
is indeed the case and we will later discuss the nature of this correlation. Expressing 
the electrical field E,(t) by its Fourier transform, we set 


E,(@) = / E,(t) edt, (25.64) 


where —oo < w < oo. Applying Parseval’s theorem (A.42) the total absorbed 
radiation energy from a single pass of a particle is therefore 


dw ee 2 
—_ = — E, do. 253 
10 €0 fe fi (@) |" dw (25.65) 
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Evaluating the electrical field by its Fourier components, we derive an expression 
for the spectral distribution of the radiation energy 

ew 

dQda 


é < |E.(w) 7 R2, (25.66) 


where we have implicitly used the fact that E,(w) = E,(—q@) since E,(f) is real. To 
calculate the Fourier transform, we use (25.29) and note that the electrical field is 
expressed in terms of quantities at the retarded time. The calculation is simplified 
if we express the whole integrand in (25.64) at the retarded time and get with t, = 
t— R(t.) and dt, = “dt instead of (25.64) 


E(w) = eit) ay, (25.67) 


Ley Rx|(R+BR)xB| 


4m€9 ¢ Joo r2R 


Li 


We require now that the radiation be observed at a point sufficiently far away 
from the source that during the time of emission the vector R(t,) does not change 
appreciably in direction. This assumption is generally justified since the duration 
of the photon emission is of the order of 1/(@Ly), where w, = c/p is the Larmor 
frequency. The observer therefore should be at a distance from the source large 
compared to p/y. Equation (25.67) together with (25.14) may then be written like 


E(w) = eiolet®) at. (25.68) 


i ¢ [ nx |(n+B) x B | 


Amey CR Jo (l+np) 


With 


nx|in+ B)xB | _ dnx(nx B) 


(1+nB) ~ dt 1+np es 


we integrate (25.68) by parts while noting that the integrals vanish at the boundaries 
and get 


1 —-iew 


E,(@) = / . In x (nx B)| ete) dy, (25.70) 


4mé€y cR 


After insertion into (25.66) the spectral and spatial intensity distribution is 


aw remc? 9 
— o 


oo 2 
= “io(u+®) q 25.71 
sada ae im [n x (nx B)]e i s (25.71) 


ir 
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observation point P 


at electron at time t 


Fig. 25.7 Radiation geometry 


The spectral and spatial radiation distribution depends on the Fourier transform 
of the particle trajectory which itself is a function of the magnetic field distribution. 
The trajectory in a uniform dipole field is different from say the step function of 
real lumped bending magnets or oscillating deflecting fields from wiggler magnets 
and the radiation characteristics may therefore be different. In this chapter, we will 
concentrate only on a uniform dipole field and postpone the discussion of specific 
radiation characteristics for insertion devices to Chap. 26. 

The integrand in (25.71) can be expressed in component form to simplify 
integration. For that we consider a fixed coordinate system (x,y,z) as shown in 
Fig. 25.7. The observation point is far away from the source point and we focus on 
the radiation that is centered about the tangent to the orbit at the source point. The 
observation point P and the vectors R and 7 are therefore within the (y, z)-plane and 
radiation is emitted at angles @ with respect to the z-axis. 

The vector from the origin of the coordinate system Pp to the observation point 
P is r, the vector R is the vector from P to the particle at P, and rp is the vector from 
the origin to Py. With this we have 


r=1ry)—R(t), (25.72) 


where r,and R, are taken at the retarded time. The exponent in (25.71) is then 
0) 
w(t; + R;/c) = w(t, + nR,/c) = — (ct + mrp — nr) (25.73) 
c 


and the term —“nr is independent of the time generating only a constant phase 
factor which is completely irrelevant for the spectral distribution and may therefore 
be ignored. 

Following the above discussion the azimuthal angle is constant and set to 
g= 5 x because we are interested only in the vertical radiation distribution. The 


876 25 Theory of Synchrotron Radiation 


horizontal distribution is uniform by virtue of the tangential emission along the orbit. 
With these assumptions, we get the vector components for the vector m from (25.55) 


n = (0,—sin0,—cos@). (25.74) 


The vector rp is defined by Fig. 25.7 and depends on the exact variation of the 
deflecting magnetic field along the path of the particles. Here we assume a constant 
bending radius p and have 


rp = [-pcos(at;), 0, psin(wyt;)] , (25.75) 


where w, = fc/p is the Larmor frequency. From these component representations 
the vector product 


Ny = —p sin(@,t;) cos 6 (25.76) 


Noting that both arguments of the trigonometric functions in (25.76) are very 
small, we may expand the r.h.s. of (25.76) up to third order in ¢, and the factor 
tr + nrp/c in (25.73) becomes 


ch, + Fy = ct, — p[@it; — Z(@t,) (1— 387)] - (25.77) 


With @, = Bc/p we get t,(1—p a/c) = (1—B) t © t,/(2y). Keeping only up 
to third order terms in wt, and 6 we have finally for high energetic particles B ~ 1 


Pr, 
= 1 (y? + 6?) 4 + o?t?. (25.78) 


The triple vector product in (25.71) can be evaluated in a similar way. For the 
velocity vector we derive from Fig. 25.7 


B = B [-sign(1/p) sin(wit,), 0, cos(@Lt,)] . (25.79) 


Consistent with the definition of the curvature, the sign of the curvature sign(1/p) 
is positive for a positive charge and a positive magnetic field vector B,. The vector 
relation (A.10) and (25.74), (25.79) can be used to express the triple vector product 
in terms of its components 


nx(nx B)=f [sign(1/p) sin(@ t;), 5 sin 20 cos(wyt;), — sin? 6 cos(@ t+) | : 
(25.80) 


Splitting this three-dimensional vector into two parts will allow us to characterize 
the polarization states of the radiation. To do this, we take the unit vector uw, in 
the x-direction and wy a unit vector normal to u, and normal to r. The y and 
z — components of (25.80) are then also the components of wu and we may express 
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the vector (25.80) by 
nx (nx B) = Bsign(1/p) sin(@yt,)u_ + B sin @ cos(@yt,)uy , (25.81) 
Inserting (25.78) and (25.81) into the integrand (25.70) we get with 6 = 1 


1 ew [{(*% 


~ Ameo R Cc = 


E,(@) = [sign(1/p) sin(@Lt,)u. + sin @ cos(wt;)uy | e*dt,, 


(25.82) 
where 
X=-i fale [( +767) t+ iy wt, | : 
2y? 3 : 


Two polarization directions have been defined for the electric radiation field. One 
of which (uw) is in the plane of the particle path being perpendicular to the particle 
velocity and to the deflecting magnetic field. Following Sokolov and Ternov [4] 
we call this the o-mode (uj = u,). The other polarization direction in the plane 
containing the deflecting magnetic field and the observation point is perpendicular 
to n and is called the 2-mode (u = u,,). Since the emission angle 6 is very small, 
we find this polarization direction to be mostly parallel to the magnetic field. Noting 
that most accelerators or beam lines are constructed in the horizontal plane, the 
polarizations are also often referred to as the horizontal polarization for the o-mode 
and as the vertical polarization for the z-mode. 


25.4.1 Spectral Distribution in Space and Polarization 


As was pointed out by Jackson [3], the mathematical need to extend the integration 
over infinite times does not invalidate our expansion of the trigonometric functions 
where we assumed the argument w,7, to be small. Although the integral (25.82) 
extends over all past and future times, the integrand oscillates rapidly for all but the 
lowest frequencies and therefore only times of the order ct, = + p/y centered 
about 7, contribute to the integral. This is a direct consequence of the fact that 
the radiation is emitted in the forward direction and therefore only photons from 
a very small segment of the particle trajectory reach the observation point. For 
very low frequencies of the order of the Larmor frequency, however, we must 
expect considerable deviations from our results. In practical circumstances such low 
harmonics will, however, not propagate in the vacuum chamber [5] and the observed 
photon spectrum therefore is described accurately for all practical purposes. 

The integral in (25.82) can be expressed by modified Bessel’s functions in the 
form of Airy’s integrals as has been pointed out by Schwinger [6]. Since the 
deflection angle wyt, is very small, we may use linear expansions sin(@yt;) ~*~ 
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q@yt,and cos(@t;) *~ 1. Inserting the expression for the electric field (25.82) 
into (25.65) we note that cross terms of both polarizations vanish u,u) = 0 and 
the radiation intensity can therefore be expressed by two separate orthogonal 
polarization components. Introducing in (25.82) the substitutions [6] 


[1 
Of, = 4/5 + 07x, (25.83) 
Y 


1 
ae m y62)3/2 _ el 4 2623/2, (25.84) 


= 1 
3 


ure 
| 


where fiw, is the critical photon energy, the argument in the exponential factor of 
(25.82) becomes 


(0) 
py LO + YO + sy’) = 3EGx+ >). (25.85) 


With these substitutions, (25.82) can be evaluated noting that only even terms 
contribute to the integral. With wf, and 6 being small quantities we get integrals of 
the form [7] 


i cos [5&(3x + || dx = wki3(§) : (25.86) 

Jo” sin [SEGx + x°)] dx = vp K2/a (6). , 
where the functions K, are modified Bessels’s functions of the second kind. These 
functions assume finite values for small arguments but vanish exponentially for 
large arguments as shown in Fig. 24.12. Fast converging series for these modified 
Bessels’s functions with fractional index have been derived by Kostroun [8]. The 
Fourier transform of the electrical field (25.82) finally becomes 


_ -l v3ee eo oe e. y9Ki/3(&) 
E,(@) = dicen ok a yd+y@) ien($) kota ean 


(25.87) 


describing the spectral radiation field far from the source for particles traveling 
through a uniform magnetic dipole field. Later, we will modify this expression to 
make it suitable for particle motion in undulators or other nonuniform fields. 

The spectral synchrotron radiation energy emitted by one electron per pass is 
proportional to the square of the electrical field (25.87) and is from (25.66) 


aw 3rmce ,(@ ° 9 y7é Ki 3(6) , 
= ¢ 1 292 2 K2 2 
dQdw An Y (=) ( +yY ) 3/3 (EUs rs 1+ y202 us 
(25.88) 
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Fig. 25.8 Radiation lobes for 
o- and z-mode polarization 
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The radiation spectrum has two components of orthogonal polarization, one in 
the plane of the particle trajectory and the other almost parallel to the deflecting 
magnetic field. In (25.87) both polarizations appear explicitly through the orthog- 
onal unit vectors. Forming the square of the electrical field to get the radiation 
intensity, cross terms disappear because of the orthogonality of the unit vectors u, 
and u,,. The expression for the radiation intensity therefore preserves separately the 
two polarization modes in the square brackets of (25.88) representing the o-mode 
and z-mode of polarization, respectively. 

It is interesting to study the spatial distribution for the two polarization modes in 
more detail. Not only are the intensities very different but the spatial distribution is 
different too. The spatial distribution of the o-mode is directed mainly in the forward 
direction while the z-mode radiation is emitted into two lobes at finite angles and 
zero intensity in the forward direction 6 = 0. In Fig. 25.8 the instantaneous radiation 
lobes are shown for both the o- and the z-mode at the critical photon energy and 
being emitted tangentially from the orbit at the origin of the coordinate system. 


25.4.2 Spectral and Spatial Photon Flux 


The radiation intensity W from a single electron and for a single pass may not always 
be the most useful parameter. A more useful parameter is the spectral photon flux 
per unit solid angle into a frequency bin Aw/w and for a circulating beam current / 


PNon (@) — @W(w) 11 Ao 
dddw dwdQ he ow’ 


(25.89) 


Here we have replaced the solid angle by its components, the vertical angle 6 and 
the bending angle w. In more practical units the differential photon flux is 


,, Aw (wo : 2 
“Tay = CRIS (Z) KORE), a 
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Fig. 25.9 Distribution in frequency and angular space for o-mode radiation 


where 
3a photons 
Ce = ee = 1 1? 25.91 
- 4n2e(mc*)? s mrad? GeV7A ‘ ) 
a the fine structure constant and 
292 K73(é) 
ye 1/3 
FEA =(+ yy [1+ : (25.92) 
Lay g? K 3/3) 


For approximate numerical calculations of photon fluxes, we may use the graphic 
representation in Fig. 24.12 for the modified Bessel 's function. 

The spatial radiation pattern varies with the frequency of the radiation. Specifi- 
cally, the angular distribution concentrates more and more in the forward direction 
as the radiation frequency increases. The radiation distribution in frequency and 
angular space is shown for both the o- (Fig. 25.9) and the mz-mode (Fig. 25.10) at 
the fundamental frequency. The high collimation of synchrotron radiation in the 
forward direction makes it a prime research tool to probe materials and its atomic 
and molecular properties. 


25.4.3 Harmonic Representation 


Expression (25.88) can be transformed into a different formulation emphasizing 
the harmonic structure of the radiation spectrum. The equivalence between both 
formulations has been shown by Sokolov and Ternov [4] expressing the modified 
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Fig. 25.10 Distribution in frequency and angular space for z-mode radiation 


Bessel’s functions K1/3 and K2/3 by regular Bessel’s functions of high order. With 
v= a the asymptotic formulas for v > 1 are 


V3x 


K1/3(&) = viepee cos 6), (25.93) 
3 
Ky/3(&) = p08 cos 0), (25.94) 


where € = 4 (1—* cos? ay? aw Ry + p29)? for small angles. These 
approximations are justified since we are only interested in very large harmonics of 
the revolution frequency. The harmonic number v for the critical photon frequency, 
for example, is given by ve = w./@_ = 3y? which for practical cases is generally a 
very large number. Inserting these approximations into (25.88) gives the formulation 
that has been derived first by Schott [9-11] in 1907 long before synchrotron 
radiation was discovered in an attempt to calculate the radiation intensity of atomic 
spectral lines 


dP me 
dQdv = oe v? [J?(v cos 6) + O77 (v cos 0)| ; (25.95) 


where we have introduced the radiation power P = Woe This form still exhibits 
the separation of the radiation into the two polarization modes. 


25.4.4 Spatial Radiation Power Distribution 


Integrating over all frequencies we obtain the angular distribution of the syn- 
chrotron radiation. From (25.88) we note the need to perform integrals of the form 
fae wK;, (aw) dw, where aw = &. The solution can be found in the integral tables 
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of Gradshteyn and Ryzhik [12] as solution number GR(6.576.4)! 


[o.e) 2 2 
22 m 1—4p 

K dw = ——~ , 
[ oReeyae 32a3 cosa 


(25.96) 


fora > 0, and —1.5 < yw < 1.5. Applying this solution to (25.88) and integrating 
over all frequencies, we get for the angular energy distribution of the synchrotron 
radiation per electron 


dw Pame 5 5 292 
= y ( y ) (25.97) 


dQ 16 p- (1+ y262)5/2 T1+ y262 

This result is consistent with the angular radiation power distribution (25.61) 
where we found that the radiation is collimated very much in the forward direction 
with most of the radiation energy being emitted within an angle of +1/y. There 
are two contributions to the total radiation intensity, the o-mode and the z-mode. 
The o-mode has a maximum intensity in the forward direction, while the maximum 
intensity for the z-mode occurs at an angle of 6, = 1/ (572 y). The quantity 
dW/dQ is the radiation energy per unit solid angle from a single electron and a 
single pass and the average radiation power is therefore P, = W / Trey or (25.97) 
becomes 


Pe Fine 5 5 y292 
ia E (1+ y ). (25.98) 


dQ = 32mp? (1 + y262)5/2 714 y262 


Integrating (25.98) over all angles 6, we find the synchrotron radiation power 
into both polarization modes. In doing so, we note first that (25.98) can be simplified 
with (24.34) and B = 1 


gP,,. 21 FP, y 5 ape 
— 1 . 25.99 
dQ 322 (1+ 262)? ( a1 4pe ee 


This result is consistent with (25.61) although it should be noted that (25.99) 
gives the average radiation power from a circular accelerator with uniform intensity 
in w, while (25.61) is the instantaneous power into the forward lobe. Equa- 
tion (25.99) exhibits the power into each polarization mode for which the total 
power can be obtained by integration over all angles. First, we integrate over all 
points along the circular orbit and get a factor 27 since the observed radiation power 
does not depend on the location along the orbit. Continuing the integration over all 
angles of 6, we find the contributions to the integral to become quickly negligible for 
angles larger than 1/y. If it were not so, we could not have used (25.99) where the 


‘In this chapter we will need repeatedly results from mathematical tables. We abbreviate such 
solutions with the first letters of the authors names and the formula number. 
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trigonometric functions have been replaced by their small arguments. Both terms 
in (25.99) can be integrated readily and the first term becomes with GR(2.271.6) 
[12] 


Omaxy > 1 ydé _ 4 3s a 
saver A+ POD 3 ates 
The second term is with GR[2.272.7] [12] 
fr y>07d _ 4 (25 101) 
Onxy<a (Ll + y202)7/2 15° ; 
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With these integrals and (25.99) the radiation power into the o- and z-mode with 
P, from (24.34) is 


ie (25.102) 
Py = gPy. 

The horizontally polarized component of synchrotron radiation greatly dom- 
inates the photon beam characteristics and only 12.5% of the total intensity is 
polarized in the vertical plane. In the forward direction the o-polarization even 
approaches 100 %. Obviously, the sum of both components is equal to the total 
radiation power. This high polarization of the radiation provides a valuable charac- 
teristic for experimentation with synchrotron radiation. In addition, the emission of 
polarized light generates a slow polarizing reaction on the particle beam orbiting in 
a circular accelerator like in a storage ring [13]. 


25.5 Asymptotic Solutions 


Expressions for the radiation distribution can be greatly simplified if we restrict 
the discussion to very small or very large arguments of the modified Bessel’s 
functions for which approximate expressions exist [14]. Knowledge of the radiation 
distribution at very low photon frequencies becomes important for experiments 
using such radiation or for beam diagnostics where the beam cross section is being 
imaged to a TV camera using the visible part of the radiation spectrum. To describe 
this visible part of the spectrum, we may in most cases assume that the photon 
frequency is much lower than the critical photon frequency. 
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25.5.1 Low Frequencies and Small Observation Angles 


For very small arguments or low frequencies and small angles, we find the following 
approximations AS(9.6.9) [14] 


21/3) f@o\7? 1 

2 a 

Kip(§ — 0) * a5 (=) Tye? (25.103a) 
2 2/3 x2 wo\ 4? 1 

Ky3(§ — 0) x 2°°T"(2/3) (=) (4 p62 (25.103b) 


where the Gamma functions (1/3) = 2.6789385 and (2/3) = 1.351179 and 
from (25.85) 


Pa + y262)32, (25.104) 
We 


Nie 


Inserting this into (25.90) the photon flux spectrum in the forward direction becomes 
for? =Oand * «1 


Non 
d0 dy 


267" A 
=) == (25.105) 


We 
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a) 
The photon spectrum at very low frequencies is independent of the particle energy 
since w, « E%. Clearly, in this approximation there is no angular dependence for the 
o-mode radiation and the intensity increases with frequency. The 2-mode radiation 
on the other hand is zero for 6 = 0 and increases in intensity with the square of 0 
as long as the approximation is valid. 


25.5.2 High Frequencies or Large Observation Angles 


For large arguments of the modified Bessel’s functions or for high frequencies and 
large emission angles different approximations hold. In this case, the approximate 
expressions are actually the same for both Bessel’s functions indicating the same 
exponential drop off for high energetic photons AS(9.7.2) [14] 


5 me 
Kj 3(§ —> 00) & 26 (25.106a) 
me 
K33(€ — 00) & se (25.106b) 
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The photon flux distribution in this approximation becomes from (25.90) 


d?Non 3r.mc? 7@ _» Aa lI ae 
~w if 28/1 + p22 1 ; 25.107 
dddw ~ 4xtic’ a =e a( = on) ( 


where Np» is the number of photons emitted per pass. The spatial radiation distribu- 
tion is greatly determined by the exponential factor and the relative amplitude with 
respect to the forward direction scales therefore like 


exp -— [( + ye)? - i] ; (25.108) 


Cc 


We look now for the specific angle for which the intensity has fallen to 1/e. 
Since ® >> @, this angle must be very small y@ < 1 and we can ignore other 
@-dependent factors. The exponential factor becomes equal to 1/e for 


3—y' Oi, © 1 (25.109) 


and solving for 6)/. we get finally 


la, 
Oije = A Peat for @ >> @. (25.110) 
oe 


The high energy end of the synchrotron radiation spectrum is more and more 
collimated into the forward direction. The angular distribution is graphically 
illustrated for both polarization modes in Figs. 25.9 and 25.10. 


25.6 Angle-Integrated Spectrum 


Synchrotron radiation is emitted over a wide range of frequencies and it is of great 
interest to know the exact frequency distribution of the radiation. Since the radiation 
is very much collimated in the forward direction, it is useful to integrate over all 
angles of emission to obtain the total spectral photon flux that might be accepted 
by a beam line with proper aperture. To that goal, (25.88) will be integrated with 
respect to the emission angles to obtain the frequency spectrum of the radiation. 
The emission angle 6 appears in (25.88) in a rather complicated way which makes 
it difficult to perform the integration directly. We replace therefore the modified 
Bessel’s functions by Airy’s functions defined by AS(10.4.14) and AS(10.4.31) [14] 


Ve 


Ai(z) = Jane 


K1/3(€), (25.11 1a) 
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z 


Ai’ (2) = - Wax Ko/3(&). (25.111b) 
With the definition 
ee i. (25.112) 
we get from (25.85) 
z= (38)? = 93 (1+ 726) . (25.113) 


We apply this to the periodic motion of particles orbiting in a circular accelerator. 
In this case the spectral distribution of the radiation power can be obtained by 
noting that the differential radiation energy (25.88) is emitted every time the particle 
passes by the source point. A short pulse of radiation is sent towards the observation 
point at periodic time intervals equal to the revolution time. The spectral power 
distribution (25.88) expressed by Airy functions is 


dP, _ OP) y 
dwod2Q 2m a 


[12 Ai? @ + n¥97704? |. (25.114) 


To obtain the photon frequency spectrum, we integrate over all angles of emission 
which is accomplished by integrating along the orbit contributing a mere factor of 
2x and over the angle @. Although this latter integration is to be performed between 
-1/2 and +7/2, we choose the mathematically easier integration from —oo to +00 
because the Airy functions fall off very fast for large arguments. In fact, we have 
seen already that most of the radiation is emitted within a very small angle of +1/y. 
The integrals to be solved are of the form f° 6" A? [77/3(1 + y?67] d6 where n = 
0 or 2. We concentrate first on the second term in (25.114) and form with (25.86) 
and (25.1 11a) the square of the Airy function 


1 CO CO 
6? Ai?(z) = =| 6” cos [4x + zx] ax | 6? cos [ty + zy] dy. 
am” Jo 0 . 
(25.115) 
We solve these integrals by making use of the trigonometric relation 
cos(a@ + 5B) cos(a — 5B) = cosa cosp. (25.116) 


After introducing the substitutions x + y = s and x — y = ft, we obtain integrals 
over two terms which are symmetric in s and ¢ and therefore can be set equal to get 


1 lo) co 
PAP) = =| i 0? cos[ 5s +350 +25] dsdy, (25.117) 
zm Jo Jo 
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where the factor 5 comes from the transformation of the area element 


dsdy = Ba In our problem we replace the argument z by the expression 


z= 77/3 (1 + y767) and integrate over the angle 0 


i / 0? Ai’(z) dd = il] 6° cos[bs* + 3st? + sn? (1 + y?0")] ds dy dé. 
(25.118) 


The integrand is symmetric with respect to 6 and the integration therefore needs 
to be performed only from 0 to oo with the result being doubled. We also note 
that the integration is taken over only one quadrant of the (s,f)-space. Further 
simplifying the integration, the number of variables in the argument of the cosine 
function can be reduced in the following way. We note the coefficient ¢ ?-+77/3 707 
which is the sum of squares. Setting St = rcosg and n!/7y6 = rsing this term 
becomes simply r?. The area element transforms like dtd@ = 2/(n'/*y) rdrdg and 
integrating over g from 0 to 2/2, since we need integrate only over one quarter 
plane, (25.118) becomes finally 


1 


0° Ai* (z) d@ = ——~ 
/ ale 2nny? 


Co 
// rcos[ 45° + sys + | rdrds . 
0 
(25.119) 
The integrand of (25.119) has now a form close to that of an Airy integral and we 


will try to complete that similarity. With ¢g = (3&/2)!/>x the definition of the Airy 
functions AS(10.4.31) [14] are consistent with (25.111) 


ik Co 
Ai(z) = a cos[5q° + zq]dq. (25.120) 
0 
Equation (25.119) can be modified into a similar form by setting 
Ss and s(??4+r)=yw. (25.121) 


Solving for w we get w = s/2?/3 and with y = 27/3 (77/3 + r?), ds = 2?/3dw and 
dy = 2°/3r dr Eq. (25.119) becomes 


in 6? Ai? (z) dO = i [ (55 ze ue) Ai(y) dy , (25.122) 
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where we have used the definition of Airy’s function and where the integration 
starts at 


2/3 
(ey) 


corresponding to r = 0. 

We may separate this integral into two parts and get a term yAi(y) under one of 
the integrals. This term is by the definition of Airy’s functions AS(10.4.1) [14] equal 
to Ai”. Integration of this second derivative gives 


[o,) 
Ai” (y) dy = —Ai' (v0) (25.124) 
yo 
and collecting all terms in (25.122) we have finally 


‘_ 6? Ai?(z) dé = 7 E (vo) +f Ai(y) ay] (25.125) 


The derivation of the complete spectral radiation power distribution (25.114) 
requires also the evaluation of the integral [ Ai’(z)d 0. This can be done with the 
help of the integral f Ai’(z) d6 and (25.125). We follow a similar derivation that 
led us just from (25.118) to (25.119) and get instead of (25.125) 


lo.e) 1 co 
/ Ai*(z) d@ = Er i Ai(y) dy. (25.126) 
—0o 2n “Y Jyo 
Recalling the definition of the argument y = 77/3 (1 + y?0?), we differenti- 
ate (25.126) twice with respect to 7’? to get 

loo) 21/3 
Ps [Ai (z) + Ai? (z)] dO = ———Ai'(y0) . (25.127) 

—00 n'y 


Using the relation Ai’ (z) = zAi(z) and the results (25.124), (25.125) in (25.127) 
we get 


/ ~ Ai’ (z) dd = —— — [2a (vo) +f Ai(y) ay] (25.128) 
—00 a Yo 


At this point, all integrals have been derived that are needed to describe the 
spectral radiation power separately in both polarization modes and the spectral 
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radiation power from (25.114) becomes 


dP, = 27P yo (<= 3Ai’ 0) =. Ail) wy) 


dw 162 via 
(=e Oo) +f Ai(y) a). (25.129) 


The first term describes the o-mode of polarization and the second term the zr- 
mode. Combining both polarization modes, we may derive a comparatively simple 
expression for the spectral radiation power. To this goal, we replace the Airy’s 
functions by modified Bessel’s functions 


Ai’ (yo) 
> v= — Je Kaya(xo) (25.130) 


where from (25.111), (25.112), and (25.122) x9 = w/a. With /y dy =dx, the 
recurrence formula 2K; a —K\/3 + Ks;3 and (25.111) the Airy integral is 


[ Aiow=-% [Kom a-~ [Kea 
¥0 x0 xo 
= EKoplt) - taf Ks/3(x) dx. (25.131) 


We use (25.130) and (25.131) in (25.129) and get the simple expression for the 
synchrotron radiation spectrum 


dP. P oo P 
ra oie / Ks/3(x) dx = =s(=) (25.132) 
dw Wc We xo We w 


where we defined the universal function 


s(2) = a= = | Ks/3(x)dx. (25.133) 
We 7 We w/c 


The spectral distribution depends only on the critical frequency @,, the total 
radiation power and a purely mathematical function. This result has been derived 
originally by Ivanenko and Sokolov [15] and independently by Schwinger [6]. 
Specifically, it should be noted that the synchrotron radiation spectrum, if nor- 
malized to the critical frequency, does not depend on the particle energy and is 
represented by the universal function shown in Fig. 25.11. The energy dependence is 
contained in the cubic dependence of the critical frequency acting as a scaling factor 
for the real spectral distribution. The mathematical function is properly normalized 
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Fig. 25.11 Universal 


function: a oe = 1.333 eae pias 
S(E) = BEES KssQ)dx, — S(@/o,) J ba | % 
i = ss “| | ‘ 
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as we can see by integrating over all frequencies. 
/ zie = Lp, | v0 f Ks/3(x)dx dx . (25.134) 
0 w 0 x0 


After integration by parts, the result can be derived from GR[6.561.16] [12] 


if aw = 1p, | x9 Ks/3(xo)dxo = T (4/3) (2/3). (25.135) 
0 0 


Using the triplication formula AS(6.1.19) [14] the product of the gamma functions 
becomes 


I (4/3) (2/3) = $3. (25.136) 


With this equation the proper normalization of (25.134) is demonstrated 


°° dP, 
—dw = Py. (25.137) 
7 dw p 


Of more practical use is the spectral photon flux per unit angle of deflection in the 
bending magnet. With the photon flux dNj, =dP/fhw we get from (25.132) 


aN = oe AO ee (25.138) 
dw  2nha, wo \a, ‘ 


and with (24.34) and (24.49) 


(25.139) 


dNon _ 4a I Aw o 
de Sem \a)’ 
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where w is the deflection angle in the bending magnet and a the fine structure 
constant. In practical units, this becomes 


dN pn Aw .(o 
= Ces 25.140 
dy Yo (=) lalla 
with 
4 hot 
Cy = —, = 3.9614 x 19'65_POs _. (25.141) 
9e mc? s mrad A GeV 


The synchrotron radiation spectrum in Fig.25.11 is rather uniform up to the 
critical frequency beyond which the intensity falls off rapidly. Equation (25.132) 
is not well suited for quick calculation of the radiation intensity at a particular 
frequency. We may, however, express (25.132) in much simpler form for very low 
and very large frequencies as discussed in Sect. 24.3. 


25.7 Statistical Radiation Parameters 


The emission of synchrotron radiation is a classical phenomenon. For some 
applications it is, however, useful to express some parameters in statistical form. 
Knowing the spectral radiation distribution, we may follow Sands [16] and express 
some quantities in the photon picture. We have used such to derive expressions 
for the equilibrium beam size and energy spread. Equilibrium beam parameters 
are determined by the statistical emission of photons and its recoil on the particle 
motion. For this purpose, we are mainly interested in an expression for a and the 
photon flux at energy épy. From these quantities, we may derive an expression for the 


average photon energy («2,) emitted along the circumference of the storage ring. 


With JT (€pn) being the probability to emit a photon with energy ép, we have 


[o.@} 
(«bn) = / epntT (€pn) depn . (25.142) 
z 0 


The probability [7 (€pn) is defined by the ratio of the photon flux n(ep,) emitted 
at energy €pp to the total photon flux Non 


(Eph) 


TT (€pn) = Ae 
p 


: (25.143) 
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where 


Py S( 
ae a es (25.144) 
& Xx Ec 


Epn) = 


The photon flux at én is related to the spectral photon power by €pn(épn) = P (€pn) : 
Integrating (25.138) over all angles y and multiplying by Aw = épn we get for the 
spectral radiation power 


dN P € 
P (&pn) dep = &ph——depn = — S (=) depn . (25.145) 
déph Ec Ec 


The total number of emitted photons per unit time is just the integral 


; oe Py (73s 15/3 P 
Non = / A(Epn) déph = / ce ol Ets (25.146) 
0 E 0 XxX 8 Ec 


Cc 


With this, the probability to emit a photon of energy épp is finally 


8 1S(x) 
fe) =— =. 25.147 
(eon) 15/3. 2% ¢ 
and 
8e2 oO 11 
2) = —< S(x) dx = —e?. 25.148 
(ea) oe) KSiy) 578 ( ) 


To calculate equilibrium beam parameters in Chaps. 11.3 and 11.4, for example, 
we need to know the quantity ("vpn(e2,)) which is now from (25.146), (25.148) 


(Non (€?)), = —=(ecPy). . (25.149) 


where the average is to be taken along the orbit and around the storage ring through 
all magnets. Expressing the critical photon energy by (24.49) and the radiation 
power by (24.34) and we get finally 


i 
(Non(e*)), = recmc-hey! (“) (25.150) 


55 
24/3 
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Problems 


25.1 (S). Integrate the radiation power distribution (25.61) over all solid angles and 
prove that the total radiation power is equal to (24.34). 


25.2 (S). In the ESRF (European Synchrotron Radiation Facility) synchrotron 
radiation source in Grenoble (France) an electron beam of 200 mA circulates at an 
energy of 6 GeV. The bending magnet field is 1.0 T. Derive and sketch the spectral 
photon flux into a band width of 1% and an acceptance angle of 10 mrad as a 
function of photon energy. 


25.3 (S). Derive an expression identifying the angle at which the spectral intensity 
has dropped to p% from the maximum intensity. Derive approximate expressions for 
very low or very large photon energies. Find the angle at which the total radiation 
intensity has dropped to 10 %. 


25.4. Derive the wave equations (25.3) and (25.4). 
25.5. Derive (25.17). 


25.6. Derive (25.28) from (25.27). Show that the electrical field in the radiation 
regime is purely orthogonal to the direction of observation. Is the field also parallel 
to the acceleration? 


25.7. Design a synchrotron radiation source for a critical photon energy of your 
choice. Use a simple FODO lattice and specify the minimum beam energy, beam 
current, and bending radius which will produce a bending magnet photon flux of 
10!* photons/s/mrad at the desired photon energy and into a band width of Aw/w = 
1%, What is the minimum and maximum photon energy for which the photon flux 
is at least 10!! photons/s/mrad? How big is your ring assuming a 30 % fill factor for 
bending magnets? 
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Chapter 26 
Insertion Device Radiation 


Synchrotron radiation from bending magnets is characterized by a wide spectrum 
from microwaves up to soft or hard x-rays as determined by the critical photon 
energy. To optimally meet the needs of basic research with synchrotron radiation, 
it is desirable to provide radiation characteristics that cannot be obtained from ring 
bending magnets but require special magnets. The field strength of bending magnets 
and the maximum particle beam energy in circular accelerators like a storage ring 
is fixed leaving no adjustments to optimize the synchrotron radiation spectrum for 
particular experiments. To generate specific synchrotron radiation characteristics, 
radiation is often produced from insertion devices installed along the particle beam 
path. Such insertion devices introduce no net deflection of the beam and can 
therefore be incorporated in a beam line without changing its geometry. Motz [1] 
proposed first the use of wiggler magnets to optimize characteristics of synchrotron 
radiation. By now, such magnets have become the most common insertion devices 
consisting of a series of alternating magnet poles deflecting the beam periodically 
in opposite directions as shown in Fig. 26.1. 

In Chap. 24 the properties of wiggler radiation were discussed shortly in an 
introductory way. Here we concentrate on more detailed and formal derivations 
of radiation characteristics from relativistic electrons passing through periodic 
magnets. 

There is no fundamental difference between wiggler and undulator radiation. 
One is the stronger/weaker version of the other. The deflection in an undulator is 
weak and the transverse particle momentum remains nonrelativistic. The motion is 
purely sinusoidal in a sinusoidal field, and the emitted radiation is monochromatic at 
the particle oscillation frequency which is the Lorentz-contracted periodicity of the 
undulator period. Since the radiation is emitted from a moving source the observer in 
the laboratory frame of reference then sees a Doppler shifted frequency. We call this 
monochromatic radiation the fundamental radiation or radiation at the fundamental 
frequency of the undulator. 
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Fig. 26.1 Trajectory of a wiggler 
particle beam in a flat wiggler period sinusoidal beam path 


= “TT AAA 


As the undulator field is increased, the transverse motion becomes stronger and 
the transverse momentum starts to become relativistic. As a consequence, the so far 
purely sinusoidal motion becomes periodically distorted causing the appearance of 
harmonics of the fundamental monochromatic radiation. These harmonics increase 
in number and density with further increase of the magnetic field and, at higher 
frequencies, eventually merge into one broad spectrum characteristic for wiggler 
or bending magnet radiation. At very low frequencies, the theoretical spectrum is 
still a line spectrum showing the harmonics of the revolution frequency. Of course, 
there is a low frequency cut-off at a wavelength comparable or longer than vacuum 
chamber dimensions which therefore do not show-up as radiation. 

An insertion device does not introduce a net deflection of the beam and we 
may therefore choose any arbitrary field strength which is technically feasible to 
adjust the radiation spectrum to experimental needs. The radiation intensity from 
a wiggler magnet also can be made much higher compared to that from a single 
bending magnet. A wiggler magnet with say ten poles acts like a string of ten 
bending magnets or radiation sources aligned in a straight line along the photon 
beam direction. The effective photon source is therefore ten times more intense than 
the radiation from a single bending magnet with the same field strength. 

Wiggler magnets come in a variety of types with the flat wiggler magnet being 
the most common. In this wiggler type only the component B, is nonzero deflecting 
the beam in the horizontal plane. To generate circularly or elliptically polarized 
radiation, a helical wiggler magnet [2] may be used or a combination of several flat 
wiggler magnets deflecting the beam in orthogonal planes which will be discussed 
in more detail in Sect. 26.3.2. 


26.1 Particle Dynamics in a Periodic Field Magnet 
Insertion devices are characterized by the requirement that 


[ tsac=o. 


As discussed in Chap. 15 this requirement demands that the first and second integral 
must be made zero with the use of steering magnets before and after the undulator. 
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This correction is sufficient from the beam stability point of view. However, it does 
not address the effect of field tolerances on the intensity of radiation into harmonics. 
For example, the curved trajectory in Fig. 15.4 can reduce the radiation intensity 
because not all periods radiate in the same direction and constructive interference of 
light emitted by individual periods is not optimum. Therefore the overall trajectory 
curvature in Fig. 15.4 should be corrected as discussed in Chap. 15. Furthermore, 
variations in field strength and period length from period to period in the undulator 
can seriously diminish the radiation intensity especially in the higher harmonics. 
The effect of such errors on individual harmonic intensities have been studied [3]. 
A special shimming procedure has been proposed by Elleaume [4] to transform an 
undulator with construction tolerances to an almost ideal undulator giving close to 
perfect intensities for about a dozen harmonics. If the shimming is done correctly 
the long coil mentioned in Chap. 15 is not necessary anymore. In the following 
discussion we assume that the integrals have been corrected and that the undulator 
has been shimmed. 

Particle dynamics and resulting radiation characteristics for an undulator have 
been derived first by Motz [1] and later in more detail by other authors [5, 6]. 
A sinusoidally varying vertical field causes a periodic deflection of particles in 
the (x, z)-plane shown in Fig. 26.1. To describe the particle trajectory, we use the 
equation of motion 


; = yf [B x Bl, (26.1) 


where # is the particle velocity and get with (6.110) the equations of motion in 
component form 


@x _ eBoy dz 

d2 ~~ ypm dt cos (kpz) (26.2) 
az — +4 £80 2 cos (kz) 
d2 ~~ ' yBm dt pry 3 


where we have set ky = 27/A, anddz = fBcdt with B = v/c. 

Equations (26.2) describe the coupled motion of a particle in the sinusoidal field 
of a flat wiggler magnet. This coupling is common to the particle motion in any 
magnetic field but generally in beam dynamics we set dz/dt ~ v and dx/dt ~ 0 
because dx/dt « dz/dt. This approximation is justified in most beam transport 
applications for relativistic particles, but here we have to be cautious not to neglect 
effects that might be of relevance on a very short time or small geometric scale 
comparable to the oscillation period and wavelength of synchrotron radiation. 

We will keep the dx/df-term and get from (26.2) with dz/dt ~ v and after 
integrating twice that the particle trajectory follows the magnetic field in the sense 
that the oscillatory motion reaches a maximum where the magnetic field reaches a 
maximum and crosses the beam axis where the field is zero. We start at the time 
t = 0 in the middle of a magnet pole where the transverse velocity x) = 0 while 
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the longitudinal velocity 7 = Bc and integrate both equations (26.2) utilizing the 
integral of the first equation in the second to get 


ie = —pex sin (kpz) . 
dz 


z ; (26.3) 
¢ = Be [! = sf sin? (kz) : 


The transverse motion describes the expected oscillatory motion and the longitu- 
dinal velocity v exhibits a periodic modulation reflecting the varying projection of 
the velocity vector to the z-axis. Closer inspection of this velocity modulation shows 
that its frequency is twice that of the periodic motion. It is convenient to describe 
the longitudinal particle motion with respect to a Cartesian reference frame moving 
uniformly along the z-axis with the average longitudinal particle velocity Bc = (z) 
which can be derived from (26.3b) 


p=, (1 - A) ; (26.4) 


In this reference frame the particle follows a figure-of-eight trajectory composed 
of the transverse oscillation and a longitudinal oscillation with twice the frequency. 
We will come back to this point since both oscillations contribute to the radiation 
spectrum. A second integration of (26.3b) results finally in the equation of motion 
in component representation 


x(t) = Tyke cos (kpBet) , 


= 20, = (26.5) 
z(t) = Bet + Biytk sin (2kpBct) , 
where we set z = Bet. The maximum amplitude a of the transverse particle 
oscillation is finally 
K ApK 
(26.6) 


a= = : 
Bykp 2nBy 


This last expression gives another simple relationship between the wiggler 
strength parameter and the transverse displacement of the beam trajectory 


Ap (cm) K 


= 0.8133 
a E (GeV) 


(26.7) 


For most cases, this beam displacement is very small. 
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26.2 Undulator Radiation 


The physical process of undulator radiation is not different from the radiation 
produced from a single bending magnet. However, the radiation received at great 
distances from the undulator exhibits special features which we will discus in 
more detail. Basically, we observe an electron performing N, oscillations while 
passing through an undulator with N, undulator periods. The observed radiation 
spectrum is the Fourier transform of the electron motion and therefore quasi- 
monochromatic with a finite line width inversely proportional to the number of 
oscillations performed. 


26.2.1 Fundamental Wavelength 


Undulator radiation can also be viewed as a superposition of radiation fields from N, 
sources yielding quasi-monochromatic radiation as a consequence of interference. 
To see that, we observe the radiation at an angle 0 with respect to the path of the 
electron as shown in Fig. 26.2. 

The electron travels on its path at an average velocity given by (26.4) and it takes 
the time 


—_ Ap _ Ap 
~ cB cBl1— K?/(4y?)] 


to move along one undulator period. During that same time, the radiation front 
proceeds a distance 


(26.8) 


Ap 
Bll — K*/(4y’)] 


moving ahead of the particle since sp, > tcB . For constructive superposition of 
radiation from all undulator periods, we require that the difference spy — Ap cos 3 
be equal to an integer multiple of the wavelength A, or for small observation angles 
v<i 


Sph = TC= (26.9) 


kAp = —Ap(1 — 48°). (26.10) 


———— 
Bl — k?/(4y7)] 


Fig. 26.2 Interference of 
undulator radiation 
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After some manipulations, we get with K*/y? « 1 and B = 1 for A, 


A 
Ape — (14+ 1K? 4+ y78?). 26.11 
k ay | + 5K’ + yd’) ( ) 


The lowest harmonics is defined by k = | and is called the fundamental undulator 
wavelength. 

From an infinitely long undulator, the radiation spectrum consists of spectral 
lines at a wavelength determined by (26.11). In particular, we note that the shortest 
wavelength is emitted into the forward direction while the radiation at a finite angle 
v appears red shifted by the Doppler effect. For an undulator with a finite number 
of periods, the spectral lines are widened to a width of about 1/N, or less as we will 
discuss in the next section. 


26.2.2. Radiation Power 


The radiation power is from (25.41) 


P = 2rmc|B*|?, (26.12) 
where * indicates quantities to be evaluated in the particle reference system. We 
may use this expression in the particle system to calculate the total radiated energy 
from an electron passing through an undulator. The transverse particle acceleration 
is expressed by mv* = dp, /dt* = ydp,/dt where we used f* = t/y and inserting 
into (26.12) we get 


2 o} 
Toy dpi 
P=2z —])}. 26.13 
5 ( re) ( ) 


The transverse momentum is determined by the particle deflection in the 
undulator with a period length A, and is for a particle of momentum cpo 


PL =P sinapt, (26.14) 
where p = poO and w) = cky = 2mc/Ap. The angle 6 = K/y is the maximum 
deflection angle defined in (6.121). With these expressions and averaging over one 
period, we get from (26.13) for the instantaneous radiation power from a charge e 
traveling through an undulator 


Ping = ereme”y KK, , (26.15) 


where r, is the classical electron radius. The duration of the radiation pulse is equal 
to the travel time through an undulator of length L, = A,Np and the total radiated 
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energy per electron is therefore 


AE = 4remc"y?K’ KL . (26.16) 
In more practical units 
gee E*K? 
AE(eV) = Cy——- Ly = 725.69 Ly (26.17) 
Co72 Aj(cm) 
with 
_ 4x? le 7 mM 
= = 7.2569 x 10 : (26.18) 
3mc? eV 


The average total undulator radiation power for an electron beam circulating in a 
storage ring is then just the radiated energy (26.16) multiplied by the number of 
particles MN, in the beam and the revolution frequency or 


L 
Payg = Lrecme*y?K?k-Nyp— = 26.19 
g = 3recmcy pea ( ) 

or 
Payg(W) = 633.6E’ Bol Lu, (26.20) 


where / is the circulating electron beam current. The total angle integrated radiation 
power from an undulator in a storage ring is proportional to the square of the beam 
energy and maximum undulator field By and proportional to the beam current and 
undulator length. 


26.2.3 Spatial and Spectral Distribution 


For bending magnet radiation, the particle dynamics is relatively simple being 
determined only by the particle velocity and the bending radius of the magnet. 
In a wiggler magnet, the magnetic field parameters are different from those 
in a constant field magnet and we will therefore derive again the synchrotron 
radiation spectrum for the beam dynamics in a general wiggler magnet. No special 
assumptions on magnetic field configurations have been made to derive the radiation 
spectrum (25.71) and we can therefore use this expression together with the 
appropriate beam dynamics to derive the radiation spectrum from a wiggler magnet 


aw =" mc w 
dadQ 


sf nx [nx plei# ete Yar! (26.21) 
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Fig. 26.3 Particle trajectory 
and radiation geometry in a 
wiggler magnet 


The integrand in (26.21) can be evaluated from known particle dynamics in a 
wiggler magnet noting that all quantities are to be taken at the retarded time 4. 
The unit vector from the observer to the radiating particle is from Fig. 26.3 


n =—cosgsind x — sing sind y — cos 0 Z, (26.22) 


where (x, y,Z) are coordinate unit vectors. The exponent in (26.21) includes the 
term R/c = nR/c. We express again the vector R from the observer to the particle 
by the constant vector r from the origin of the coordinate system to the observer and 
the vector rp from the coordinate origin to the particle for R = —r +r, as shown in 
Fig. 26.3. 

The r-term gives only a constant phase shift and can therefore be ignored. The 
location vector rp of the particle with respect to the origin of the coordinate system 
is 


Ty (tr) = x(t,)¥ + 2(t,)z 


and with the solutions (26.5) we have 


(t,) us cos(@pt,)& + | Bet, + a in(2wpt,) | 2 (26.23) 
r,(t,) = — 1x 1+ —— sin t : : 
ay a OT 8th SP |? 
where 
Wp = kpc. (26.24) 


The velocity vector finally is just the time derivative of (26.23) 


K. . ee K? P 
B(t;) = ——B sin(@pt,)x + B + — cos(2up) Z. (26.25) 
y aye 
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We use these vector relations to evaluate the integrand in (26.21). First, we express 
the triple vector product n x [n x B] by its components and get with (26.22), (26.25) 


K - K- 
nx [nx B] = +x ee sin’ 0 cos* ~ COS pt; + —B sin @pt; 

4 ig 

K2 
+B (143 + 73 BOS 2a) sin? cos 0 cos °| 
4y? 
|| BS Bt geo. ‘ 
+y|——B sin’  singcos¢@ sin pt; 
4 


KR. 
+B (1+ —5 60S 2aph, :) sin? cos? sin °| (26.26) 
4y? 


na 


K.- 
+2Z |-+8 sin } cos 3} COSY COS Wpfy 
Y 


K2 
+B (1 + Gyi C08 2p! :) (cos? 3 — | 
Ay? 


This expression can be greatly simplified considering that the radiation is emitted 
into only a very small angle * < 1. Furthermore, we note that the deflection due 
to the wiggler field is in most practical cases very small and therefore K < y and 


p= (1 — i‘) ~ B. Finally, we carefully set 8 ~ 1 where this term does not 


appear as a difference to unity. With this and ignoring second order terms in # and 
K/y we get from (26.26) 


nx[nx Bl = (av cos g@ + po sin (ont) + (Bd sing) >. (26.27) 


The vector product in the exponent of the exponential function is just the product 
of (26.22) and (26.23) 


1 KB 7 2- 
=n p(t) = Bia sin} cos g cos (ptr) = @ +4 
Cc YOp 


sin 2m» ft, | cos? . 


Pp 
(26.28) 


Employing again the approximation } < 1 and keeping only linear terms we get 
from (26.28) 


= 


l = K, 
t, + —nr,(t,) = (1 — B cos?) — aad cos Y cos (Wpf,) — 8 7 
é 


sin (2@, t,) . 
a ee (2p tr) 


(26.29) 


With (26.4) and cos? = 1 — 5 07, the first term becomes 
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a 1 1 72 2.92 Wp 
Pee = (rok se ee (26.30) 


where we have defined the fundamental wiggler frequency w, by 


p) 2 
ji = — (26.31) 
"1+ 4K? + y2? 


or the fundamental wavelength of the radiation 


Xp 
A 14+5K?>+y°0 26.32 
l= 2? > ( +5 K+ yY *) ( ) 
in full agreement with (26.11). At this point, it is worth to remember that the term 
5K? becomes K? for a helical wiggler [2]. With (26.30), the complete exponential 


term —iw [t: + ‘nrp(t,)| in (26.21) can be evaluated to be equal to 


KBd K° 
=a Opt: — ad dlp cosy cos (Wptr) — tae — sin (2pft,) | , (26.33) 
| Y Wp 8y* 

and (26.21) can be modified with this expression into a form suitable for integration 
by inserting (26.27) and (26.30) into (26.21) for 


ew ro mc @ = 
0 ae e082) 


2 
x 


’ 


bad K 
i |» cosy + — sin (ont) x + (3 sing) y e*dt, 
- 4 


co 


where 


2 
. c= f_. oO. ot = Ko al COS @ COS (ptr) = a sin (2a || 
Y 


| 1 Wp y? Op 


We are now ready to perform the integration of (26.34) noticing that the 
integration over all times can be simplified by separation into an integral along the 
wiggler magnet alone and an integration over the rest of the time while the particle 
is traveling in a field free space. We write symbolically 


lee) 1Np/@p lee) 1Np/@p 
/ -| «40+ / (= 0)- f (K = 0). (26.35) 
=O0) —1Np/@p —0o —1Np/@p 


First, we evaluate the second integral for K = 0 which is of the form 


[ctr = 500), 


00 |x| 
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where 6(q@) is the Dirac 6-function. The value of the integral is nonzero only for 
w@ = Oin which case the factor w* in (26.34) causes the whole expression to vanish. 
The second integral is therefore zero. 

The third integral has the same form as the second integral, but since the 
integration is conducted only over the length of the wiggler magnet we get 


- IN, 
1Np/, ; sin —2 & 
p/ Pp ~i sts t = 2mNp 2y2 p 
e y dt, = "—#Ns ny. 
—1Np/ oO mp 
TNp/@p P 2p? wp 


(26.36) 


The value of this integral reaches a maximum of 27 = mn for @ — 0. From (26.34) 
we note the coefficient of this integral to include the ‘tele v = 1/y and the whole 
integral is therefore of the order or less than L,/(cy), where Ly = NpAp is the total 
length of the wiggler magnet. This value is in general very small somnpated to the 
first integral and can therefore be neglected. Actually, this statement is only partially 
true since the first integral, as we will see, is a fast varying function of the radiation 
frequency with a distinct line spectrum. Being, however, primarily interested in the 
peak intensities of the spectrum we may indeed neglect the third integral. Only 
between the spectral lines does the radiation intensity from the first integral become 
so small that the third integral would be a relatively significant although absolutely 
a small contribution. 

To evaluate the first integral in (26.35) with K 4 0 we follow Alferov [5] and 
introduce with (26.31) the abbreviations 


= oe (26.37a) 
K? B 
a+ 7 +p) (26.37b) 
to get from (26.34) the exponential functions in the form 
oar ghar Pe eta (26.38) 


The integral in the radiation power spectrum (26.34) has two distinct forms, one 
where the integrand is just the exponential function multiplied by a time independent 
factor while the other includes the sine function sin wp?, as a factor of the exponential 
function. To proceed further we replace the exponential functions by an infinite sum 
of Bessel’s functions 


p=oo 
elk sin — >. Jy(K) eP¥ (26.39) 


p=—00 
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and apply this identity to the first integral type in (26.34). Applying the iden- 
tity (26.39) also to the second and third exponential factors in (26.38), we get with 
efoss — e@sin(<+7/2) the product of the exponential functions 


oo CO 
é i (2 Opty a C COS Opt; a S sin 2wpt,) = > s ‘s (u) J,(v) eis gi Rv optr ; (26.40) 


m= n= 
—OO —0O 


where 


wW (00) (00) 
Ry, = —-—n-2m, u=—S, and v= —C. (26.41) 
@\ Q| Q| 


The time integration along the length of the wiggler magnet is straight forward 
for this term since no other time dependent factors are involved and we get 


1Np/a, af 5 i 
/ Bich ei (2m Joop dt, = 2mNp _ (xNpRo) . (26.42) 


1Np/Wp Wp TNpRy 


In the second form of the integrand, we replace the trigonometric factor, sin wpf;, 
by exponential functions and get with (26.42) integrals of the form 


1Np/@p 
P : —iRy Opt; 
SIN Wpt; € p™ dt, 


Np/@p 


1 MNp/@p : iR 

= -i- / (era er) eee (26.43) 
2 —1Np/@ 

pip 


_ Ny sin [ 7Np(Ru + 1] _ aN, sin [ Np (Ro - 1)] 
=4 -i 
®p Np (Ra + 1) Wp Np(Ra — 1) 


Both integrals (26.42) and (26.43) exhibit the character of multibeam interference 
spectra well known from optical interference theory. The physical interpretation 
here is that the radiation from the N, wiggler periods consists of N, photon beamlets 
which have a specific phase relationship such that the intensities are strongly 
reduced for all frequencies but a few specific frequencies as determined by the sms 
factors. The resulting line spectrum, characteristic for undulator radiation, is the 
more pronounced the more periods or beamlets are available for interference. To get 
amore complete picture of the interference pattern, we collect now all terms derived 
separately so far and use them in (26.34) which becomes with (26.38) 


dw mNp/p ‘i e ie 
iodo a les. [( o tA sin @pt;) ¥ + | 


-i 2 Oph Jivcos apt giusin 2p ? 
xe 1 Pe pre PT dt}, (26.44) 
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where a = sere reneP'w*, Ayo = cosy, A; = - and By = ¥ sing. Introducing the 


identity 06. 38), ‘the photon energy spectrum becomes 


dw 1Np/@p 
—_ = Ao + A1 Sin @pt,) & + Boy 
dw dQ Ue is aaa 
lo @) lo, e) 2 
rr 
x > > Jin (U) In(v) 27M eRe ot dy. (26.45) 


m>=—W n=—Co 


and after integration with (26.42) and (26.43) 


2mNp sin («Np R w) 
TNpRy 


dw 
snag =" aS > inti, (v) ei2™ 


m= n= 
—OO —CO 


Co co 


+¥A1Y > Y In(u) Inv) 082" (26.46) 
Bee 00 


- aN, | sin [Np (Ro + 1)] _ IN, sin [ 7 Np(Ro - 1)] 
i -i 
TNp(Rw + 1) Wp mNp(Ra — 1) 


2p 


2 
2nN, sin (7NpRo) 


+5 Bo 3 ah (u) Jn(v) et2™” en 


m= n= 
=00 —0O 


To determine the frequency and radiation intensity of the line maxima, we 
simplify the double sum of Bessel’s functions by selecting only the most dominant 
terms. The first and third sums in (26.46) show an intensity maximum for R, = 0 
at frequencies 


wo = (n+2m)o, (26.47) 


and intensity maxima appear therefore at the frequency @,; and harmonics thereof. 
The transformation of a lower frequency to very high values has two physical 
components. In the system of relativistic particles, the static magnetic field of the 
wiggler magnet appears Lorentz contracted by the factor y, and particles passing 
through the wiggler magnet oscillate with the frequency yw, in its own system 
emitting radiation at that frequency. The observer in the laboratory system receives 
this radiation from a source moving with relativistic velocity and experiences 
therefore a Doppler shift by the factor 2y. The wavelength of the radiation emitted in 
the forward direction, ? = 0, from a weak wiggler magnet, K < 1, with the period 
length A, is therefore reduced by the factor 2y?. In cases of a stronger wiggler 
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magnet or when observing at a finite angle 0, the wavelength is somewhat longer as 
one would expect from higher order terms of the Doppler effect. 

From (26.46) we determine two more dominant terms originating from the 
second term for R,, + 1 = 0 at frequencies 


o = (n+ 2m-—1) a, (26.48a) 
o=(n+2m+I1)a, (26.48b) 
respectively. The summation indices n and m are arbitrary integers between —oo 
and oo. Among all possible resonant terms we collect such terms which contribute 
to the same harmonic k of the fundamental frequency w,. To collect these dominant 
terms for the same harmonic we set @ = w; = k @; where k is the harmonic number 

of the fundamental and express the index n by k and m to get 

from (26.47): n=k—2m, 
from (26.48a): n=k—2m+1 (26.49) 
and from (26.48b): n=k—2m—-1. 


Introducing these conditions into (26.46) all trigonometric factors assume the 


sin(Np Aox/o) 
form Np Awg/1 


, where 
ao ek (26.50) 


and we get the photon energy spectrum of the kth harmonic for radiation from a 
single electron passing through an undulator 


5 2 
PWi(w) Fe mep N2 w? E air 


da dQ y? os TNpAax/ oi 
= ul 
x] +8Aq D> Jn (e) Seam (v) e! 27-2 
m=—OCoO 
[o,@} 
A ii = 
+$Bo D> Im(u) Je—am(v) e267 (26.51) 
m>=—CoO 
1 [o.@) 
A is a(k— 
+ 1x re. >» Jin(u) Jk-2m+1(V) el2rl een 
m=—Oo 
oO 2, 


1 lath 
“id SAl » Jin(u) Jk-2m-1 (v) el 27(k-2m—-1) 


m>=—OCo 
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Fig. 26.4 sr distribution for N, = 5 and N, = 100 


All integrals exhibit the resonance character defining the locations of the spectral 
lines. The (sin.x/x)-terms represents the line spectrum of the radiation. Specifically, 
the number N, of beamlets, here source points, determines the spectral purity of the 
radiation. In Fig. 26.4 the (sinx/x)-function is shown for Np = 5 and N, = 100. It 
is clear that the spectral purity improves greatly as the number of undulator periods 
is increased. This is one of the key features of undulator magnets to gain spectral 
purity by maximizing the number of undulator periods. 

The spectral purity or line width is determined by the shape of the (sin x/x)- 
function. We define the line width by the frequency at which sin x/x = 0 or where 
mNpAa,/w@, = 1 defining the line width for the kth harmonic 


A 1 
SP a (26.52) 
Wk KN» 


The spectral width of the undulator radiation is reduced proportional to the 
number of undulator periods, but reduces also proportional to the harmonic number. 

The Bessel functions J,,,(u) determine mainly the intensity of the line spectrum. 
For an undulator with K < 1, the argument u « K? < 1 and the contributions 
of higher order Bessel’s functions are very small. The radiation spectrum consists 
therefore only of the fundamental line. For stronger undulators with K > 1, higher 
order Bessel’s functions grow and higher harmonic radiation appears in the line 
spectrum of the radiation. 
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Summing over all harmonics of interest, one gets the total power spectrum. In the 
third and fourth terms of (26.51) we use the identities ie*!7/? = +1, Jm(u) ei” = 
J_m(u) and abbreviate the sums of Bessel’s functions by the symbols 


Y= DE Fen) Siam) (26.53a) 
De = DE Fen) Ye-2m—1 (0) + Siam 10) (26.53b) 


The total number of photons Npy emitted into a spectral band width Aw/w 
by a single electron moving through a wiggler magnet is finally with Npn(@) = 
W(a)/(ho) 


d92 O TN, Aa; / a 


: 2 
ANon() _ 42920 40 3 2 E (7NpAcx/ =| (26.54) 
2 
k=1 


(2y03 5°, cosy — Ky) FY + (2y8 >>, sin y) >< 
x ’ 
(1+ EK? + y292)’ 

where q@ is the fine structure constant and where we have kept the coordinate unit 
vectors to keep track of the polarization modes. The vectors x and y are orthogonal 
unit vectors indicating the directions of the electric field or the polarization of the 
radiation. Performing the squares does therefore not produce cross terms and the 
two terms in (26.54) with the expressions (26.53) represent the amplitude factors 
for both polarization directions, the o-mode and zr-mode respectively. 

We also made use of (26.50) and the resonance condition 


wo — ko + Aw, a Pd! = 2y? k 
Wp Wp © Op 143K? + p29?” 


(26.55) 


realizing that the photon spectrum is determined by the (sin x/x)?-function. For not 
too few periods, this function is very small for frequencies away from the resonance 
conditions. 

Storage rings optimized for very small beam emittance are being used as modern 
synchrotron radiation sources to reduce the line width of undulator radiation and 
concentrate all radiation to the frequency desired. The progress in this direction is 
demonstrated in the spectrum of Fig. 26.5 derived from the first electron storage ring 
operated at a beam emittance below 10nm at 7.1 GeV [7]. In Fig. 26.5 a measured 
undulator spectrum is shown as a function of the undulator strength K [8]. For a 
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Fig. 26.5 Measured frequency spectrum from an undulator for different strength parameters K [8] 


strength parameter K < | there is only one line at the fundamental frequency. 
As the strength parameter increases, additional lines appear in addition to being 
shifted to lower frequencies. The spectral lines from a real synchrotron radiation 
source are not infinitely narrow as (26.66) would suggest. Because of the finite size 
of the pinhole opening, some light at small angles with respect to the axis passes 
through, and we observe therefore also some signal of the even order harmonic 
radiation. 

Even for an extremely small pin hole, we would observe a similar spectrum as 
shown in Fig. 26.5 because of the finite beam divergence of the electron beam. The 
electrons follow oscillatory trajectories due not only to the undulator field but also 
due to betatron oscillations. We observe therefore always some radiation at a finite 
angle given by the particle trajectory with respect to the undulator axis. Figure 26.5 
also demonstrates the fact that all experimental circumstances must be included to 
meet theoretical expectations. The amplitudes of the measured low energy spectrum 
is significantly suppressed compared to theoretical expectations which is due to a 
Be-window being used to extract the radiation from the ultra high vacuum chamber 
of the accelerator. This material absorbs radiation significantly below a photon 
energy of about 3 keV. 

While we observe a line spectrum expressed by the (sinx/x)?-function, we also 
notice that this line spectrum is red shifted as we increase the observation angle 
v. Only, when we observe the radiation though a very small aperture (pin hole) 


912 26 Insertion Device Radiation 


100 


75 
a 
z \ 
2 50 \ angle integrated spectrum 
n 
q \ 
2 pin hole a eek 
25 spectrum - 


2.5 5 TS 10 12.5 15 
photon energy (keV) 


Fig. 26.6 Actual radiation spectra from an undulator with a maximum field of 0.2 T and a beam 
energy of 7.1 GeV through a pin hole and angle-integrated after removal of the pin hole [7] 


do we actually see this line spectrum. Viewing the undulator radiation through a 
large aperture integrates the linespectra over a finite range of angles 3} producing 
an almost continuous spectrum with small spikes at the locations of the harmonic 
lines. 

The difference between a pin hole undulator spectrum and an angle-integrated 
spectrum becomes apparent from the experimental spectra shown in Fig. 26.6 [7]. 
While the pin hole spectrum demonstrates well the line character of undulator 
radiation, much radiation appears between these spectral lines as the pin hole is 
removed and radiation over a large solid angle is collected by the detector. The pin 
hole undulator line spectrum shows up as mere spikes on top of a broad continuous 
spectrum. 

The overall spatial intensity distribution includes a complex set of different radi- 
ation lobes depending on frequency, emission angle and polarization. In Fig. 26.7 
the radiation intensity distributions described by the last factor in (26.54) 


fic (2y0Z cosy — KZ)? 
ok = (42K? + 292)? 


for the o-mode polarization and 


_ 2ybZ 1 sing)? 
ee (i+3 K2 + y292)2 
for the -mode polarization are shown for the lowest order harmonics. 


We note clearly the strong forward lobe at the fundamental frequency in o-mode 
while there is no emission in z-mode along the path of the particle. The second 
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Fig. 26.7 Undulator radiation distribution in o- and -mode for the lowest order harmonics 


harmonic radiation vanishes in the forward direction, an observation that is true for 
all even harmonics. By inspection of (26.54), we note that v = 0 for } = O and 
the square bracket in (26.53b) vanishes for all odd indices or for all even harmonics 
k. There is therefore no forward radiation for even harmonics of the fundamental 
undulator frequency. 

A contour plot of the first harmonic o- and m-mode radiation is shown in 
Fig. 26.8. There is a slight asymmetry in the radiation distribution between the 
deflecting and nondeflecting plane as one might expect. It is obvious that the pin hole 
radiation is surrounded by many radiation lobes not only from the first harmonics 
but also from higher harmonics compromising the pure line spectrum for larger 
apertures. 
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Fig. 26.8 Contour plot of the 
first harmonic o-mode (solid) 
and z-mode (dashed) 
undulator radiation 
distribution 


26.2.4 Line Spectrum 


To exhibit other important and desirable features of the radiation spectrum (26.54), 
we ignore the actual frequency distribution in the vicinity of the harmonics and set 
Aa, = 0 because the spectral lines are narrow for large numbers of wiggler periods 
Np. Further, we are interested for now only in the forward radiation where v=0 
keeping in mind that the radiation is mostly emitted into a small angle (7) = 1/y. 

There is no radiation for the z-mode in the forward direction and the only 
contribution to the forward radiation comes from the second term in (26.54) of the 
o-mode. From (26.41) we get for this case with w /w, =k 


kK? 


— 44 2K2 and vo = 0. (26.56) 


uo 


The sums of Bessel’s functions simplify in this case greatly because only the lowest 
order Bessel’s function has a nonvanishing value for v9 = 0. In the expression for 
2/7 all summation terms vanish except for the two terms for which the index is zero 
or for which 


k—2m—1=0, or k—2m+1=0 (26.57) 


and 


yy = > J—-m(u) [Je—2m—1 (0) + Je—2m+.1(0)] 


m>=—COo 


J_ist (uo) + J_iti (uo). (26.58) 
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The harmonic condition (26.57) implies that k is an odd integer. For even 
integers, the condition cannot be met as we would expect from earlier discussions 
on harmonic radiation in the forward direction. Using the identity J_, = (—1)"J, 
and (26.56), we get finally with Nj» = W/ha the photon flux per unit solid angle 
from a highly relativistic particle passing through an undulator 


lee) : 2 
_ apie” K2 Se (= Np mult) @ 
= P @ (1 re 1 2? = TN, Aa, / a 


(26.59) 
where the JJ-function is defined by 


kK? kK? 
JJ = J 1-1) pay es ~ Frag 4+ 2K2 . (26.60) 


The amplitudes of the harmonics are given by 


Kk? K? ‘ 


A(K) = G+iKe? 


(26.61) 


The strength parameter greatly determines the radiation intensity as shown 
in Fig. 26.9 for the lowest order harmonics. For the convenience of numerical 
calculations the values A;,(K) are tabulated for odd harmonics in Table 26.1. For 
weak magnets (K < 1) the intensity increases with the square of the magnet field or 
undulator strength parameter. There is an optimum value for the strength parameter 
for maximum photon flux depending on the harmonic under consideration. In 
particular, radiation in the forward direction at the fundamental frequency reaches 
a maximum photon flux for strength parameters K ~ 1.3. The photon flux per unit 
solid angle increases like the square of the number of wiggler periods N,, which is 
a result of the interference effect of many beams concentrating the radiation more 
and more into one frequency and its harmonics as the number of interfering beams 
is increased. 
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Table 26.1 Amplitudes K 
Ax(K) for k = 1,3,5,7,9, 11 01 10.010 


0.2 | 0.038 
0.4 | 0.132 
0.6 


08 [0322 | 0087 | 0.015 | 0.002 [0 
1.0 0.368 | 0.179 | 0.055 | 0.015 | 0.004 
1.2 |0.381 | 0.276 | 0.128 
ri 
1.8 0.320 | 0.423 | 0.371 0.286 | 0.206 | 0.142 
2.0 0.290 | 0.423 | 0.413 | 0.354 | 0.285 | 0.220 
5.0 0.071 0.139 | 0.188 | 0.228 | 0.261 0.290 


10.0 | 0.019 | 0.037 | 0.051 | 0.064 | 0.075 | 0.085 
20.0 |0.005 | 0.010 | 0.013 | 0.016 | 0.019 | 0.022 


The radiation opening angle is primarily determined by the (sin x/x)?-term. We 
define the opening angle for the kth harmonic radiation by v% being the angle for 
which sinx/x = 0 for the first time. In this case x = a or Np Aa,/@, = 1. With 


wpe OY? 2y? Ao _ Np ky? 0p 
O1 = pyre? ag oe T+ 202 and = ye eel 1+] K2+y207 


1 or after ie for 0, 


1+ 5K? 
j= _. (26.62) 
y°(KNp — 1) 
Assuming an undulator with many periods (KNp > 1) the rms opening angle of 
undulator radiation is finally 


1 /1+1k2 
1 2 
aw HH = : 26.63 
m wa * y 2kNp ( ) 


Radiation emitted into a solid angle defined by this small opening angle 
AQ= ae? (26.64) 


is referred to as the forward radiation cone. The opening angle of undulator radiation 
becomes more collimated as the number of periods and the order of the harmonic 
increases. On the other hand, the radiation cone opens up as the undulator strength 
K is increased. We may use this opening angle to calculate the total photon flux of 
the kth harmonic within a bandwidth ae into the forward cone 


Aw kK? . 


——_ 26.65 
@ 1+ 5K? = 


Noh (x) | 59 = 3770Np 
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where @ = ka . The radiation spectrum from an undulator magnet into the 
forward direction has been reduced to a simple form exhibiting the most important 
characteristic parameters. Utilizing (26.61) the number of photons emitted into a 
band width fe from a single electron passing through an undulator in the kth 
harmonic is 


Aw l+ 5K? 
Non (k)|y <9 = $7ON, Po A) (26.66) 
Ok 


Equation (26.66) is to be multiplied by the number of particles in the electron 
beam to get the total photon intensity. In case of a storage ring, particles circulate 
with a high revolution frequency and we get from (26.66) by multiplication with 
I/e, where J is the circulating beam current, the photon flux 


ANon (x I Aw 1+ 4K? 
dp (x) aN, ~— 2 A(R). (26.67) 
dt 9=0 e Wk k 


The spectrum includes only odd harmonic since all even harmonics are suppressed 
through the cancellation of Bessel’s functions. This photon flux represents fully 
spatial coherent radiation as long as the beam divergence does not significantly 
contribute to the photon divergence (26.63). 


26.2.5 Spectral Undulator Brightness 


Similar to Chap. 27 we define the spectral brightness of undulator radiation as the 
photon density in six-dimensional phase space. The actual photon brightness is 
reduced from the diffraction limit due to betatron motion of the particles, transverse 
beam oscillation in the undulator, apparent source size on axis and under an oblique 
angle. All of these effects tend to increase the source size and reduce brightness. 

The particle beam cross section varies in general along the undulator. We assume 
here for simplicity that the beam size varies symmetrically along the undulator with 
a waist in its center. From beam dynamics it is then known that, for example, the 
horizontal beam size varies like o2 = 02) + 0'p9s”, where ojo is the beam size at the 
waist, 0; the divergence of the beam at the waist and —$L Ss 5L the distance 
from the waist. The average beam size along the undulator length L is then 


(02) = 0% + yO" tol’. (26.68) 


Similarly, due to an oblique observation angle with respect to the (y, z)-plane 
or y with respect to the (x, z)-plane we get a further additive contribution 20L to the 
apparent beam size. Finally, the apparent source size is widened by the transverse 
beam wiggle in the periodic undulator field. This oscillation amplitude is from (26.6) 
a=A,K/(2ry). 
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Collecting all contributions and adding them in quadrature, the total effective 
beam-size parameters are given by 


Oe ie 
Op, = 407 + Ob. + (=) + bopvl + 3’, (26.69a) 
Ope = 407 + Oy (26.69b) 
2 12 2 ApK : 1 2 2 1 Zpe 
Ory — 70; ++ 9v0,y + ony + 72% 0,yL + ag V L zs (26.69c) 
Ony = 507 + Oy » (26.69d) 


where the particle beam sizes can be expressed by the beam emittance and betatron 
‘ 2: 2 * a ‘a a 
function as of = eB, 0, = €/f, and the diffraction limited beam parameters are 


oy = /A/L, and o, = VAL/(2z). 


26.3 Elliptical Polarization 


During the discussion of bending magnet radiation in Chap. 25 and insertion 
radiation in this chapter, we noticed the appearance of two orthogonal components 
of the radiation field which we identified with the o-mode and zr-mode polarization. 
The z-mode radiation is observable only at a finite angle with the plane defined 
by the particle trajectory and the acceleration force vector, which is in general 
the horizontal plane. As we will see, both polarization modes can, under certain 
circumstances, be out of phase giving rise to elliptical polarization. In this section, 
we will shortly discuss such conditions. 


26.3.1 Elliptical Polarization from Bending Magnet Radiation 


The direction of the electric component of the radiation field is parallel to the 
particle acceleration. Since radiation is the perturbation of electric field lines from 
the charge at the retarded time to the observer, we must take into account all 
apparent acceleration. To see this more clear, we assume an electron to travel counter 
clockwise on an orbit travelling from say a 12-0’clock position to 9-o’clock and then 
6-0’clock. Watching the particle in the plane of deflection, the midplane, we notice 
only a horizontal acceleration which is maximum at 9-o’clock. Radiation observed 
in the midplane is therefore linearly polarized in the plane of deflection. 

Now we observe the same electron at a small angle above the midplane. Apart 
from the horizontal motion, we notice now also an apparent vertical motion. Since 
the electron follows pieces of a circle this vertical motion is not uniform but exhibits 
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acceleration. Specifically, at 12-0’clock the particle seems to be accelerated only 
in the vertical direction (downward), horizontally it is in uniform motion; at 9- 
o’clock the acceleration is only horizontal (towards 3-o’clock) and the vertical 
motion is uniform; finally, at 6-0’ clock the electron is accelerated only in the vertical 
plane again (upward). Because light travels faster than the electron, we observe 
radiation first coming from the 12-o0’clock position, then from 9-o’clock and finally 
from 6-0’ clock. The polarization of this radiation pulse changes from downward to 
horizontal (left-right) to upward which is what we call elliptical polarization where 
the polarization vector rotates with time. Of course, in reality we do not observe 
radiation from half the orbit, but only from a very short arc segment of angle +1/y. 
However, if we consider Lorentz contraction the 9-o’clock trajectory in the particle 
system looks very close to a half circle radiation into +180 degrees which appears in 
the laboratory system within +1/y. Therefore the short piece of arc from which we 
observe the radiation has all the features just used to explain elliptical polarization 
in a bending magnet. 

If we observe the radiation at a small angle from below the midplane, the 
sequence of accelerations is opposite, upward-horizontal (left-right)-downward. The 
helicity of the polarization is therefore opposite for an observer below or above the 
midplane. This qualitative discussion of elliptical polarization must become obvious 
also in the formal derivation of the radiation field. Closer inspection of the radiation 
field (25.87) from a bending magnet 


3 e€@ ; 1 . YOK 3(€) 
Ew) = —2 ey + y?? a o ~i- Su, 
(@) es rm + y°v°) en(t) 2/3(E) Ug —i es agi 
(26.70) 


shows that both polarization terms are 90° out of phase. As a consequence, the 
combination of both terms does not just introduce a rotation of the polarization 
direction but generates a time dependent rotation of the polarization vector which 
we identify with circular or elliptical polarization. In this particular case, the 
polarization is elliptical since the mz-mode radiation is always weaker than the 
o-mode radiation. The field rotates in time just as expected from the qualitative 
discussion above. The linear dependence of the second term in (26.70) also defines 
the helicity proportional to the sign of #. 

We may quantify the polarization property considering that the electrical field is 
proportional to the acceleration vector B. Observing radiation at an angle with the 
horizontal plane, we note that the acceleration being normal to the trajectory and 
in the midplane can be decomposed into two components By and B: as shown in 
Fig. 26.10a. 

The longitudinal acceleration component together with a finite observation angle 
v gives rise to an apparent vertical acceleration with respect to the observation 
direction and the associated vertical electric field component is 


E,« By = nyB: + NyNyBx . 
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Fig. 26.10 Acceleration along an arc-segment of the particle trajectory in (a) a bending magnet, 
(b) polarization as a function of time, and (c) radiation field components as a function of time 


An additional component appears, if we observe the radiation also at an angle with 
respect to the (x, y)-plane which we, however, ignore here for this discussion. The 
components n,, 1, are components of the observation unit vector from the observer 
to the source with n, = —sinv. We observe radiation first from an angle 3 > 0. 
The horizontal and vertical radiation field components as a function of time are 
shown in Fig. 26.10b. Both being proportional to the acceleration (Fig. 26.10a), we 
observe a symmetric horizontal field E, and an antisymmetric vertical field £,. The 
polarization vector (Fig. 26.10c) therefore rotates with time in a counter clockwise 
direction giving rise to elliptical polarization with lefthanded helicity. Observing 
the radiation from below with 3% < 0, the antisymmetric field switches sign and 
the helicity becomes righthanded. The visual discussion of the origin of elliptical 
polarization of bending magnet radiation is in agreement with the mathematical 
result (26.70) displaying the sign dependence of the 7-mode component with ?. 

The intensities for both polarization modes are shown in Fig. 26.11 as a function 
of the vertical observation angle 3 for different photon energies. Both intensities 
are normalized to the forward intensity of the o-mode radiation. From Fig. 26.11 
it becomes obvious that circular polarization is approached for large observation 
angles. At high photon energies both radiation lobes are confined to very small 
angles but expand to larger angle distributions for photon energies much lower than 
the critical photon energy. 

The elliptical polarization is left or right handed depending on whether we 
observe the radiation from above or below the horizontal mid plane. Furthermore, 
the helicity depends on the direction of deflection in the bending magnet or the sign 
of the curvature sign(1/p). By changing the sign of the bending magnet field the 
helicity of the elliptical polarization can be reversed. This is of no importance for 
radiation from a bending magnet since we cannot change the field without loss of 
the particle beam but is of specific importance for elliptical polarization state of 
radiation from wiggler and undulator magnets. 
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Fig. 26.11 Relative intensities of o-mode and z-mode radiation as a function of vertical 
observation angle @ for different photon energies 


26.3.2 Elliptical Polarization from Periodic Insertion Devices 


We apply the visual picture for the formation of elliptically polarized radiation in 
a bending magnet to the periodic magnetic field of wiggler and undulator magnets. 
The acceleration vectors and associated field vectors are shown in Fig. 26.12a, b 
for one period and similar to the situation in bending magnets we do not expect 
any elliptical polarization in the mid plane where } = 0. Off the mid-plane, we 
observe now the radiation from a positive and a negative pole. From each pole we 
get elliptical polarization but the combination of lefthanded polarization from one 
pole with righthanded polarization from the next pole leads to a cancellation of 
elliptical polarization from periodic magnets (Fig. 26.12c). In bending magnets, this 
cancellation did not occur for lack of alternating deflection. Since there are generally 
an equal number of positive and negative poles in a wiggler or undulator magnet 
the elliptical polarization is completely suppressed. Ordinary wiggler and undulator 
magnets do not produce elliptically polarized radiation. 


Asymmetric Wiggler Magnet 


The elimination of elliptical polarization in periodic magnets results from a 
compensation of left and righthanded helicity and we may therefore look for an 
insertion device in which this symmetry is broken. Such an insertion device is 
the asymmetric wiggler magnet which is designed similar to a wavelength shifter 
with one strong central pole and two weaker poles on either side such that the 
total integrated field vanishes or { By ds = 0. A series of such magnets may be 
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Fig. 26.12 Acceleration vectors along one period of (a) a wiggler magnet, (b) associated 
polarization vectors, and (c) corresponding radiation fields 
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Fig. 26.13 Asymmetric wiggler magnet 


aligned to produce an insertion device with many poles to enhance the intensity. The 
compensation of both helicities does not work anymore since the radiation depends 
on the magnetic field and not on the total deflection angle. A permanent magnet 
rendition of an asymmetric wiggler magnet is shown schematically in Fig. 26.13 
The degree of polarization from an asymmetric wiggler depends on the desired 
photon energy. The critical photon energy is high for radiation from the high field 
pole (e«*) and lower for radiation from the low field pole (e>). For high photon 
energies (€p, * €{) the radiation from the low field poles is negligible and the 
radiation is essentially the same as from a series of bending magnets with its 
particular polarization characteristics. For lower photon energies (er Seger ) 
the radiation intensity from high and low field pole become similar and cancellation 
of the elliptical polarization occurs. At low photon energies (€pn = «,) the intensity 
from the low field poles exceeds that from the high field poles and we observe again 


elliptical polarization although with reversed helicity. 


Elliptically Polarizing Undulator 


The creation of elliptically and circularly polarized radiation is important for 
a large class of experiments using synchrotron radiation and special insertion 
devices have therefore been developed to meet such needs in an optimal way. 
Different approaches have been suggested and realized as sources for elliptically 
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Fig. 26.14 Permanent magnet arrangement to produce elliptically polarized undulator radia- 
tion [11] 
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Fig. 26.15 3-D view of an elliptically polarizing undulator, EPU [11] 


polarized radiation, among them for example, those described in [9, 10]. All 
methods are based on permanent magnet technology, sometimes combined with 
electromagnets, to produce vertical and horizontal fields shifted in phase such that 
elliptically polarized radiation can be produced. Utilizing four rows of permanent 
magnets which are movable with respect to each other and magnetized as shown in 
Fig. 26.14, elliptically polarized radiation can be obtained. 

Figure 26.15 shows the arrangement in a three dimensional rendition to visualize 
the relative movement of the magnet rows [9, 11]. 
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Fig. 26.16 Undulator for elliptically polarized radiation [10] 


The top as well as the bottom row of magnet poles are split into two rows, each of 
which can be shifted with respect to each other. This way, a continuous variation of 
elliptical polarization from left to linear to right handed helicity can be obtained. By 
shifting the top magnet arrays with respect to the bottom magnets the fundamental 
frequency of the undulator radiation can be varied as well. Figure 26.16 shows a 
photo of such a magnet [10]. 


Problems 


26.1 (S). Consider an undulator magnet with a period length of Ap = 5cm ina 
7GeV storage ring. The strength parameter be K = 1. What is the maximum 
oscillation amplitude of an electron passing through this undulator? What is the 
maximum longitudinal oscillation amplitude with respect to the reference system 
moving with velocity B ? 


26.2 (S). An undulator with 50 poles, a period length of Ap = 5cm and a strength 
parameter of K = 1 is to be installed into a 1 GeV storage ring. Calculate the 
focal length of the undulator magnet. Does the installation of this undulator require 
compensation of its focusing properties? How about a wiggler magnet with K = 5? 
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26.3 (S). Consider the expression (26.67) for the photon flux into the forward cone. 
We also know that the band width of undulator radiation scales like Aw/w « 1/Np. 
With this, the photon flux (26.67) becomes independent of the number of undulator 
periods!? Explain in words, why this expression for the photon flux is indeed a 
correct scaling law. 


26.4 (S). A hybrid undulator is to be installed into a 7 GeV storage ring to produce 
undulator radiation in a photon energy range of 4keV to 15 keV. The maximum 
undulator field shall not exceed a value of By < 2 T at a gap aperture of 10 mm. The 
available photon flux in the forward cone shall be at least 10 % of the maximum flux 
within the whole spectral range. Specify the undulator parameters and show that the 
required photon energy range can be covered by changing the magnet gap only. 


26.5 (S). Consider an electron colliding head-on with a laser beam. What is the 
wavelength of the laser as seen from the electron system. Derive from this the 
wavelength of the “undulator“ radiation in the laboratory system. 


26.6 (S). An electron of energy 2 GeV performs transverse oscillations in a wiggler 
magnet of strength K = 1.5 and period length A, = 7.5cm. Calculate the maximum 
transverse oscillation amplitude. What is the maximum transverse velocity in units 
of c during those oscillations. Define and calculate a transverse relativistic factor 
y.. Note, that for K = 1 the transverse relativistic effect becomes significant in the 
generation of harmonic radiation. 


26.7 (S). Calculate for a 3 GeV electron beam the fundamental photon energy for 
a 100 period-undulator with K = 1 and a period length of Ap = Scm. What is 
the maximum angular acceptance angle 3 (as determined by adjustable slits) of the 
beam line, if the radiation spectrum is to be restricted to a bandwidth of 10 %? 


26.8 (S). Strong mechanical forces exist between the magnetic poles of an undu- 
lator when energized. Are these forces attracting or repelling the poles? Why? 
Consider a £ =1 m long undulator with a pole width w = 0.1m, 15 periods each 
Ap = 7cm long and a maximum field of By) = 1.5T. Estimate the total force 
between the two magnet poles ? 


26.9 (S). In Chap. 23 we mentioned undulator radiation as a result of Compton 
scattering of the undulator field by electrons. Derive the fundamental undulator 
wavelength from the process of Compton scattering. 


26.10 (S). The undulator radiation intensity is a function of the strength parameter 
K. Find the strength parameter K for which the fundamental radiation intensity 
is a maximum. Determine the range of K-values for which the intensity of the 
fundamental radiation is within 10 % of the maximum. 


26.11 (S). Show from (26.54) that along the axis (v = 0) radiation is emitted only 
in odd harmonics. 


26.12 (S). Show from (26.51) that undulator radiation does not produce elliptically 
polarized radiation in the forward direction (0 = 0). 
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26.13 (S). Try to design a hybrid undulator for a 3 GeV storage ring to produce 
4keV to 15 keV photon radiation. Is it possible? Why not? Optimize the undulator 
parameters such that this photon energy range can be covered with the highest flux 
possible and utilizing lower order harmonics (order 7 or less). Plot the radiation 
spectrum that can be covered by changing the gap height of the undulator. 


26.14 (S). An undulator is constructed from hybrid permanent magnet material 
with a period length of A, = 5.0cm. What is the fundamental wavelength range 
in a 800 MeV storage ring and in a 7 GeV storage ring if the undulator gap is to be 
at least 10 mm? 


26.15 (S). Determine the tuning range for a hybrid magnet undulator in a 2.5 GeV 
storage ring with an adjustable gap g > 10mm. Plot the fundamental wavelength 
as a function of magnet gap for two different period lengths, A, = 15mm and 
Ap = 75mm. Why are the tuning ranges so different? 


26.16. Consider a 26-pole wiggler magnet with a field B, (T) = 1.5 sin (32) and 


?P 
a period length of A, = 15cm as the radiation source for a straight through photon 


beam line and two side stations at an angle } = 4mrand } = 8 mr ina storage ring 
with a beam energy of 2.0GeV. What is the critical photon energy of the photon 
beam in the straight ahead beam line and in the two side stations? 


26.17. Verify the relative intensities of o-mode and z-mode radiation in Fig.26.12 
for two quantitatively different pairs of observation angles 3 and photon energies 
&/Ec. 


26.18. Design an asymmetric wiggler magnet assuming hard edge fields and 
optimized for the production of elliptical polarized radiation at a photon energy of 
your choice. Calculate and plot the photon flux of polarized radiation in the vicinity 
of the optimum photon energy. 


26.19. Calculate the total undulator (N, = 50,A, = 4.5cm, K = 1.0) radiation 
power from a 200mA, 6 GeV electron beam. Pessimistically, assume all radiation 
to come from a point source and be contained within the central cone. This is a safe 
assumption for the design of the vacuum chamber or mask absorbers. Determine the 
power density at a distance of 15m from the source. Compare this power density 
with the maximum acceptable of 10 W/mm/”. How can you reduce the power density, 
on say a mask, to the acceptable value or below? 


26.20. Use the beam and undulator from problem 26.19 and estimate the total 
radiation power into the forward cone alone. What percentage of all radiation falls 
within the forward cone? [hint: make reasonable approximations to simplify the 
math but keep the result reasonably close to the correct answer]. 


26.21. Derive an expression for the average velocity component B = v/c of a 
particle traveling through an undulator magnet of strength K. 
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Chapter 27 
Free Electron Lasers 


Synchrotron radiation is emitted when electromagnetic fields exert a force on a 
charged particle. This opens the possibility to apply external fields with specific 
properties for the stimulation of electrons to emit more radiation. Of course, not 
just any external electromagnetic field would be useful. Fields at some arbitrary 
frequency would not work because particles interacting with such fields would in 
general be periodically accelerated and decelerated without any net energy transfer. 
The external field must have a frequency and phase such that a particle may 
continuously lose energy into synchrotron radiation. Generally, it is most convenient 
to recycle and use spontaneous radiation emitted previously by the same emission 
process. In this part, we will discuss in some detail the process of stimulation as it 
applies to a free electron laser. 

In a free electron laser (FEL) quasi-monochromatic, spontaneous radiation 
emitted from an undulator is recycled in an optical cavity to interact with the 
electron beam causing accelerations which are periodic with the frequency of the 
undulator radiation. In order to couple the particle motion to the strictly transverse 
electromagnetic radiation field, the path of the electrons is modulated by periodic 
deflections in a magnetic field to generate transverse velocity components. In a 
realistic setup, this magnetic field is provided in an undulator magnet serving both as 
the source of radiation and the means to couple to the electric field. The transverse 
motion of the particle results in a gain or loss of energy from/to the electromagnetic 
field depending on the location of the particle with respect to the phase of the 
external radiation field. The principle components of a FEL are shown in Fig. 27.1. 

An electron beam is guided by a bending magnet unto the axis of an undulator. 
Upon exiting the undulator, the beam is again deflected away from the axis by a 
second bending magnet, both deflections to protect the mirrors of the optical cavity. 
Radiation that is emitted by the electron beam while travelling through the undulator 
is reflected by a mirror, travels to the mirror on the opposite side of the undulator 
and is reflected there again. Just as this radiation pulse enters the undulator again, 
another electron bunch joins to establish the emission of stimulated radiation. The 
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Fig. 27.1 Free electron laser setup (schematic) 


electron beam pulse consists of a long train of many bunches, much longer than 
the length of the optical cavity such that many beam-radiation interactions can be 
established. 


27.1 Small Gain Regime 


We may follow this process in great detail observing an electron as it travels 
through say the positive half period of its oscillatory trajectory. During this phase, 
the electron experiences a negative acceleration from the undulator magnet field 
which is in phase with the oscillation amplitude. Acceleration causes a perturbation 
of the electric fields of the electron as was discussed in detail in Chap. 12. This 
perturbation travels away from the source at the speed of light, which is what we 
call electromagnetic radiation. For an electron, the electric radiation field points in 
the direction of the acceleration. As the electron travels through the positive half 
wave, it emits a radiation field made of half a wave. Simultaneously, this radiation 
field, being faster than the electron, travels ahead of the electron by precisely 
half a wavelength. This process tells us that the radiation wavelength is closely 
related to the electron motion and that it is quasi-monochromatic. Of course, for 
a strong undulator the sinusoidal motion becomes perturbed and higher harmonics 
appear, but the principle arguments made here are still true. Now, the electron 
starts performing the negative half of its oscillation and, experiencing a positive 
acceleration, emits the second halfwave of the radiation field matching perfectly the 
first halfwave. This happens in every period of the undulator and when the electron 
reaches the end of the last period a radiation wave composed of N, oscillations exists 
ahead of the electron. This process describes the spontaneous radiation emission 
from an electron in an undulator magnet. 

The radiation pulse just created is recycled in the optical cavity to reenter the 
undulator again at a later time. The length of the optical cavity must be adjusted very 
precisely to an integer multiple of both the radiation wavelength and the distance 
between electron bunches. Under these conditions, electron bunches and radiation 
pulses enter the undulator synchronously. A complication arises from the fact that 
the electrons are contained in a bunch which is much longer than the wavelength of 
the radiation. The electrons are distributed for all practical purposes uniformly over 
many wavelengths. For the moment, we ignore this complication and note that there 
is an electron available whenever needed. 
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We pick now an electron starting to travel through a positive half wave of its 
oscillation exactly at the same time and location as the radiation wave starts its pos- 
itive field halfperiod. The electron, experiences then a downward acceleration from 
the radiation field. During its motion the electron is continuously accelerated until 
it has completed its travel through the positive half oscillation. At the same time, 
the full positive have wave of the radiation field has moved over the electron. At 
this moment the electron and the radiation field are about to start their negative half 
periods. Continuing its motion now through the negative half period, the electron 
still keeps loosing energy because it now faces a negative radiation field. The fact 
that the radiation field “slides over the electron just one wavelength per undulator 
period ensures a continuous energy transfer from electron to the radiation field. 
The electron emits radiation which is now exactly in synchronism with the existing 
radiation field and the new radiation intensity is proportional to the acceleration or 
the external radiation field. Multiple recycling and interaction of radiation field with 
electron bunches results therefore in an exponential increase in radiation intensity. 

At this point, we must consider all electrons, not just the one for which the 
stimulation works as just described. This process does not work that perfect for all 
particles. An electron just half a wavelength behind the one discussed above would 
continuously gain energy from the radiation field and any other electron would loose 
or gain energy depending on its phase with respect to the radiation. It is not difficult 
to convince oneself that on average there may not be any net energy transfer one way 
or another and therefore no stimulation or acceleration. To get actual stimulation, 
some kind of asymmetry must be introduced. 

To see this, we recollect the electron motion in a storage ring in the presence of 
the rf-field in the accelerating cavity. In Sect. 9.2.1 we discussed the phase space 
motion of particles under the influence of a radiation field. The radiation field of 
a FEL acts exactly the same although at a much shorter wavelength. The electron 
beam extends over many buckets as shown in Fig.27.2 and it is obvious that in its 
interaction with the field half of the electrons gain and the other half loose energy 
from/to the radiation field. The effect of the asymmetry required to make the FEL 
work is demonstrated in Fig.27.3. Choosing an electron beam energy to be off- 
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Fig. 27.2 Interaction of an electron beam (on-resonance energy) with the radiation field of a FEL. 
The arrows in the first bucket indicate the direction of particle motion in its interaction with the 
electromagnetic field 
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Fig. 27.3 Interaction of an electron beam (off-resonance energy) with the radiation field of a FEL 


resonance by a small amount, the energy gain and losses for all electrons within a 
bucket becomes unbalanced and we can choose a case where all electrons on average 
loose energy into (FEL) or gain energy (particle acceleration by a radiation field) 
from the radiation field. The arrows in the first bucket of Fig.27.3 show clearly 
the imbalance of energy gain or loss. What it means to choose an electron beam 
energy off-resonance will be discussed in more detail in the next section, where we 
formulate quantitatively the processes discussed so far only qualitatively. 

We concentrate on the case where only a small fraction of the particle energy 
is extracted such that we can neglect effects on particle parameters. This regime is 
called the “small-gain“ regime. Specifically, we ignore changes in the particle 
energy and redistribution in space as a consequence of the periodic energy 
modulation. 


27.1.1 Energy Transfer 


Transfer of energy between a charged particle and an electromagnetic wave is 
effected by the electric field term of the Lorentz force equation and the amount 
of transferred energy is 


Aw =e f B.dr=e [ Evar, (27.1) 
L 

where EF, is the external field or the Laser field in the optical cavity and v the particle 
velocity. In free space v 1 E,, and therefore there is no energy transfer possible 


(AW = 0). Generating some transverse velocity v through periodic deflection in 
an undulator, we get from (26.3) 


y= +Beo sin (kpz) . (27.2) 


where ky = 27/Ap. The external radiation field can be expressed by 


E, = Eo, cos (wt —kzt Yo) (27.3) 
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and the energy transfer is 
AW = e | evar = ef wtvar 


K : 
= eB es Po i. cos (@Lt — kz + Go) sin (Kpz) dt (27.4) 


II 


K 
5eBc—Eo. if (sinWt —sinW) dr, 
y 


where 


W* = ot— (kL +h) z+ oo. (27.5) 


The energy transfer appears to be oscillatory, but continuous energy transfer can 
be obtained if either Yt = const. or Y= const. In this case 


dw= ; 
ae = WO (ki + kp) z=0 (27.6) 


and we must derive conditions for this to be true. The velocity z is from (26.3) 


i. ae K? 
— Bec + Poe COs (2kpz) 4 (27.7) 


where the average drift velocity B c is defined by 


dz; K? 
—=fc= 1———'}. 27.8 
F = Be= Be(1- 75) @7.8) 
We modify slightly the condition (27.6) and require that it be true only on average 
ie (k soe, (27.9) 
qe ae 
or 
K2 
(k. + kp) B (1 = a) —kL =0. (27.10) 
y? 


With B ~ 1 —1/2y? and ky < ky, (27.10) becomes 


k 1 : 1 S 1};+k,~0 (27.11) 
o 2y? 4y? ie, : 
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or fory > 1 
— — (1+ 5K?) +k, =0. (27.12) 


Equation (27.12) can be met only for the +sign or for 


ky 1 p2 
kb = ay2 (+ 2K’), (27.13) 
which is identical to the definition of the fundamental undulator radiation wave- 
length 


Ap 1 p2 
AL = 32 (1+ 5K’). (27.14) 


Radiation at the fundamental wavelength of undulator radiation guarantees 
a continuous energy transfer from the particles to the electromagnetic wave or 
stimulation of radiation emission by an external field. For this reason, it is most 
convenient to use spontaneous undulator radiation as the external field to start the 
build-up of the free electron laser. 


27.1.2 Equation of Motion 


The energy gain dW of the electromagnetic field is related to the energy change dy 
of the electron by 


d 1 dW 
a (27.15) 
dz mc? Bcdt 
or with (27.4) 
d KE 
oY = ———™ (sinwt — sinw-). (27.16) 
dz 2ymc? 
With the substitution sin.x = —Re (ie”) 
d KE : eo 
AY os Op (ie”* — ie” i (27.17) 
dz = 2ymc? 
In YW = ot — (k, £ ky) z(t) + Go, we replace the location function z(f) by its 
expression (26.5) 
- 2 _- 
t)h= t in (2k, Bct), 27.18 
ct) = Bet_+ Asin hyper ons) 
i -_— FO 


< Bet 
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composed of an average position z and an oscillatory term. With ky < ky 


dy — eBKEo. ’ _ ALK? =) | [iv _ iv 
re = wee Re fiexp i 8y2k, sin (2kpz) [e —e | (27.19) 
and the phase 
Ww = at— (kp + kp) 7+ Go. (27.20) 


With the definition exp (ixsing) = \"—*°° J, (x)ei”? we get finally 


n=—oo 


d KE n=+ kKLK?\ = og —_ 
al = eBK Fou. j » a de L e@l2nkpz (i** _ el” ) : (27.21) 
dz 2ymc? n=—00 8y7kp 


The infinite sum reflects the fact that the condition for continuous energy transfer 
can be met not only at one wavenumber but also at all harmonics of that frequency. 
Combining the exponential terms and sorting for equal wavenumbers hk,, where h 
is an integer, we redefine the summation index by setting 


k= 
nk, + ky = hk) —> n= —— (27.22a) 
h+1 
2nky —kyp = hky —> n= “i (27.22b) 


dy = eBK Eo. ac 


[Jags (%) = Jag (| Re ie eree le eral (27.23) 


dz 2ymc? 
h=1 —_ 
=— sin[ (kL +h kp) Z-oLt+9o] 
where x = 7 — . Using the JJ-function (26.60) the energy transfer is 
dy epK Eo. aa : 
—-=- JJ W, 27.24 
dz 2ymc? 2 Pa ( ) 


h=1 


For maximum continuous energy transfer sin Y = const. or 


dz 
Bee te (ki. i hkp) is Oo. (27.25) 
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1 K? 
= (kL + hk) ere c ae — ck 


dv ck, 172 
— = 14+ 5K hkyc = 0, 
ap aye 5 (1+ 3K’) +hige = 
where we assumed that kr, >> hkp, which is true since Ap >> Ay and the harmonic 
number of interest is generally unity or a single digit number. This condition 
confirms our earlier finding (27.14) and extends the synchronicity condition to 
multiples / of the fundamental radiation frequency 


AL= 


37h (1+ 4K’). (27.26) 


The integer therefore identifies the harmonic of the radiation frequency with 
respect to the fundamental radiation. 

In a real particle beam with a finite energy spread we may not assume that all 
particles exactly meet the synchronicity condition. It is therefore useful to evaluate 
the tolerance for meeting this condition. To do this, we define a resonance energy 


1+ 4k’), (27.27) 
which is the energy at which the synchronicity condition is met exactly. For any 
other particle energy y = y; + dy we get from (27.25) and (27.27) 


dw 5 
oe ht, (27.28) 
dz Vr 


With the variation of the energy deviation 4 8y - te | — we = Me | and (27.24) 
“Tr * “Tr 


we get from (27.28) after differentiating with respect to z 
ey d dy __ehk pKE ou 


— = 2hk JJ| sin W(z 27.29 
dz? Pad Vr yom ce @, : 


where, for simplicity, we use only one harmonic h. This equation can be written in 
the form 

ey 

res + 27 sinW =0 (27.30) 
exhibiting the dynamics of a harmonic oscillator. Equation (27.30) is known as the 
Pendulum equation [1] with the frequency 


hkyKE 
oe ig); (27.31) 


yeme* 
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While interacting with the external radiation field, the particles perform harmonic 
oscillations in a potential generated by this field. This situation is very similar to the 
synchrotron oscillation of particles in a storage ring interacting with the field of the 
rf-cavities as was discussed in Sect. 9.2.1. In phase space, the electron perform 
synchrotron oscillations at the frequency {2 while exchanging energy with the 
radiation field. 


27.1.3 FEL-Gain 


Having established the possibility of energy transfer from an electron to a radiation 
field, we may evaluate the magnitude of this energy transfer or the gain in field 
energy per interaction process or per pass. One pass is defined by the interaction of 
an electron bunch with the radiation field while passing through the entire length 
of the undulator. The gain in the laser field AW, = —mc?’n, Ay, where Ay is the 
energy loss per electron and pass to the radiation field and n, the number of electrons 
per bunch. The energy in the laser field 


Wi = feoEoLV. (27.32) 


where V is the volume of the radiation field. With this, we may define the average 
FEL-gain for the Ath harmonic by 


AW, mene (A a 2mc?y;Ne 
Gh = u == €0 : Me a es — a (Av), : (27.33) 
We TEOLV eghkp Eo, V . 


making use of (27.28). (AW”),,, is the average value of AW’=—W for all electrons 
per bunch, where W% is defined at the beginning of the undulator and ¥ at the end 
of the undulator. To further simplify this expression, we use (27.31), solve for the 
laser field 


24,292 
me; QF 


Py = ——t . 
Or ehKky [JI] 


(27.34) 


and define the electron density ny» = n-/V .Here we have tacitly assumed that the 
volume of the radiation field perfectly overlaps the volume of the electron beam. 
This is not automatically the case and must be achieved by carefully matching the 
electron beam to the diffraction dominated radiation field. If this cannot be done, 
the volume V is the overlap volume, or the larger of both. With this the FEL-gain 
becomes 


87 e? mphK7kp [JJ] 


G — 
me?y3 2} 


(Av), (27.35) 
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Numerical evaluation of (AW’),, can be performed with the pendulum equation. 
Multiplying the pendulum equation 2W and integrating we get 


Ww”? — 2Q? cos W = const. (27.36) 
Evaluating this at the beginning of the undulator 
w? — W? = 22) (cos W — cos), (27.37) 


which becomes with Ws = 2hkp a 


2 
wy? = (2h me ) + 227 (cos W — cos %) (27.38) 
Finally, 
yr—v Qe ¥ 
W' (z) = 2hk / 1+ —+ ‘ _ Icos W (z) — cos WJ, (27.39) 
an 2B (y — y,)? 
or with 
w= hkjLy —, (27.40) 
Vr 
where Ly = NpAp is the undulator length, 
: 2w BO 
W (Zz) = —41/ 14+ [cos W (z) — cos WH]. (27.41) 
Li 2w2 


We solve this by expansion and iteration. For a low gain FEL, the field Epox is 
weak and does not influence the particle motion. Therefore 2, « 1 and (27.41) 
becomes 


2 Io? 
We a [ + z ae (cos VW — cos W) 


8 bk . 
Says (cosW —cosW%)° + ...]. (27.42) 


In the lowest order of iteration W’ = Wi = = and AV = 0 for all particles, 


which means there is no energy transfer. For first order approximation, we integrate 
Wz) = ae to get Wy) (z) = as + W and 


2 
LQ 
2w 


AW), = (Ly) — Yj (0) = [cos (2w + YW) — cos H] + O(2) (27.43) 
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from (27.42). Averaging over all initial phases occupied by electrons 0 < W% < 27 


Loe 1 f* 


N\ 
aus Qw 2n 


[cos (2w + W) —cosW] d% = 0. (27.44) 


No energy transfer to the laser field occurs in this approximation either. We need 
a still higher order approximation. The higher order correction to Y% (s) = Y% (s) + 
5W (s) is from (27.42) 


LQ? 
5W) — ar [cos W —cos %] , (27.45) 


and the correction to Y% (s) is 


Loe 7 Z 
bWyy) = aa [eo (+: + v) — cos v| ds 
0 
PR? | 
= ae [sin (2w + W) — sin W% — 2wcos Wl. (27.46) 
w 


The second order approximation to the phase is then %j(z) = z+ W + 


iy 
6W%1) and using (27.42) in second order as well we get 
Q2 
AW = aA [cos (2w + % + SW) — cos | 
304 
— = SE [cos (2w + %) —cos %) +... (27.47) 


where in the second order term only the first order phase %(z) = ote +W is used. 
The first term becomes with 6W4) K WB + 2w 


cos (2w + WF + 6%) — cos % (27.48) 
= cos (2W + WY) — 6W sin (2w + YW) — cos % (27.49) 


and 


Li Qt { 8w? 
16w3 (12:22 
— 2sin (2w + YW) [sin (2w + W) — sin % — 2wcos | 


AW; = [cos (2w + W) — cos WY] 


— [cos (2w + YH) — cos]? +.. (27.50) 
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Now, we average over all initial phases assuming a uniform distribution of 
particles in z or in phase. The individual terms become then 


(cos (2w + WB) — cos %) = 
(sin? Qw + W)) = 
(sin (2w + W) sin% 


) cos (2w) (27.51a) 
(sin (2w + W) cos) = 
(cos (2w + W) cos W) = 


With this 


(Aws) = —22L [1 — cos (2w) — wsin 2w)] (27.52) 


: : 3 d /sinw\2 
and finally with [1 — cos (2w) — wsin (2w)] /w? = = (=2*) 


Ww 


B3Q4 d (sinw\? 
(Avs) = = =( ~ ) : (27.53) 


The FEL-gain is finally from (27.35) 


enph K? Lk, 2 
ea > WP — (=) (27.54) 
Ww 


iE: 


where we may express the particle density np by beam parameters as obtained from 
the electron beam source 


= es (27.55) 
m= — = ——, . 
VY 2n028 
where o is the radius of a round beam. With these definitions, and f= cen./£ the 
electron peak current, the gain per pass becomes 


P3arh7it KP d (a) 


(27.56) 
cota = & (1 + 4K2)°/? dw 


Gp = 
Ww 


The gain depends very much on the choice of the electron beam energy through 
the function (27.40), which is expressed by the gain curve as shown in Fig. 27.4. 

There is no gain if the beam energy is equal to the resonance energy (y = y;). 
As has been discussed in the introduction to this chapter, we must introduce an 
asymmetry to gain stimulation of radiation or gain and this asymmetry is generated 
by a shift in energy. For a monochromatic electron beam maximum gain can be 
reached for w ~ 1.2. A realistic beam, however, is not monochromatic and the 
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Fig. 27.4 Free electron laser 
: ne 
gain curve G ax (s2¥) 


lw 


narrow gain curve indicates that a beam with too large an energy spread may not 
produce any overall gain. There is no precise upper limit for the allowable energy 
spread but from Fig. 27.4 we see that gain is all but gone when |w| 2 5. We use 
this with (27.40) and (27.27) to formulate a condition for the maximum allowable 
energy spread 


2y7 AL 


; 2757 
1+ 4K? 


| om < 
x 

For efficient gain the geometric size of the electron beam and the radiation field 
must be matched. In (27.55) we have introduced a volume for the electron bunch. 
Actually, this volume is the overlap volume of radiation field and electron bunch. 
Ideally, one would try to get a perfect overlap by forming both beams to be equal. 
This is in fact possible and we will discuss the conditions for this to happen. First, 
we assume that the electron beam size varies symmetrically about the center of the 
undulator. The beam size develops like 


2 
eZ= On + (<) Lz (27.58) 
00 


with distance z from the beam waist. To maximize gain we look for the minimum 
average beam size within an undulator. This minimum demands a symmetric 
solution about the undulator center. Furthermore, we may select the optimum beam 
size at the center by looking for the minimum value of the maximum beam size 
within the undulator. From do? / dog = 0, the optimum solution is obtained for 
a= 5 Lu = 0) /€ = Bo. For Bo = 5 Lu the beam cross section grows from a value of 
oe in the middle of the undulator to a maximum value of 20, at either end. 

The radiation field is governed by diffraction. Starting at a beam waist, the growth 
of the radiation field cross section due to diffraction is quantified by the Rayleigh 
length 


=a, (27.59) 
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where wo is the beam size at the waist and A the wavelength. This length is defined 
as the distance from the radiation source (waist) to the point at which the cross 
section of the radiation beam has grown by a factor of two. For a Gaussian beam, 
we have for the beam size at a distance z from the waist 


w(2) = w+ 02, (27.60) 
where 0 = “ is the divergence angle of the radiation field. This is exactly the 
same condition as we have just discussed for the electron beam assuming the center 
of the undulator as the source of radiation. 


27.2 High Gain Free Electron Laser 


We have discussed the interaction of an electron beam with an external electromag- 
netic field and found that repeated recycling of the photon beam by reflecting mirrors 
this photon beam intensity can be made to grow until it is big enough to modulate 
the electron beam into microbunches at a distance equal to the radiation wavelength. 
This interaction works only at wavelength where good reflectors are available. This 
is, for example, not possible for UV and x-rays. The question arises what would 
happen if an electron beam would travel through a very long undulator instead 
of being reflected many times. This is the principle of self-amplified-spontaneous- 
emission or SASE. 

The goal is to look for electron dynamics which leads to micro bunching at 
the wavelength of interest. Any bunch radiates coherently at wavelengths equal or 
longer than the bunch length as was discussed in Sect. 24.7. This coherent radiation 
scales like the square of the number of particles per bunch ; rather than linear with 
Np as is the case of incoherent radiation emitted at wavelength shorter than the bunch 
length. Since the number of electrons per bunch can be very large we gain a large 
increase in the photon intensity. Actually this is the highest photon intensity one 
can extract from a bunch of electrons. Unfortunately, it is technically not possible 
to generate bunches at visible or shorter wavelength and preserve such bunches 
along a beam line. The way out is to possibly generate microbunches at the place 
of the radiation source. This was possible in the FEL and we will now discuss this 
possibility in the realm of SASE. 


27.2.1 Electron Dynamics ina SASE FEL 


In this section we aim at producing coherent radiation at any wavelength specifically 
at very short wavelength like x-rays without the support of reflecting mirrors. 
The electron beam appears in bunches which are long compared to the desired 
wavelength. Although the electron distribution is assumed to be uniform, there will 
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be statistical fluctuations due to the finite number of electrons per bunch. This beam 
is travelling through an undulator of as yet undetermined length. An x-ray single 
pass FEL consists basically of a high brightness electron source, a linac followed 
by a long undulator, both with parameters to produce the desired photon radiation 
wavelength as the fundamental wavelength of the undulator. 

In a perfect uniform beam every electron radiates in the undulator at an arbitrary 
phase resulting in in-coherent radiation. The radiation travels faster than the 
electrons and therefore the radiation field will interact with them. This interaction 
however is incoherent and will not lead to anything. Now we assume that along the 
bunch there is a density fluctuation or whisker which is very short of the order of 
the desired wavelength. That whisker radiates coherently at wavelengths equal to 
the temporal length of the whisker. The coherent radiation, although very small at 
first, interacts with the electron beam. However, only the fundamental wavelength 
as determined by electron beam energy and undulator properties will constructively 
grow from undulator period to period. As this fundamental radiation travels over the 
bunch ahead of the whisker it interacts coherently with the electrons and modulates 
their energy periodically at the fundamental wavelength. This energy modulation 
together with the deflection in the undulator leads to a density modulation at 
the desired wavelength. This process occurs because electrons which have been 
decelerated by the photon field get deflected more in the undulator field while 
electrons being accelerated by the photon field get deflected less. Both effects lead 
to a density modulation. 

Such whiskers can and do occur at any place along the bunch. Therefore a number 
of coherent fields will be created and grow along the undulator. Eventually though 
there will be a strongest radiation field and all others, being spread over statistical 
phases, will decoher and vanish in the one largest spike. This spike keeps growing 
along the undulator and reaches a point from which on the photon field is strong 
enough to microbunch the electron beam at which point the intensity does not grow 
anymore. The SASE-FEL has reached its saturation. At the same time the energy 
change introduced by the photon field is big enough to spoil the SASE principle 
leading also to saturation. The theory of SASE-FEL has been first developed by [2] 
as a 1-D theory. Later this was extended to a 3-D theory [3, 4]. We will however 
restrict ourselves in this text to the 1-D approximation which is well met for a high 
quality electron beam. Where ideal parameters are not available some degradation 
of the photon beam parameters must be accepted and the actual characteristics are 
mostly determined by numerical simulations. 

The increase of photon intensity along the undulator is exponential because the 
bunching depends on the photon intensity itself and is given by 


Ton & Ip exp (=) (27.61) 
G 
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where Lg the power gain length and Jp the initial spontaneous coherent intensity for 
an undulator of length Lg (26.66). The gain length is defined by 


Au 
Io = (27.62) 
4/329 
where p is the FEL-parameter 
Kies" 
a ( 2) (27.63) 
4./2 2ncy 
Here the JJ-function is defined by (26.60) with the argument x = cig 2 = 


E [Amc? rene, is the plasma frequency and n, the electron density. Tacitly we have 


assumed a planar undulator which could also generate third harmonic radiation 
albeit at a lower intensity (~ 1 %) while a helical undulator would only produce the 
fundamental harmonic. Numerical simulations indicate that for presently achievable 
electron beam parameters about 20 gain lengths are required to reach saturation 
while the FEL-parameter is of the order of 1073. 

The peak photon pulse power at saturation is expected [2] to be 


Pest, 2 


E 
——— = pett Ipeak—, (27.64) 
V 27 Trms aaa: Xe 


tg peak = 


which is about 14 GW for SLAC-LCLS parameters [5] (Pete = 2.9: 107+, Tpams = 77 
fs, Ipeak = 3,400 A, E = 14.35 GeV). Simulations give a somewhat lower power of 
about 8 GW by taking all inefficiencies like increase of beam emittance along linac 
and undulator into account. 

A high photon intensity (27.61) demand a short gain length for a given length of 
the facility while the gain length (27.62) itself is only related to the undulator period 
length and FEL-period. The period length is limited to a minimum of a few cm 
by technical considerations and the available linear accelerator energy and desired 
radiation wavelength. The FEL-parameter (27.63) is greatly determined by the 
electron density, e.g. by the electron beam emittance and bunch length. Therefore 
a small beam emittance and bunch length is of paramount importance. In addition 
the electron beam emittance must be close to the photon emittance for the desired 
wavelength to get maximum overlap of both beams. Theoretical considerations 
also require that the beam energy spread should be less than the FEL-parameter 
(| <p). 

To make SASE work well, a very high quality electron beam must be produced 
and preserved along the linac and undulator. In the following sections we will shortly 
discuss the requirements and the solutions employed in the first few x-ray facilities. 
The development in this newest accelerator system is still flowing and experimental 
experience from the first facilities contribute to a vigorous development especially 
toward more compact solutions. 
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Different from storage ring which can provide radiation to many users simul- 
taneously an x-ray laser can do so only for one user at one wavelength at a time. 
This is acceptable because of the extraordinary properties of the radiation in terms 
of photon intensity, brightness, coherence and femto-second pulse length. A high 
desired pulse repetition rate obviously pushes the facility designs more and more to 
superconducting technology. 


27.2.2. Electron Source 


The electron source determines the ultimate performance of the x-ray laser. For 
maximum radiation intensity the number of electrons per microbunch should be 
as large as possible. This strongly points to a laser gun where it is possible to 
generate a large electron charge within a pulse of less than 100fs. Not to loose 
spatial coherence the beam emittance must be very small of the order of less than 
10-!° m at the end of the linac. At source energies of y = 1 the lowest possible 
normalized emittance is 1 — 2 10~° m at high electron intensity of about 1 pC per 
bunch. Lower emittances are possible for lower charges. The design requirements 
are determined by many detailed simulations with specially developed numerical 
programs to find solutions close to desired performance. 


27.2.3 Beam Dynamics 


Along the linac and undulator the beam should be focused as much as possible to 
maximize the electron density. However, if the beam size is too small diffraction 
effects will appear. Therefore there is an optimum beam size which can be realized 
by quadrupole focusing in a FODO channel. Numerical simulations are required to 
determine the optimum beam size for the parametrization of the FODO channel. 
The focusing requirements must also include the effect of beam steering which is 
stronger in strong focusing FODO channels. 

To reach a realistic gain length a high peak current or short bunch length in the 
fs regime must be achieved. That is not possible with present day technology and 
bunch compression schemes must be included in the beam dynamics design. 

In the low energy section of the linac (up to 200-300 MeV) the electron bunch 
is accelerated “off-crest” to obtain a mostly linear correlation of energy with phase 
along the electron bunch in preparation for the bunch compression system. There 
is a small non-linearity left from the sinusoidal variation of the acceleration field. 
Simulations show that part of this non-linearity can be eliminated by deceleration 
in a higher harmonic accelerating section. If the main linac operates at 3 GHz then 
this linearizing section could operate in the X-band or about 12 GHz where suitable 
power sources exist [6]. The decelerating in the X-band section is small and has no 
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detrimental effect on the overall beam dynamics. After passing through the X-band 
section the beam travels through a four-bend bunch compressor. 

Following Liouville’s theorem the bunch compression is obtained in exchange 
with energy spread. In order not to increase the energy spread too much two bunch 
compression systems must be employed to reach the desired short bunch length. 
After the first bunch compression the beam is further accelerated (still “off-crest’’) to 
a higher energy and the energy spread is reduced due to adiabatic damping. At some 
intermediate energy a second four-bend bunch compressor is installed. The choice 
of the intermediate energy should be chosen such that the remaining acceleration is 
enough to reduce the beam energy spread by adiabatic damping to the final value of 
a ~ p for optimum SASE. No beam heater is necessary here because the bunch 
length is already much shorter and the non-linearity is very small. Both bunch 
compressors must be designed such as not to perturb beam parameters like beam 
emittance too much. The Rs¢-term is therefore chosen to be about the same in both. 
Final distribution of compression is determined though by numerical simulations. 

The resulting beam after the second bunch compressor includes now a very 
high peak current which can drive a micro-bunching instability [7, 8] thus possibly 
ruining emittance and energy spread. Therefore a “beam heater” is installed just 
before the second bunch compressor. This beam heater is a short and strong wiggler 
magnet which by emission of synchrotron radiation increases the incoherent energy 
spread. While this seems to be the wrong method only a very small insignificant 
increase of the energy spread is required to suppress the instability. After the second 
bunch compressor the beam is ready to be accelerated “on-crest” to the final energy. 

A significant problem arises if the bunch length is reduced too much such 
that coherent radiation (CSR) can be emitted with detrimental effect on the beam 
emittance and energy spread. Other problems arise from the interaction of the 
beam with surface resistance of the vacuum chamber (resistive wall effect). As a 
consequence vacuum chamber materials with low wall resistance should be used, 
e.g. aluminum rather than steel or copper-plated steel. In addition the surface must 
be polished to reduce the roughness which can cause beam instabilities. Satisfactory 
polishing to a roughness of well below 100 nm must be followed. 

The specific techniques described here are not the only way to solve problems. 
The interested reader is therefore encouraged to review the design reports of various 
X-FEL facilities. The performance is determined by simulation of the electron 
beam propagating through linac and undulator as well as the simulation of the 
photon built-up in the undulator. While it is possible to calculate order of magnitude 
parameters theoretically, small effects from actual particle distribution from source 
to end can significantly affect the outcome. Therefore the whole process must 
be simulated and any undesirable effect be studied and possibly eliminated or 
corrected. 
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27.2.4 Undulator 


The undulator parameters determine together with the electron beam energy the 
wavelength of the photon beam. This is the fundamental wavelength of the undulator 
and at reduced intensity one can contemplate the third-harmonic. To reduce the 
undulator length for a desired wavelength a short period length is desired. There are, 
however, technical limits for period length below 2—3cm. When the gap aperture 
becomes close to the period length the field drops off rapidly. The period length 
in the SLAC-LCLS is 3cm and the desired undulator strength K = 3.71 which 
requires an electron beam energy of 14.35 GeV to reach a fundamental wavelength 


of 1.5 A. This high undulator strength requires a very small aperture of 6 mm which 
is acceptable for a linac beam because no Gaussian tails must be preserved for 
lifetime. 

The built-up of photon intensity occurs exponentially from noise and therefore 
many gain length are needed to get the intensity into desired values. In other words, 
the undulator must be very long. In the SLAC-LCLS case the undulator length is 
120 m long of which theoretically 91 m are required to reach saturation. Such a long 
undulator cannot be built in one piece and is therefore broken down into shorter 
pieces of, in this case, 3.42 m. This breakup allows some space for beam monitoring 
and beam control. 


Problems 


27.1 (S). Consider an electron travelling through an undulator producing radiation. 
Show, that the radiation front moves ahead of the electron by one fundamental 
radiation wavelength per undulator period. 


27.2 (S).. Why does a helical undulator not produce higher harmonics? 


27.3 (S). From the peak power at saturation derive the number of x-ray photons 


(ex ~ 104 eV) per electron. For the SLAC-LCLS K = 3.711, Ap = 3cm and Ny = 


3,070. Compare this with incoherent radiation. For the band width use ce = =: 
Pp 
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Solutions 


Solutions for Chap. 1 


1.1 We start with the total energy and solve for the momentum cp = 
V E2 = (me?) = me? /y? 1 = yme?/1— 1/7? = B (Exin + mc?) . In terms 
of kinetic energy cp = 7 (Exin + mc?) _ (me?) = ,/Ej,, + 2mc?Exin and the 


kinetic energy is Exin = E — mc” = mc? (y — 1). For very large energies we get 
with y >> 1 for the momentum cp ~ E and for the kinetic energy Exin ~ E. For 
B < 1 the kinetic energy is with y—1 ~ 5 B? just Exin = tmv?. The non-relativistic 
momentum is from p = 1B (Exin + mc?) ~ Bmc = mv. 


1.2 From Maxwell’s equations we have VE = zp. We integrate this over a 
cylindrical volume including part or all of the charge and get with Gauss’s Integral 
Theorem: [ VEdV = ¢ Eda = E,2zrL, where L is the length of the cylindrical 
beam considered. The electrical field has for symmetry reasons only a radial 
component. The r.h.s. of Maxwell’s equation is then the integral over the volume 


contained within the surface used on the IL.h.s. Be fpdv = earl for (r < R) 
and RL for (r > R), and the radial electrical field component is E, = eat 


for (r < R) and cae for (r > R). Similarly we get for the magnetic field after 
integration {V x BAV = 2 fudV = 2x’ for (r < R) and 2AxR'L for 
(r > R). Since v = (0,0, v,), symmetry restricts the r.h.s. to only a z-component 
and the field to only a g-component. We get [ V x BdV = [ Beoay = By Loser: 
Solving for the field, we get By = 5 opr for (r < R) and 1 opp for (r > R). 


1.3 We integrate Maxwell’s equation VE = ee over a cylindrical volume con- 


centric to the beam. The I.h.s. becomes [ VEdV = ¢ Eda = E,2mrL, where 
da is an element of the co-centric cylindrical surface and L an arbitrary length 
along the beam axis. Since an infinitely long beam is assumed, only a radial 


© Springer International Publishing Switzerland 2015 949 
H. Wiedemann, Particle Accelerator Physics, Graduate Texts in Physics, 
DOI 10.1007/978-3-319-18317-6 


950 Solutions 


electric field ae cae exists. The rh.s. is integrated over the same cylinder 
nt a exp (- 5) rdr = 2x ZL? [! — exp (-s)] and the radial electric 


field component is finally E, = a — al = exp (—# f)|. In a similar way the 
magnetic field can be obtained. Only the azimuthal component is non-zero given 
by By = LopBL [! — exp (-=)]. The fields vanish for r + 0 and for r = o are 


E,(o) = £o [1 — exp (—4)] and By («) = popBo [1 — exp (—4)]. 


1.4 The circulating beam current is defined by i = enfrey = env/C where n is 
the number of particles circulating, frey is the revolution frequency, v the particle 
velocity and C the accelerator circumference. The number of particles representing a 
current of 1A are:n = iC/ (Ev) = 6.2458-10!*. The ejected beam resembles a pulse 
with a pulse current of 1A since particles are assumed to be distributed uniformly. 
The pulse length is given by the revolution time tT = S = 1.0007 ws assuming 
fB = 1. The synchrotron produces ten pulses of 1.0007 ws duration and at a pulse 
current of 1A. The average beam current is therefore igyg = 10-7- 1A= 10 WA. 


1.5 The bending radius p of a particle’s path due to a force F is given by the 
2 
equality of this force with the centrifugal force. 2 = F or p = B’ymc?/F. The 


gravitational force is F = f ue = 1.6397- 10-*°kg m s~?,where the proton mass 
m = 1.67262-10-?’ kg, the mass of earth M = 5.98-10**kg, and the earth’s radius 
= 6.380- 10° m. Numerically, the gravitational constant is f = 6.67259-107!! m3 


- 's~? and for a 1 eV proton (y = 1 + sa3h5s = 1 + 1.0661 - 10~°) the velocity 
aw 2Ekin ~— 4.6175 - 10~>. With these parameters, the bending radius is 


p = 1.955 -10’m and therefore negligible compared to any bending radius 
occurring in a realistic beam transport. Equivalent electromagnetic fields can be 
derived from the Lorentz equation. The electrical field equivalent to the gravitational 
force is Ey. = F/e = 1.023- 10-7 V/m and the corresponding magnetic field is 
B= F/ (ecB) = 7.393: 10~'* T. The ratio of electrical to magnetic field is 13, 837. 
For an intergalactic 10 TeV proton the gravitational force is the same and therefore 
the bending radius is increased by the increase in the factor B*y = 4.696 - 10° 
and the bending radius is p = 9.18 - 10!°km. The required electrical field to bend 
the same does not depend on the particle energy, while the required magnetic field 
scales inversely proportional to 6 and is therefore reduced to B = 3.414- 107!°T. 
The field ratio finally has changed from 13, 837 to 2.996 - 108 = c making the 
magnetic field the more efficient field to bend relativistic particles. This is indeed a 
small field, actually about a million times smaller than intergalactic magnetic fields 
of some 107!" T. Cosmic rays therefore are more affected by intergalactic magnetic 
fields than by gravitational field. 


1.6 The fields at the surface of the beam are E, = aa and By = 5 LopBR. 
The charge density for the cylindrical slug is p = e=a, = 1.275- 107 C/m?. 
The electrical field on the surface of the beam is then (6 1) EF, = #£R = 


2€0 


1.44-10!! V/m and the magnetic field is By = 5 opBR = 480T. The peak electrical 
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current is defined by i = Q/t, where Q = e- 10!° is the total charge, and t is 


the duration of the current pulse t = + = 3.3356-107!*s. The peak current is 


then J = 480.33 A. Two particle beams either attract or repel each other depending 
on whether we use an e*-e~ -system or an e-e~ or et-e*-system. For y >> 1 the 
forces due to electrical and magnetic field are the same and act along a line including 
the particle in one beam and the center of charge of the other beam. The longitudinal 


PBR? 
260 r 


where now r = 10um. From problem 1.5 we have for the curvature F’/ (ymv’) 
and the deflection angle is then 9 = 74 “ = 1.152-10~° rad. This deflection is 


ym 
significant and can be used as a diagnostic means to probe the closeness of both 


beams which eventually must be steered to collide head-on. 


. 


forces cancel as both beams pass each other and the radial forces are F. = e 


1.7 Plane waves can be expressed by a = do exp [i (w — kr)], where the wave propa- 
gation vector k = kn, k = w/c and n the unit vector in the direction of wave propa- 
gation. Applied to the fields (FE, B) and using Amperes law V x E = — B we get on 
the Lh.s. VxXE = V exp|i(@ — kr)|xEo =iV (kr) x E =ikV (nr) x E =ik (n x E) 


The r.h.s. is -8 =iwB. Equating both sides gives finally n x E = cB. 


1.8 We multiply two 4-vectors a and b and apply a Lorentz transformation: 


ai" = abet + athe + akbt — att 
= ayb, + anby + y? (a3 — Bas) (b3 — Bhs) — y? (Bas — aa) (Bb3 — ba) 


= ayb; + anby + y” (1 — B”)a3b3 — y* (1 — B’)agba 
— ——— —| 
=] =1 


= ayb; + azb2 + a3b3 — agby 


face ee 2g ~y! . os 
1.9 The 4-acceleration a= ys + 0% (va) or in component form a= 
(Gx, ay, Gz, ia,) we get @, = y?a, + 0B: (88) y= oa =a =v Ba, 
where a is the ordinary acceleration. The other components can be obtained in 
a similar way. Experimental verification through, for example, observation of 
synchrotron radiation parameters. 


1.10 We formulate in laboratory frame the 4-vectors before and after scattering. To 
describe electron and photons we use the 4-velocity m = (y, yu) with u = v/c 
and energy-momentum 4-vector ck = (w, ck) . In the lab frame i, = (y, yuLZ) and 
ck, = (@L,—@ 2) where Z is a unit vector in the direction of photon motion. We 
assume both the electron and photons to travel along the z-axis. After scattering, 
i, = (y, yur2) and ck, = (ay, w,2), where w, is the frequency of the outgoing 
photon. Here, we have assumed that the photon energy is much less than the electron 
energy. The product of both 4-vectors is Lorentz invariant and is therefore the 
same before and after scattering, or wy (1 + u_) = wy, (1 — uy) and solving for the 
scattered frequency w, = a 9 = 4@_y? where we made use of y~? = 1 — uj, 
and uy, ~ 1. 
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1.11 Here, the only difference to problem 1.10 is that the undulator field does 
not move, or that ck = (@,,0), where w@, = 27c/A, and um, = (y, yuyz). After 
scattering ck, = (wy, @,Z) and uw, = (y, yuiZ) as in problem 1.10. Equating again 
the Lorentz invariant products gives @, = @, (1 — uy) or @y = Oui ro) 2oy?. 
This is the fundamental radiation frequency emitted by electrons from an undulator. 


1.12 The total proton energy at the end of the 200 MeV linac is E = Exin + mc? = 


1, 138.27 MeV and the momentum is cp = \ E? — (mc?)” = 644.44 MeV. Finally 
the velocity is 8 = $ = 0.56616 or about half the velocity of light. 


1.13 Due to Lorentz contraction the length of the linac is reduced by the factor y 

which along the anes is y(z) = yo + page = Yo + 39.139- z. The initial 

energy is — = 1.1547 and the final ener. L) = 27,418. Now, the 
Se ae cere gy y (L) 


integrated length of the linac is £ = i. ree ae a In (Yo + 39.139 2) = 
0.29458 m. In the electron system the 3 km SLAC linac has contracted to some 
30cm. Now coasting, a tube of 3km would contract even more to a = 


0.10942 mx 11cm. 


1.14 The invariant center of mass energy of colliding particles is defined by E2,, = 
~e->d (cpi)” . For the collision of a positron with a target electron assumed to 
be at rest this evaluates to E2,, = (ymc? + me?)> — (yBmc?)” = =2(y +1) m’c4 or 
Eom = J2(y + 1)mc?. This is also the available energy to produce new particles 
since no particles must be conserved in a positron-electron collision. In an electron- 
electron collision the available energy would be only E,yai) = Ecm — 2mc* because 
the lepton number must be conserved in the collision. The same calculation for 
head-on collision of such particles would produce a center of mass energy of Eym = 
2ymc?. Obviously, head-on collisions provide more available energy to produce new 
particles. 


1.15 From the invariant E2,, = )°E? — >° (cpi)” we get for a particle colliding 


with a target particle at rest E2,, = (ye? + myc?)” - (yBmc?)” = (yme?)” + 
2ymc?mec + (mec?) a (yBmc?)’, where m, is the mass of the target particle. The 
available energy to produce a y/J particle must be 3.1 GeV. In case of a proton 
colliding with a target proton, the center of mass energy must be at least Egy = 
3.1 GeV+mc*, because the proton number must be preserved. With m = m, we 
get Euan = ce (y + 1)mc* = 3.1 + 2mc? and solving for the particle energy we 
getty = 1 (Su et 2) — 1 = 13.071. The proton energy must therefore be at least 
Ey = 12. 26d GeV which was available at the Brookhaven AGS. In case of a positron 
colliding with a target electron the center of mass energy is also the available energy 
since the lepton number in this case is zero. Therefore, ,/2 (y + 1)mc? = 3.1 GeV 


and the minimum positron energy y = 4 (oy — 1 = 1.84- 107. This energy 
is not available at any existing particle accelerator. Only in an electron-positron 
colliding beam storage ring like SPEAR is it possible in head-on collisions to reach 
sufficient center of mass energy to produce a w/J particle. 
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1.16 The pion lifetime will be lengthened by the factor y. Therefore, the pion 
lifetime at 20 MeV is t7.29 = 29.759ns and at 100MeV t,z.199 = 44.679n. The 
velocities of the pions are B29 = 0.48473 and Bio9 = 0.81277 for 20MeV and 
100 MeV respectively. The distances traveled are C29 = t29ch29 = 4.3245m and 
£io9 = 10.887m. This is a significant difference when trying to transport a pion 
beam from the target to the patient. 


1.17 We look for the minimum kinetic energy necessary to perform the reaction, 
which means, the resulting particles have after collision no kinetic energy left. 
The length of the 4-vector (cp,iE) = p, after the collision is therefore —16M? 
which must be equal to the length of the 4-vector before collision or —16M? = 
(Pi + Pan) = Phy t2PinP2n+P3, = —2M?+2p1 pry and pi yPry = —7M?. 
For the two protons before collision p1,,P2,. = (cp1,i£) (0,iM) = EM. With this, 
we get finally E = 7M or the minimum total proton energy required to produce 
antiprotons is 7M or subtracting the mass of the incoming proton the minimum 
kinetic energy must be Exin = 5.6GeV. The 6.2GeV Berkeley Bevatron was 
designed to do just this allowing the discovery of the antiproton. 


Solutions for Chap. 2 


2.1 A horizontal deflection can be accomplished by a horizontal electric field 
generated between two vertical plane electrodes (x =const) connected to a potential 
V. Since the aperture is 2cm we place the electrodes at a position of x = +1 cm. 
The deflection angle is given by gy = e|E|¢/(B*E), where £ = 0.1m and 
E = 10.511 MeV and y = 20.57. With this, we get B = 0.99882 and B7E = 
10.486 MeV. The required electrical field is JE| = 1.0486MV/m = 10.5kV/cm. 
A potential difference of 21 kV between the electrodes would be sufficient. This is 
across a gap of 2cm which is possible in dry air (rule of thumb: breakdown field is 
about 30k V/cm in dry air) or in vacuum. 


2.2 Assuming no friction, the beam power is the product of the beam energy in Volt 
and the beam current. The minimum power for the charging belt motor is therefore 
500 kW. 


2.3 The particles travel from the source to the first gap where they gain an energy of | 
MeV. At the end of the nth gap they have an energy of Exinn = 100keV+n- 1 MeV 
(n = 1,2,3...). After each gap the particles travels protected from microwave 
fields in tubes for most of an rf-period. At 7 MHz the period is t,~, = 0.14286 ws. 


The velocities of the particles after each gap is B, = /1—y,? = SS — 


Exin.n-+Amce? 
which translates for potassium ions to Bp; = 0.002265, By2 = 0.007511, By3 = 
0.01038, By) = 0.01261. For electrons, the equivalent velocities are Bey = 


0.9484, Ber = 0.9807, 6.3 = 0.9899, B.4 = 0.9938. The related lengths of the 
drift tubes are now J, = cB, tr. For potassium ions this gives Lp) = 0.097 m, 
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Ly = 0.322cm, Ly3 = 0.445m, Lyg = 0.540m. These are realistic drift 
tube lengths. For electrons, the drift tube lengths would have to be much longer 
Lp = 20.315m,Ly. = 21.007m, Ly3 = 21.204m, Ly = 21.287m due to the 
higher velocities It is obvious that much higher frequency microwave sources had 
to be developed before this principle could be applied to electrons. This became 
possible with the invention of the klystron. 


Solutions for Chap. 3 


3.1 The total electron energy at injection is Ej. = (50+ 511) 10° = cp = 
yme?* = ymc’B. From (1.76) and an orbit radius of 1.23 m, the magnetic field 


on the orbit is o = Hot € = 1.520 x 10-3 [T]. The velocity can be derived from 


Y = Ey/mc? = 1.0978 and B, = 0.41260. Therefore the magnetic field at injection 
is B; = 6.273 x 10-* [T] . The field increases at a 60 Hz cycle and the particle needs 
atime of t = 27R/ch; = 6.2436 x 10-*s for the first turn. During that time, 
the magnetic field increases by AB + Byowt = 1.907 x 10~° [T]. At 20 MeV, the 
particles have gained virtually velocity of light with 6 ~ 1. The field change per 
turn at 20 MeV is now less by the factor ; for AB © 7.866 6 x 10~° [T] . The reason 
for this drop of acceleration is due to the higher velocity of the particle making the 
go-around time shorter. The acceleration is indeed slow and it takes the particles 
more than a million turns to reach the final energy. 


3.2 From (1.4) we choose Ampére’s law while ignoring the electric field and apply 
it to the betatron magnet. We integrate over an area which includes one coil. The left 
hand side of this integral can be expressed with Stoke’s Theorem as a line integral 
of the magnetic field along the boundary of the area chosen. The right hand side 
is just the total current in the coil multiplied by j1o. To be able to evaluate the line 
integral, we choose a boundary or integration path starting in the middle of the 
magnet aperture (orbit) and integrate the field (uw, = 1) along a vertical line to the 
magnet pole. This field is just the desired field Byo The integration path through the 
magnet iron to the mid plane is zero because we assume (4; = 00). Integration 
from the magnet iron along the midplane to the origin of the integration path is also 
zero because all fields in the midplane of a symmetric magnet are in the vertical 
direction while the integration path is in the horizontal direction. That gives the end 
result of Byos & = Lolo. Note, for the integration path chosen, we have to use here 
only half the gap. The field for 42 MeV electrons is By) = 523 = 0.35T and the 
total current in one coil is [.9;, = 13,926 A. This seems to be a large current, but 
actually isn’t. Magnet coils are wound of many turns and the current from the power 
supply is only Jy; = Jcoit/ Mums. if the coil in this example has say 20 turns then the 
power supply current is only [,, = 696.33 A which is perfectly acceptable. 


3.3 The microwave frequency scales inversely to the relativistic factor y or 
frtmax/f0 = Yo/y. The relativistic factor is y = 1 + 49 and the microwave 


ki 
mc~ 
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frequency is then given by a = fo. For protons mc” = 938.27 MeV and the end 


frequency fi¢ = 0.610f¢9. Deuterons are twice as heavy mpc? = 1,875.61 MeV 
and fe = 0.758 fizo. The situation for electrons is quite different. The rest mass is 
mc? = 0.511 MeV and fs = 1, 175/;¢0. The frequency swing is way too large to 
be practical and this is the reason why synchro-cyclotrons work only for heavier 
particles and at energies where the relativistic factor is not too far from unity. 


3.4 The energy loss to synchrotron radiation power per electron is given by (24.41). 
This energy loss should be multiplied with the circulating current to get the total 
radiation power. P, = 8.85 x 107° Ee f = 55.976 I. For 1W of synchrotron radiation 
power the current should be 17.865 mA. 


3.5 a.) The bending radius at 25 MeV is = = 0.32 = 2.568m !(assuming B ~ 1 

or 2R = 0.7788 m. b.) We choose a gap of 10cm. c.) The excitation current for the 
field is derived similar to problem 27.2.4 and is [.oi) = a8 = 4,259.6 A. The 
length of the coil is Lio = 27R + 10% = 2.69m and the resistance is Re = 
Pcu Sat = 1.205 10°77 2 (Pcu = 1.68 - 10-8 2m at 20 °C) The electrical power 
loss in each coil is then Pooi) = Re: Po= = 2186 W and, yes, you need water cooling. 
d.) The electrical power requirement does not depend on the number of turns, only 
on the amount of copper in the coil. If the coil is made of 7 turns, then the current 
will be reduced by the factor n. Yet, the length of the coil is increased by n and the 
cross section decreases by the factor n.To summarize, the coil resistance is increased 
by n? while the current scales like 1/n. Therefore the power is independent of n. 


3.6 The frequency swing depend on the velocity of the protons. The relativistic 


ee : 5 (me2)? +(cpo)? . 
factor at injection is yo = V — = 10.706 and at the maximum energy y = 


426.35. The corresponding velocities are at injection By = 0.995 and at maximum 
energy 6 ~ 1.000. The frequency changes only by 0.44 %. 


Solutions for Chap. 4 


4.1 From the variational principle (4.1) and the definition of the Hamiltonian (4.26) 
we have 6 is Ly) ¢ wat = —68 ri [>° giP; — HH, ()|dt = 0 and after changing to y 


this is 5 f" [S452 P; — Hy (p) ¥ |dt = 0. Therefore H, = Hy. 

4.3 We apply oo _ a = 0 to the Lagrangian (4.25) in curvilinear 

coordinates. With 62 = (+)? +2472) /c? we get % = mp + 
e (x4 2 4 pA 4 nzihs + &iA,) -e —eSandi#% = 4 (Fe 7) 


+e (x4 he +5 +298) = ymhe,2 ae Bas 4 5 Sty 4 2 Me + kxiAc) 
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—e%t . In curvilinear coordinates 


B=VxA 
tie dAy] 1 dA, a dA, OAy 
~ bi 5p eo =|: Al dz az nay]. E mal 


and we can replace os = kaa) — B,and = = 2 5 (HA: )+ hBy = = hits + K,A, + hB, 
and thereby replace the tents involving the: Sesto potential by maeneiic fields to 
give the equation of motion 4 (ymx) = ymhk,2 + e (iB, a zBy) + eE, which is 


the same as (4.16). 


Solutions for Chap. 5 


5.1 We use the Lagrangian (4.25) and evaluate (4.5) to get first a — 


dA, + JAy + JAy 
ax Ts +e dz ). Then 


ymx + eA, and second a = < (ym) + e (i 


a = ym2hk, + € (<4 x +499 a mn po oS ees 42) + eF, and the Lagrange equation is 


Syme) ag e (x4 dAx + jo +e) = pitino (22 dAy +5 43 ze) + ek, or 


after reordering and replacing - — = = B, and i — Ae = hBy the equation 


of motion is finally $ (ym) = “ym + e (By 2 hByz) + eE;. ee: into 


the Lagrange equations we get first 2 oe = aie + e (x4 ihe ye + zh =) + 


eE, = ymzh (kx + Ky) + e (+44 os +75 ie ar + hi :) + eE, and second a = 


£ (ymh?z )+eny With oad = hBy+ iGAy and 2 = —hB,+ > OAS) the equation 
saan is 4 (ymh?2) = me "hk, + ymizh (Kix i Ky) te (hByx <— hB,y) + eE,. 


5.2 Application of (5.33)-(5.30) gives with Ax = (x’A,+yAy+hA,) the 
: _ food = y) v = : L+ymc? sf 
Lagrangian L = V1—p? + e[Ax — ep and from this a 


s + “Ax — ed. san. by the momentum p = ymv and setting p = 


yinv 


L+ymc* sf == s! + (1 — 8) Ax _ d-6 


ymv ov 


PO 
1-3? 


a eS gy, The variational principle 


Ay Etyme st = As Ltyme" dz = 0, because constant additions or factors to 
the Lagrangian do not change the variational principle. 


5.3 Use the definition of the bending radius (5.3) and remember that we express the 
particle energy only in Volts, winch is E/e. For 1 GeV particles we have then E/e 
=10° V. With this the curvature is riled ~ 0. 29985 “ . In the last step of the equation 
we get a bit sloppy by using the energy in GeV instead of the voltage E/e as is 
commonly done. Numerically, however, we would ignore e insert only the voltage 
in GV. 
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5.4 We evaluate the second derivative of the solution P" (z) and insert into (5.74). 
From the first derivative P’ (z) we obtain with the property of the Wronskian (5.73) 
for the principal solutions the second derivative P" (z) = S" (z) ie D (ZC (@dz — 
C" (z) yp @S @dz. Recalling that S" = —KS and C" = —KC, we insert 
the expression for P"(z) and (5.75) into (5.74) and verify the validity of the 
ansatz (5.75). The function P(z) is therefore indeed a particular solution of the 
inhomogeneous differential equation (5.74). 


5.5 The Cartesian coordinates (x,x’) are identical to the normalized coordinates 
(w, w) except for some scaling factors if we consider the independent variable to be 
z in both cases. The transformations are then given from (5.54) by x” = —vuxtanw 


and J = ray orx = va cos ¥ and x’ = 2v sin y. With this, the Hamiltonian 


is K = vJ sin? wt Dt cos’ w. The ieaeney of the oscillator is W or from the 
aK 


equation of motion 5+ = y= =vsinr'y +2 2 cos? w which requires that D = v2 


Finally, K = vJ and K= w =v as we would expect. 


Solutions for Chap. 6 


6.1 The coil power, assuming only one turn, is P = RI’. The resistance is R = ps, 
where p is the specific resistance of the coil material (copper), L the length of the 


coil and A the coil cross section. With 7; turns, the resistance changes to R = p re = 


2 
pene, the current is x and the total power P = pen, (4) is independent of the 


number of turns. The total coil power is P = RI’. Utilizing the current density j 
the total power is now P = pLAj’. The weight of copper is W = oLA and the total 
power is finally P = p= j° depending only on specific resistance, weight and current 
density. 


6.2 The focal length of an electric quadrupole is : =k = BE gl. Solving for 


the gradient g we get g = : ee 3 = 19.81MV/m?.The profile of the electrodes is 
given by V) = —$g(x* —y*) and the profile of the electrodes is therefore given 
by V2 = 9.905 - Re = 24,763 V where R, is the aperture radius of the electric 
quadrupole. The r.h.s. derives from the potential for R, = 0.05m and y = 0. The 


electrode potentials are V) = +24, 763 V. 


6.3 For a purely aera field B = (By, By.0) = ‘4 x A. The field components are 


aA JA, 
B= == — 3 = 3 and By ans tas = — Be Here we made use of the fact 


y : 
that the fields do wot ‘depend on nthe longitudinal coordinate z. Therefore Ax, Ay are 
constant which we may as well choose to be zero A, = A, = O and the magnetic 


field is B = (, —& 0). If we derive the aes field from a scalar potential 
B = —VV, we would get = = -5 Vand 4: = - orA, = —f V dy and A, = 
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I5 dx. For a dipole _ B, = O and By = Bo and the potential is V| = —Boy + 
f(x) and (& yo = & = = -gx) . For a quadrupole we have a potential V = —gxy 
and aa = gy, = —gx. After integration A, = gy? and A, = —4gx" to give 
a aaa oa el A, = —38 (x? —y’). Following from their definition both, V 
and A,, depend differently on the coordinates but are equivalent otherwise for purely 
transverse fields. While the scalar potentials V define equipotential lines, the vector 
potential A, defines the field lines which are orthogonal to the equipotential lines. 

6.4 Such a magnet is basically a quadrupole which is displaced to generate a dipole 
field component along the path of the beam. From + = kAx the displacement 
is Ax = 0.00741 m or 7.41 mm. The sextupole term is a small perturbation of 
the quadrupole profile. From (6.86) and the definitions for ‘aon get for this 


combined field magnet the potential V = BE E ar yt kxy+ im (3x? y-y °)] = 


0.557y + 75.15xy + 640 (3xy —y’) = const. The constant can be estimated as 
follows: At a displacement Ax = 7.41 mm we require an aperture of r = 1cm 
and so we define one point (x = 7.41 mm,y = 10mm) of the profile. Inserted into 
the profile equation, we get V (0.00741, 0.01) = 0.01155285 and the pole profile 
equation is 0.557y + 75.15xy + 640 (3x?y — y3) = 0.01155285. 


6.5 Forces between magnet pole exist because a change of the field volume is 
associated with a change of field energy. Increasing the gap of a dipole magnet 
while keeping the field constant results in an increase of field energy due to work 
against the force trying to reduce the gap or the distance between poles. Therefore 
the force is attracting the pole. The field energy in the dipole magnet with a pole gap 
Gis Ey = 398 WIG = 89570-G. The force is F = —dE/dG = 89570 N= 9.13 
tons. The force does not depend on the pole gap. 


26.8 In first approximation, we assume that all the fields are contained within the 
volume between the two rows of poles and no field leaks out. Separating the poles 
by g requires to generate the additional field energy de = Fdg where F is the force 
between poles. Since de > 0 for dg > 0 the force is attractive meaning that the 


poles are attracted. The force is then F = *¢ = aa i i Bj sin* (Ong -)dz = 


dg 
A 
Tn BSE Np = 7,484 N = 0.76 tons. 


6.6 See Sect. 6.2.2 for solution. With respect to the numerical part of the question, 
we calculate the gradient to be g = i BE = 33.33 T/m and the coil excitation 


2 
current must be J.oi1 = -_ = 11937 A-turns. 


6.7 Since this is a cylindrical problem, we use the definition of the magnetic 
potential in (6.7) P(r,z) = >> C,r"e’”? from which we get the magnetic fields. 


n>=0 
From Maxwell’s equation V x EF = ub and integrating over the cross section of 
the rotating coil we get an emf of mV = —d¢/dt, where ¢ = $0 cos 2m Vt is the 


flux through the coil at time ¢ and m the number of turns in the coil. At time t = 0 
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we assume the coil to be parallel with the midplane enclosing the maximum flux 
oo = By2R. We ignore the length of the coil because the field is assumed to be 
uniform longitudinally. From the potential, we get By « el”, p = 2nvt and the 
induced voltage in the coil is mP = 22v > C,r"ei”?""". The signal voltage from 
n>0 
the rotating coil includes therefore all harmonics of the magnetic field. Usually the 
harmonics are normalized to the ideal field of the magnet at r = | cm. The signal of 
the ideal magnet of order N is mVy « 200Ni eiN2r»! and the relative strengths r,y of 


the harmonics are r,y = R"-“ ve 


6.8 A finite width of a quadrupole is like superimposing poles of opposite polarity on 
both sides of a pole. This is a symmetric perturbation with two negative poles within 
a 90° quadrant. We may complete this picture by assuming that there is also an 
additional pole superimposed on the main pole with the same polarity. We have now 
in each quadrant a main pole with a somewhat lower quadrupole field and three poles 
describing the perturbation by a multipole. In the case of a quadrupole this would 
be a 12-pole. Since the perturbation is nonlinear but symmetric about the main pole, 
we observe all odd harmonics of the quadrupole field, 3 x 4 =12-pole, 5 x 4 =20- 
pole, etc. These perturbations are due to the finite width of the quadrupole and 
have nothing to do with tolerances. We call therefore these harmonics, “allowed” 
harmonics or multipole components. 


6.9 The fields scale differently depending on the multipole field order. The 
quadrupole scales linearly and therefore By (1cm) = Bos = 0.20832T. The 
next higher order field is the sextupole field which scales quadratically and the 
normalized field is therefore B3(1cm) = Bo = 1.8727 - 10-4. Similar 
renormalizations lead to By(1cm) = 1.925- 107°, Bs;(1cm) = 1.6833 - 107, 
Be (1cm) = 1.8964- 107°, B7 (Lom) = 2.9186: 107’, Bg (lcm) = 1.875- 107’, 
By(1cm) = 2.322- 1078, By (1cm) = 1.8066 - 1077. The 12-pole and 20-pole 
components do not follow the general downward trend and are larger, because they 
are “allowed” harmonics due to the finite pole width. 


6.10 The upright octupole potential is from Table 6.6 given by =aa = 
rt (x8y—xy°) = const. From the geometry of an octupole the pole tip is at a 
radius R and angle 22.5° or at x = Rcos 2/8, y = Rsinz/8. With these values the 
constant or potential of the pole is 7 Eo y- xy’) = 1.50R*. It is actually easier to do 
the calculation for a rotated octupole, which has a pole tip in the midplane at x = R 
and y = 0. From Table 6.6 the potential is —V4 (x, y) = S454 (x* — 6xy? + y’) = 
const and the equation for the pole profile is x (x* — 6xy*? + y’) = R. The field in 
the midplane is B, (x,0) = sy 2x3, the field at the pole tip is expected to be 0.2 T and 
the octupole strength parameter is s, = sy = one = 44, 444T/m?. To calculate 
the coil excitation current, we integrate from the octupole center along the x-axis 
to the pole giving fe exidx = aR. The integration through the iron is zero as is 
the integration back along the 45° line from the return yoke to the magnet center, 
because field and integration path are orthogonal. Similar to the calculation of the 


excitation current for the quadrupole, we have [coi1 = i . R* = 1,193.7 A-turns. 


24, 
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6.11 For a pure dipole field the current distribution scales like cosy, we assume 
the coil thickness to be h (6) = hocos@ and the cutrent density at gp = 0 is from 
di (g) = I,cosg dg and AA = h(g) RAg just j = me . To generate a field of 


Bo = LoHo = 5T we need a peak current of ) = 2RH, = 2R= = 2.387 x 10°A 


A 5 . . . . . 
and 7 = 10? = ee with hg = 7.96mm. The maximum coil thickness is 


therefore about 8 mm. 


Solutions for Chap. 7 


7.1 Study Fig.7.16 in detail. Now, make a similar sketch with the cosine-like 
trajectory for the horizontal and vertical plane both ending at the focal point. Tracing 
back both cosine-like trajectories from the focal point straight to the intersection 
with the incoming cosine-like trajectories. The distance from the focal point to 
the intersections should be equal to the doublet focal lengths from (7.20). The two 
intersections are also known from light optics as the principal planes of the system. 


7.2 Transform through doublet and 5 m drift space to focal point. We need only the 
M,, element of the total transformation matrix, which must be zero to let a parallel 
trajectory entering the doublet go through the focal point such that x = Mj,\xo = 0. 
We have Mi, = 1 — TAL — Tal + tA = 0. Since we want is ne point in both 
planes, we have a second equation My, = 1 + oe + ipl + Wa = = 0. From both 


equations we isolate f; and fy for f, = /6m and fp = 5/V/6m. The total focal 


lengths in both planes are different f* = 58 = 8.4495 m and he — a = 
3.5505 m. The definition of these focal lengths is the distance of two principal planes 
from the focal point. The principal planes, one for each plane, are located at the 
intersection of the parallel incoming trajectory and the extension of the trajectories 
reaching the focal point. The principal planes are the positions of virtual lenses 


resembling the doublet. These lenses are at different location for both planes. 


7.3 The transformation matrix of a quadrupole doublet with a drift space of length 


« f L=d 
d between quadrupoles is ( a ig) where # = ; on . - i We 
et . = =< = 0.2m"! and get for the on-energy focal mae in both planes 
= * or f* = 25m. Since the combined focal length depends quadratically on 


the energy, we get a +10% spread in focal length. The beam width at the focal 
point is with f, = Smis ? = eB = (1—d/fi)’ Bo + dey = 0.64efo + S. 
The beam emittance is € = rorj and therefore the spot size in the horizontal line 
is r2 = 0.6475 +r and in the vertical plane with f, = —5 mis ze = 1.44€Bp + S 


For the last question on dispersive beam sizes set |f,|| = 5- (1 + 22) and calculate 
ky = (Ap/po) : 
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7.4 Equations (7.63) and (7.67) give the focal lengths for a wedge magnet in both 
planes: + = (tan yo + tann.)cos@ — tannotann, + sind and na = tanno + 


tan je + 15, + 5. We choose both edge angles to be the same and require that both 
focal lengths be the same as well. This results in the condition 2 tan 7 cos 6 —tan? n+ 
sin? = 2tann + =5/, which can be solved for the wedge angle 7. 


7.5 The deflection angle } = uj and we assume uy = 0. Then the betatron 
amplitude downstream from point zp is just u(z) = VBBod sin ( — Wo). From 
this it is clear that the value of the betatron function should be as large as possible 
at both points to get a large kick amplitude. 


7.6 a.) a trim coil in QF; causes a beam bump which will reach zero again in the 
middle of QF3. This is too early and we need to activate another trim coil in QD» 
to be symmetric. The beam bump arrives in the middle of QD with a slope of x’. 
A kick with an angle of 62 = 2x’ (QD,) will symmetrize the beam bump ending 
at zero in the middle of QF, where a third trim coil 63 = 6, ends the beam bump. 
Because of symmetry we need to consider only half the three FODO cells from QF, 


to QD» for which the transformation matrix be M = he aa ) . The beam bump 
m2, 22 

at QD, is then x(QD,) = m6, and x (QDz) = m26). Since x(QD,) = 0.02 

m we get 0; = 08 and 6; = —2-x’(QD,). The bump amplitude Ay is not the 


maximum bump amplitude which occurs in the QF) and QF3, 


Solutions for Chap. 8 

8.1 We start from u = /eBcos(w+ 54), calculate the derivative wu’ = 

—Je WE cos (w +6) — Ys sin(w +6), and eliminate from both equations the 

phase terms to get Bu? + 2au'u + a7u? + a = € (*). Defining a coordinate 
‘ = na ; xz : Pole: a 

transformation by w = u/ VB and w = Bu +a VE and inserting into (*) we get 

w? + Ww? = € which is the equation of a circle. The derivative w = “ and the new 


dy 
independent variable is the phase w. 
8.2 First, we transform the phase ellipse as in problem 8.1. This transformation is 
scale preserving since its determinant is equal to unity. The phase ellipse is now a 
circle with radius ./€ and therefore the area of the circle is ze. 


8.3 Write (8.41) in component form and place beam matrix elements with their 
definitions (8.38) 


8.4 The transformation matrix of such a transformer between symmetry points 


Po and P is ( Ae The transformation of Bo with aw = 0 through 


BoB 
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such a transformer is 8} = C?By + aS and y) = C7 Bo Fg 55” = i The 
dispersion function transforms through this matching section fake. a trajectory and 
we would look for 7; = Cyo + Sn = Cno. A 2/2-transformer does not work for 
the dispersion functions since C = 0. 


Solutions for Chap. 9 


9.1 The proton energy after accelerating gap i is Exin (MeV) = 1 + 0.5i. At the 
energies under consideration the protons are nonrelativistic and their velocities are 


vj = Cy/ =k = 1.384 x 1071 + 0.5im/s. The period of the 500 MHz rf-field is 


2ns and the tube lengths therefore must be ¢; + leap = n+ 2 x 107°v;, where n is 
an integer and £; >> fap. The first three sections are (n = 1) €; + gap = 3.39cm, 
ly + Leap = 3.92cm, and £3 + leap = 4.38cm. These are the minimum lengths 
of the tubes. To meet the requirement of a minimum tube length of 15cm we need 
to choose n; > 5,m2 > 4,n3 => 4. That means the first tube length is as long as 5 
wavelengths and the second and third tube length is as long as 4 wavelengths. 


9.2 Each proton travels at a slightly different velocity due to a finite energy spread. 
We assume nonrelativistic protons and get for the velocity spread oe =< 5 ae The 
time t it takes for debunching is equal to the time it take for the fastest particle to 
travel half the separation of bunches A = me where C is the circumference. The 


other half of the bunch spacing is covered by particles with = < 0. The debunching 
0 


x 7 _ LC. Eo 
time is then tT = an AE 


9.3. The accelerating rate is given by Pk = freveVie Sin Ws. We solve for the 


2 
synchronous phase and insert into (9.74) to get 2 eee 
cpo mh\nelepo 


[eos y +14 (2 arcsin a p= r) 2] 


9.4 We start from (9.52) and write it in the form of a linear harmonic oscillator 
@+aQ;=“ tos pelle @~ = O where the synchrotron frequency is 27 = 
2? sing cos Ws—sin Ws 


ei (ces 9) For small oscillation amplitudes the frequency 2 ~ Qo 
and reaches zero at the separatrix. There is also a g-dependence of the synchrotron 
frequency indicating a periodic variation as the particle travels along the phase space 
trajectory. 


9.5 The longitudinal emittance is Ang, = = ae ie 


increasing the r-votage. Since 2 % Ta, we set yaad? = /Vard? o 
1/4 
~A=f~ (Ga 2) . The bunch length scales like the fourth root of the rf-voltage. 


Foetal 0 and does not change by 


Vit 
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Solutions for Chap. 10 


10.6 We use a FODO lattice with « = /2,B = L (2 ie 2 and = & =(4 ee a) 
to construct a ring. The beam width in such a lattice is 6? = =p +7 (2e) and the 

x «(f-+8) 4 
i (2) . From the peak magnetic field : = 0.0036m7! and we can solve for L. 


The length of the bending magnet should be no more than 0.8Z and each bending 
magnet deflects the beam by % = a. To complete a ring, we need n = 277/W 


half cells. Numerically, we get 104R? ~ 0.2L + 9.75 x 10-°L? = 4 or solving for 
half the cell length L = 19.818 m. 


beam height o = cB . Since 6, < 6, we get 62 + ome = 


10.2 For a Gaussian beam the largest beam size is along one axis or in case 
of an optimum FODO lattice 6? = cB + 4 2 (6 EY? from which we get L since 


6, = eB < 6?. For a maximum field of 1.8T the bending radius and angle 
; = 0.0108m™! and yw = a respectively with the total number of cells 


Ne = 21/ (2%). The focal length of a half quadrupole is f = «kL = J/2L and 

<< J21L, = 0.05V2L? assuming that the total quadrupole length is 10 % of 
L. Numerically, we get 10*R? = 0.2L + 0.02921? = 9 and solving for half the 
cell length L = 14.463 m. The bending field is 1.8 T and therefore within practical 
limits, the quadrupole gradient is g = —- = 11.268 T/m and the pole tip field 
is B. = gR = 11.268- 0.03 = 0.338T, which is well within practical limits of 
about 1 T. The number of FODO cells is ne = 2/ = 25.141 and to make it 
an even number of cell, say 26, we may decrease the bending magnet field by a 
factor 25.141/26 to 1.740 T. Now the tunes are QO, y = 6.25 or right on a destructive 
half integer resonance. By raising the quadrupole strengths, we increase the tunes 
to say Q,, = 6.75. The new quadrupole strengths are from siny = i = 0.72815 
from which we get the new quadrupole strength k = 0.06962m~ and gradient 


g = 11.603 T/m which is still within practical limits. 


10.3. The transformation matrix of an unperturbed FOFO cell is My = 


( cosWo Bosin Yo 


it sai Coes and for n, cells forming a half ring with % = n.Wo 
0 
cosW fPosinW% 


: oe a Ne 
the transformation matrix is Mo, = M)° = ts tio ” 
—%, sinW% cos % 


. Inserting 


a drift space of length @ at the beginning and end of the half ring results in a new 
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transformation matrix with 


Hits 12 cos% fBosinW% 12 
"N01 — 3, sin Yo cos W 01 


_ er mp Sin Y Bo sin Wy + 2£ cos WF — pe) 


1 gj — £ gj 
7 Sin WY cos WM Z sin WY 
‘ : : cos W sin VW 
This must be equal to a new symmetric matrix M, = Po P and 
—z sin W cosW 


equating matrix elements on both sides we can solve for the new phase W and 
betatron function 6. The tune change is AQ = (W —W%)/z which we get from 


the equality cosW = cos % — a sin W and the change of the betatron function is 


(B — Bo) & Bo ete Since n/ = 0 at the insertion point, we don’t find a change 
in the n-function by inserting a drift space. 


10.4 Fora, = a2 = 0 the transformation matrix of an arbitrary matching section is 


2 cosy v bi B2 sin Ww 
: Bi 
TEE sin W i cosy 


betatron function between two symmetry points is By = C7B, + 55: Especially 


Mn = = ( . 2) and the transformation of the 


C's’ 


simple solutions exist if the phases Ay, = Ay, = 2/2 in which case C, = Cy = 0 
and the sine-like matrix element is adjusted such that Sey = Bix B27. We have 
made no use of any FODO parameter and therefore the matching works between 
any two symmetry points where a; = a2 = 0. 


10.5 The vertical betatron function is periodic since the vertical focusing is periodic 
from the quadrupoles. This should be true also for the horizontal betatron function 
but it is not because of the use of sector bending magnets which contribute 
to horizontal but not to vertical focusing. The dispersion function is even more 
perturbed because of the missing bending magnets. 


Solutions for Chap. 11 


11.1 The oscillation period for 500 MHz is Tp = <5 = 2.0 x 10~°s. Therefore 
the bunches are separated by 2 ns. 


11.2 First we note that J, = 2 for rectangular magnets and the synchrotron 
oscillation damping time is from (11.30) t7! = 2recy?(-4] = 1139.68"! or 
Tt; = 0.877ms. On the other hand, the energy loss per turn is U(GeV) = 
7.1685 x 10-4 GeV or 0.023895 % of the particles energy. At this rate the particle 
radiates all its energy away in 4,185 turns. To orbit one turn it takes the particle 
0.2096 Ls or to radiate away all its energy 0.877 ms. In other words, the synchrotron 
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damping time is just as long as it takes the particle to radiate away all its energy at 
the initial energy rate. 


Solutions for Chap. 12 


12.1 We use the coordinates (Ww, Acp/cpo) for which the changes per unit time 
are W = —haone (42) (h harmonic number, wo revolution frequency) and 


(42 ) = WWE) With Ve (W) the rf-voltage at the phase y and U (E) 


dt \ cpo cpoTo 

the energy loss per turn at energy E. Expanding we get eVie(w) — U(E) ~ 
e an y — 4U (Eo) pe where ¢ is the phase deviation from the synchronous 
phase Ws. The last term gives rise to damping which we ignore here and 


d ( Acp _ 1 e Vie 
dt \ cpo “  cpoTo dw 


equation we scale the coordinates ¢ —> 


gy. Following the discussion leading to the Vlasov 
Ws 


cpoTo 
e Wie/dv ly, 


cpoTo o 1 Acp se _ |e dV ee 
to get Vv edVit/dV Ty, © ~ hoone y —hoone cPo vey 0 2arcpo © dy 


da Acp _ _ hie dVir 
( ) = 0 Iaepo © dw 


Acp 1 Acp 
cPo —haone ¢Po 


gy and 


Acp 
Ys PO 


gy. The synchrotron oscillation frequency is then 
Ws 


_ hie Vir 
2acpo dw ¢ 


. To make the 


s 


just the coefficient to the coordinates 2, = wo 


synchrotron oscillation frequency zero we would need a second rf-system adjusted 


dVit 
such that av |g 


and an additional rf-system at twice the frequency of the first. For maximum 
efficiency, we phase the second rf-system such that ~ = 0 at the synchronous 
phase of the first system. The conditions are then Vysiny, = Vj, sinw, and 
Viepcos Ws = Vi; cos Ws + V2 = 0 from which we can isolate the voltages to be 
V, = Vie and V2 = Vie cos Ws. 


= 0. For simplicity of discussion we assume sinusoidal rf-voltages 


Ver 
a ly, 


Vcos ps for this problem o¢ = oz liz wn ree 7. The bunch length can be 
manipulated by adjusting the relative phase and voltages of both rf-systems. In 


general, for a two frequency rf-system the combined voltage is Vig = V, sin@ ft + 
dV ir 
av lye 


12.2 The equilibrium bunch length is given by (11.48) and is with 


V> sin (@2t + 6), where 6 is the phase shift between both systems and 


V, cos wt + V2 cos (zt + 5) with the synchronous time f, = y,/@). 


12.3 An external field acts the same on all particles in a beam. If the beam as a 
whole performs coherent transverse or longitudinal oscillations, we may consider 
the situation being just one macroparticle. One could consider an external field 
which does depend on the amplitude of the macroparticle and this field would then 
damp the coherent oscillations because df /dw # 0. This is the case for a feedback 
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system to damp coherent oscillations within a beam. The beam position is measured 
at one point and the signal acts back on the beam after some amplification. If the 
feedback system is fast enough it works for slices of a bunch where the slice charge 
is offset from the equilibrium. This is exploited in stochastic cooling to reduce 
the beam emittance. However, damping of incoherent oscillations, as we get from 
the emission of synchrotron radiation, is not possible by external fields. Here each 
individual particle has a different coordinate but the external field is the same for all 
particles. 


12.4 We use (12.106) and note that for a stationary solution w, = 0 for all n. The 
remaining coefficients depend only on a,,/D or parameters that are determined by 
the storage ring containing the beam. This makes the Gaussian distribution of the 
injected beam irrelevant. 


12.5 The beam lifetime without quantum effects is 21.4h and we allow this lifetime 
to be reduced by 10%. The quantum lifetime for all three degrees of freedom 
therefore should be longer than t3 = 577.8h. Quantum effect lifetime is given 
by (12.132) where 2x is the aperture in Gaussian standard units. The vertical 


damping time is = = any The synchrotron radiation power is (Py) = 1,155 GeV/s 
tT] 


4.4 x 10° which can be solved for x ~ 15.75 or for 4 = 5.6. The vertical aperture 
should therefore be at least 5—6 times the Gaussian standard height. The same is 
true for the other two degrees of freedom. The damping times may be slightly 
different which makes little difference on the number of “sigma’s” because of the 
fast variation of the exponential function. To cover all three degrees of freedom it is 
common to use 7—10 Gaussian standard units. 


and the vertical damping time t, = 2.60 ms. The aperture factor is then e =2 


Solutions for Chap. 13 


13.1 The damping characteristics are determined by the partition numbers and the ?- 
2 2 eS 
parameter. Horizontal motion becomes antidamped when 3 = es ules, De > 1. 
We evaluate this integral in thin lens approximation for only one half cell, since all 
others are the same and assume that kag = —kgp = k and for = fap = ¢. Then 


0 = «(}+i) + 20(h—7)k. Furthermore f + 7) = KL and (f-H)k = KF. 
With this the -parameter is finally 7 = 24 (< + 1) . It follows from ¢ « L that 


v > 1 and that the horizontal betatron function is antidamped. It is only because 
of adiabatic damping during acceleration that the beam does not blow up. If the 
acceleration is too slow the beam emittance will grow. 
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Solutions for Chap. 14 


12.2 For Py | =ife rakes where (H (z)) = (67)+ (2ann’)+(y7n") we need to 


formulate an analytical expression. Since all bending magnets contribute similarly, 
only one has to be evaluated. We estimate the average values from known solutions 
in the middle of FODO-cell quadrupoles (10.3), (10.74), and (10.5). Transforming 


through the QF nop = —fi/f. Non = —n/f and (Bn?) ~ 4 (Bnd: + Bgo) = 


1 For (2a7n') we approximate (yn) ~ itn = — (n!\ & ma = > and 
= 
4 
witha = —5 a~) —5 b-B = we get for the second term (2an7') = -& ae 
a 


Finally, we note that y (i is constant in a drift space (bending magnet) and is from 
matrix transformation at the exit of QF yor = > B =const. With this (y 1) ~ 
in ee 3 «2 (4x2 : K-(4«-— 
(anh + t) sia = - pee Collecting all terms (H (z)) ~ - ee and 
the minimum is reached from dH/d« = 0 fork ~ 1.071. The associated FODO 

phase advance per cell is w + 138°. 


12.6 We use an energy of E = 2 GeV and an optimized 90° lattice for which from 
Fig. 14.5 (H) / (p02?) ~ 3. We also assume that only 75 % of the ring is occupied 
by bending magnets and therefore €,/l5,0 = 0.75. poling (14.25) for the minimum 


bending angle © per bending magnet we get 0 = Ee oy a 7065) ee 180 = 3.845°. 


To compose a ring, we need at least 94 bending magnets or quadrupoles to reach a 
minimum beam emittance of €, = 5 x 107? m. 


Solutions for Chap. 15 
15.1 We use the perturbation P22(z) = svaBr! mw, in the equation of 
motion wg + Vows = a a and get after reordering the equation 


wg + v5 (1 a 5B°/?mwg) wg = 0. On average, the sextupoles do not contribute 
in linear approximation to a tune shift because (5B >mwe) = 0. In higher 
order, however there is some tune shift which we can expect from the fact 
that the contribution to the tune shift is greater while wg > 0 compared to 
the case when wg < 0. The P2; (z)-term contributes to a tune shift because a 
beam passing on a distorted orbit w. through a sextupole feels a quadrupole field 
component and therefore a tune shift. The equation of motion shows this directly 
wp + v6 (1- 5B°/?mw-) wp = 0 and the tune shift is dv ~ 5% (B°/?mw-). 


15.2 At the bending magnet exit the dispersion is D = p (1 — cos ts) and the slope 


D! = sin® > Extrapolating linearly back, we expect the dispersion to start with a 


slope of DI at a distance s before the bending magnet exit. Therefore, s sin os = 
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p (1 — cos ts) and using & < | we gets 5 ep demonstrating that the dispersion 
function seems to start in the middle of the bending magnet. 


15.3 With errors Ave = @andA = 8 the vertical n-function is n(z) = 


2 sin rv 

Af Az JB @cosv[y()-—e@+xldz = AY, JB cosv[y (2) - gi + 7] 
where we have omitted the index ,. The expectation value for the dispersion 
function is for N statistically distributed errors (7) = >; 8:9? cos? vr = 
t B(2N, Bo? cotvz where og = sf (62) is the rms deflection angle due to 
misalignment errors and B the average value of the betatron function at the location 
of errors. For random quadrupole misalignment errors 0, and ¢ the length of the 
quadrupole og = k€o, and for bending magnet rotational errors oy the rms errors 
are 0g = See, Use numerical values from lattice #3 in Table 10.1 to estimate the 
actual expectation value of the vertical dispersion function. 


15.4 The transformation M through both bending magnets and the drift space L 
between them is with D = p (1 —cos@) 


1L+2€—(L+2)sin6 


M=10 1 sin 0 
0 O 1 
1é£ D 1L0 1 -—D 
= | 01 sin@ 010 0 1 —sin@ 
00 1 001 00 1 


From this, we get the dispersion at the end of the second bending magnet D = 
—(L+ £)sin@ = —d and D’ = 0. 


15.5 The tune change due to quadrupole field errors is Av = ~ >; Bi (AK 0) 


and with o, = J ((460") the expectation value is o, = Na Bok. To have a 
96 % probability to avoid an integer or half integer resonance 20, should be less 
than 0.25 or oy < 0.125. With numerical values Ng and f one can solve for 
the rms quadrupole field tolerance o,. Manufacturing tolerances resulting in non- 
parallelism of bending magnet poles cause gradient field errors as well. If the angle 
between pole surfaces is a then the field will be B (x) ~ BoGo_ ~ BoGo (1 + ax). 


Go+ax 
From this, we extract the gradient field error 6k = <e and get the rms tune 
shift oy) = Np Bp seq Ta where Gp is the nominal gap between magnet poles, 


£/p is the bending angle, N, the number of magnets and Bp the average value 
of the betatron function in the bending magnets. The total allowable tune shift is 


2 


theno, = , eae +0;, < 0.125 determining the gradient field and parallelism 


tolerances. 
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Solutions for Chap. 16 


16.1 We use an energy EF, a FODO half cell length Z and a bending fill factor 
of 50% or 4, = L/2. We assume that there are N = 21 FODO cells. The 


natural chromaticities are in both planes from (15.105) A&,, = ot per FODO 
half-cell and the sextupole strengths are msgpl, = —*— and mspl, = 


i(b—B) © (B-B)" 
The phase differences between similar sextupoles (SFs or SDs) are Ag = 


27/N. The stop-band width Av? = -7 ie ca By! * mes" dy] becomes with 


Vx0B.dg =dz and replacing the integral with sums over all SFs and SDs 


Ay?) —_=%0 ¥ ( me, erokAp 4 Bs? mls| etnalit1/2)49) . The sums 
stop 47./Bo = a Sl on 2 | as 
Net exp(i67 Vx0) 3 el 
become Pp» enh ie = apterigM and exp(iz7 Ay) Yo exp (Bvi0kAg) = 
=0 k=0 
30 exp(i67 Vx0)— (3) _ =x9AE 3/2 _ 
exp(i= ys cer oe ;- Finally, the stop band width is Avsiop = 7 7 Ez ») 
53/2 : ‘énvig)= : 
: i a oP (i3 2) oe . Close to the third order resonance 
n( B-— x 
3700 oe exp(i6mvx)—-1 l6md5v 
(V0 © ~w 3+ dv) we set exp (iy )x exp (iz a) and exp(i6z1v,;9/N)—1 ™~ expGl8x/N)—1 


and get Aue  xodv. For finite betatron amplitudes xo the stop band width scales 
like the tune distance dv from the third order resonance. 


16.2 In the definition (16.33) we notice that |m| < n and therefore there are odd 
values for m if m is even and vice versa. That means for n even there can be a value 
m = 0 while for an odd n we have m ¥ 0. Only terms Cym = Co 4 0 give rise to a 
tune shift in this approximation which is the case for even values of n only. 


16.3 We write (16.33) like cos" = Cao + cue + ege2” +... + cane”. on 
the other hand cos"p = +(e” +e)" = F (ei + ae DY +...) . In both 


equations there is only one term ei”. Therefore Cyne" = wen’ and Cnn = a 


Solutions for Chap. 17 


17.1 First, we find from (6.95) all second order terms in 5. The term —K 8” together 
with C(¢) = cos(é/p) and S(¢) = pcos(¢/p) the perturbation P(z,6) = 
—8? f ky |S (2) C (6) — C(2) S' (| = —6?p (1 —cosz/p). Since no slopes are 
involved, T1566 = C166 = —p (1 — cosz/p). We could have guessed this result, since 
the perturbation term is the same as for the dispersion except for a factor —é. This 
second order term is therefore just the second order chromatic perturbation. 
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17.2 From (6.95) we collect quadratic terms and with x = Cxp and x’ = C’xo we 
get the desired perturbation p (§|x>) = [(—3 — «3 — 2«xk) C? + $C? ] x5. Note, 
that we ignore the x’-term since we exclude non-constant strength parameters. With 
this perturbation, the integral (5.75) 


i= / “p(ék2) [S@C(O-COSs (Oat 
1 : I 5 
= (-5m- Ke — 24.) a [kS? ++ (1—C)] + - [2(1 — C) —ks”] 


where C, S are the principal solutions of linear beam dynamics. 


17.3 We consider a ring made of a total of N. = 61 FODO-cells, each with a phase 
advance of 90° and the tune is then v = 15.25. This ring has 61 QFs and 61 QDs. 


The betatron functions in the middle of the quadrupoles are 6B = L (2 am v2) and 


the y-function 7 = ~ (4 =e V2). The natural chromaticity &.) = -* and the 
change in chromaticity due to sextupoles 


Abwy =  [(Bnmls)oe + (Brmls)oo]| 


Nei 
= Ey [+ (10 % 6v2) mores + (10 = 6v2) moots | = Fkpy. 
After evaluation A&, = 18.485maprls + 1.515magpls = =e and Aé = 
—1.515marl, — 18.485 1mepls = 8, which can be solved for the two sextupole 
families mgrls = —mopls = wae We use this result for (17.45) and get 


> m£iB>” exp (iyi) = |me,p*/| = |B? | on exp (iz/4) #4 O because 
of cancellations within every four cells. Only the contribution of the last cell is 
uncompensated. If we place non-interleaved sextupoles in pairs 180° apart there is 
total cancellation and the driving term is zero. 


17.4 The tunes of the ring are in both planes v = 15.25. Equations (17.45)—(17.49) 
exhibit mainly integer and third order resonances. A proper choice on the tunes can 
minimize these aberration terms. A tune of 15.25 is close to a multiple of 3 and 
therefore efficient at driving a third order resonance. A better tune would be say 
16.25.16.75 or 17.25. One could argue that 16.75 or 17.25 might give the lowest 
driving terms, although this ignores the variation of sextupole strengths and betatron 
functions. To find the minimum driving terms (17.45)—(17.49) must be evaluated. 
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Solutions for Chap. 18 


18.1 The longest wavelength belong to the lowest TM and TE-mode. The lowest 
TM-mode is TMo;. We use (18.44) with B, = 0 and note that E,, meets the boundary 
condition by definition, E, = 4 o the walls because m = 0 and E, = 0 at the 
wall for Jo (k-R) = O or A, = + ak. The longest TM-mode wavelength that could 
propagate in this tube is about 30 % bigger than the diameter of the tube. The TE- 
mode fields are given by (18.44) with E, = 0. The magnetic field B, « J, (k-R) 
just like E,. The radial electric field end normally at the wall, E, = 0 for Jj (k-R) = 
0 or k.R = 1.84. Solving for the cut-off wavelength A, = oak which is about 
30% smaller than the tube diameter. Therefore, we may conclude that wavelength 
longer than the tube diameter do not propagate. One big consequence of this is that 
electron bunches with a bunch length of 1-2cm or longer do not emit coherent 
synchrotron radiation because the wavelength would be of the order of the bunch 


length or vacuum chamber dimensions or longer. 


18.2 A change of temperature would change all linear dimensions of a cavity as 
determined by the temperature coefficient of the cavity material, in this case copper. 
The fundamental cavity frequency is inversely proportional to the radial dimension 
a, of the cavity (18.48). From a — — Hs = nrAT and Af, = —16.6- fie ATCC). 


44] = = 440-6 = 166 


Assuming that the cavity is dimettinccd for 500 MHz 
10° - AT and the temperature tolerance is +0.06 °C. 


18.3 The cavity length scales like d « ae the diameter like a; « ee for a pill 
box cavity the transit time factor is en of the frequency, the quality factor 
Q « f¢, the shunt impedance R, « wie the specific shunt impedance r, « ./f¢ and 


cavity fill time tp « re 


18.4 Such high fields can be sustained only in pulsed linac systems. The wall losses 
per meter of accelerating structure are from (18.90) Pw = 5.6 MW/m. This is too 
large for financial and technical reasons because it would not be possible to cool 
that much power from one meter of structure. An electron linac is pulsed with 
a pulse length of say 2.5 1s and a rep rate of say 100s~!. The duty cycle is the 
2501s per second or 2.5 10~*. The average wall losses are therefore 1.4 kW which 
is manageable. 


18.5 The capture dynamics in linear accelerators depends on the particle velocity 
and initial phase at the time it enters the accelerating section. Particles can be 
accelerated even if the initial field is negative for example in the area 90° to 
180°/ — 180° to —90°. Between 90 and 180° initially low energy particles travel 
slower that the rf-wave and quickly fall back into the accelerating phase before 
they loos all energy. In the accelerating phase they get accelerated very fast and 
reach high energies. Higher energy particles do not reach equally high final energies 
because they are faster and will not fall back from the negative acceleration so 
fast and lose much of their initial energy before they get accelerated again in the 


972 Solutions 


accelerating phase. Particles starting at < —90° will stay longer in the decelerating 
phase and loose all their energy. After that they get accelerated/decelerated at 
random. The optimum phase should be wo = 0° and particles with higher initial 
relativistic energies gain the highest energy and stay at this phase. Lower energy 
particles at 1-2 MeV are not relativistic yet and therefore fall back from the crest 
of the wave gaining less energy. This is the regime of a buncher section where the 
phase velocity of the rf-wave is reduced to match that of the particles. 


18.6 We excite the prebuncher to a total effective voltage of Vie and the linear 
part scales with 3 GHz phase like V;, = Vit@so9. For simplicity, we assume the 
electrons to be nonrelativistic and the velocity deviation 6v from the Ey = 100keV 


j Exin = Ve ~ Ver 7 
reference particle 3H OU = an. = 1+ a eal | ~ 5B, 9500: A distance £ 
downstream from the prebuncher the reference particle arrives at a time t = €/v9 = 
£/./2E/m. During this same time a particle at phase 599 must advance or fall back 
with respect to the reference particle by the distance Att ps9 to arrive at the same 
time as the reference particle. Therefore, we require tév = Att 599 and the shortest 
Act 2Eo 
. . . ae eV . . . 
resulting bunch length is zero independent of the original bunch length. In reality, 


the sinusoidal variation of the prebuncher voltage results in a finite S-like bunch 
distribution in phase space. We may over-compress the bunch by going slightly 
beyond the distance ¢ thus getting a slightly higher intensity within a finite phase 
distance of, for example, +12° at 3 GHz. 


18.7 The no-load energy gain is Exi, (MeV) = 10.48,/P) (MW) = 40.6 MeV and 
with beam loading this energy gain is reduced by AE = —37.47 ip = —0.75 MeV. 
The fill time of one linac section is te = 0.73\s and the rf-pulse remaining 
for beam acceleration is 1.77 |1s. The beam pulse is therefore t = 1.77 1s long 
with an energy gain of Ejoaa = 39.85 MeV. The total beam energy is then e, = 
(E\oaa/e) ipt = 2.0J. On the other hand the total rf-pulse energy is e,¢ = 37.5J and 
the linac efficiency is then 7 = 5.3 %. If only one pulse is accelerated as is the case 
for a linear collider or a x-ray laser the total beam energy is ey = npEkin = 0.065 J 
and the efficiency is 7 = 0.17 %. 


bunch length is obtained at a distance = 


-. In this ideal (linear) case the 


Solutions for Chap. 19 


19.1 The synchrotron radiation energy loss per turn is U = 93.0keV, the beam 
current is J = 68.0 mA and the radiation power Psy, = 6.32 kW. The energy spread 
is = = 0.0727 % and the required acceptance 6% = 0.436 % for a beam lifetime 
of at least one hour. From (9.64) we solve for the function F(g) = 1.202 and 
q = 1.902. Ignoring beam loading the required minimum rf-voltage is Vip = qU = 
176.9kV. Including the cavity power Ppy = 1.86kW, the total minimum rf-power 
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needed is Pi, = 8.18 kW. Beam loading will change these parameters. From (19.35) 
the optimum coupling Bon = 4.40. 


[o.e) 
19.2 We express the current in an even Fourier series J (t) = Sao + ¥ a, cos (nwt) 
n=1 


and assume only one circulating bunch of charge Q between —tr <t< 5T where 


Tt < Tp is the revolution time and the average current J) = Q/t. The current is 
CO 
: 1 p3To = 
(Oo= ¥, & elkot — cy +2 > cn cos (nwt). Here cy = To ae Me kor dy, 
n=—oo n> 
—iwk4 T 
After integration c, = lox -— Je e*dx = yt = = [,. The coefficient a, = 


2cn = 21, for all values ofn = = 0.123% 


Solutions for Chap. 20 


20.1 From (20.95) we calculate the coupling from statistical errors to be Kims = 


(k?) = 5 (5k £q) \/ BxByNq where 5k is the strength of the error, the length of 
the error is £ and N, = 122 the number of errors. For 90° cells, the quadrupole 


strength has to be k = 2/ (V2Lt4) = J2 and 8k = kéa. In the lattice we 


have N, = 122 quadrupoles, for 90° cells BxBy = 21’ and the rms coupling is 
Kyms = 38.8 da. In the sample lattice we have equal tunes v, = vy and therefore 
the emittance coupling is always 100 %. To get a finite emittance coupling we must 
separate the tunes by fine adjustment of the quadrupoles and choose, for example, 
tunes like v, = 15.20 and vy = 15.15. The emittance coupling is from (20.117) 
& K 


= F272 = 0.01. Solving for the coupling, we get k ~ 0.005 which determines 


the fouuoual alignment tolerance of the quadrupoles to 60m; ~ 0.13 mrad. 


20.2 From the graph we obtain vy — vy = [vn _ vy ~ 0.05. With this and A, = 0 
at resonance (20.120) can be solved for the coupling resonance to be k + 0.1. The 
coupling coefficient is just the narrowest distance between the tune measurements. 


Solutions for Chap. 21 


21.1 The beam-beam tune shift is 6v = Bi, A (kl). We split the ring at the 
collision point and insert half of the beam-beam focusing on either side of the 
symmetry point. With the transformation matrix of the unperturbed ring being 


Mo we get for the perturbed ring M = PMoP, where P = ei ") ; 


COS 277 V9 * sin 27 vo cos 27v * sin 27 v 
Mo = ( . Poy ) ana = ( : P; ) 


pe sin27 V9 cos2z V9 — BF sin2av cos2mv 
Oy y 
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Performing the matrix multiplications, we look for the determination of the 
perturbed betatron function BS at the collision point only for the terms C = 
cos 27 V9 — 5A (kl) Boy sin2zvo and S = Boy sin 27 vo. The perturbed betatron 
function is 


2 2 
p= (cos 2 V9 — tA (k€) Bo, sin 2v0) Boy + (85, sin 2nv9) Be or the linear 
) ) ; i 


y 


x —276dv sin 27 Vo. 


: . ABs 
change of the betatron functions is _ 
Oy 


Solutions for Chap. 22 


22.1 To calculate the bunch charge we need to calculate the revolution frequency 
fo = 1.199 x 10°s7!. A bunch current of 4, = 50mA resembles therefore a charge 
of Q = 41.70pC. The induced voltage is then for the BPM Ving = 2.55 V and the 
power into the 50 Ohm terminator is P = Y = 0.130 W. This is very small but 
so are the BPMs which are very well insulated and continuously heat up. For the 
bellows these values are Ving = 2.552 V and the power is from (22.20) Pioy = 
0.128 W. Again this power is small enough without special cooling. However, this 
small impedance could only be achieved by shielding the inside of the bellows by 
smooth strips of flexible metal. An unshielded bellows has a loss factor of about 
ky = 50 V/pC. In this case the power would be P,,,,, * 105 W which is too much 
to tolerate in a bellows. With shielding the bellows looks for the beam smooth while 
pumping is still possible through the small gaps between the metal strips. Similar 
techniques are now common practice to keep the impedance small in all components 
of a modern storage rings. 


22.2 From (22.93) we get a—ib = _ ae roe = a—ib. Furthermore 


= dal af —a 4 and (8? +a Zi)? — = = Gate +2; = 64 267a“ 4 + (a%)° . From 


: 2Zr __ 94 oe an a2 p — = _— a 
this a“ = 6° + 26°aS and 4 = shy — 5, or 7 =A% iA = aye 


Solutions for Chap. 23 


23.1 The geometry of the field lines in the particle system may be expressed by 
x = tan a-z were a is the angle between the field line and the z-axis. In the laboratory 
system the z-coordinates are Lorentz contracted and the equation of the field lines 


becomes x* = a og 


23.2 The Cherenkov condition is Bnai;cos@ = 1. For electrons 6B (1OMeV) = 
0.99869 and B (50 MeV) = 0.99995. The Cherenkov angle for 10 MeV electrons is 
imaginary (cos @ = 1.001). In order to preserve energy and momentum, the electron 
energy must have a minimum energy such that n6 > 1$. For 50 MeV electrons 
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this condition is met and the Cherenkov angle is Och = arccos lapess oe) — 


1.214°. 


23.3 The Cherenkov angle is 6c, = — arccos [pes ) = 50.673°. This 
radiation will meet the other side of the plate at this angle with respect to the surface 


normal and will be totally reflected, because according to Snell’s law (3h = 12) 
transmission is limited to a maximum angle of nz sin 6 or 02 < 36.0°. The radiation 
continues to be totally reflected until it reaches the small side of the plate in which 
case the incident angle is now 39.327° which is still larger than the angle allowed 
by Snell’s law. This is an ideal material for a scintillator in high energy physics 
experiments to detect secondary particles. In this case the radiation will eventually 
reach the surface of a photomultiplier and escape the plastic. 


23.4 The revolution frequency is frey = c/C = 1MHz and the total number of 
particles orbiting is ne = 1.5604 x 10!? electrons or 3.1208 x 10° electrons per 
bunch. The photon pulses reflect exactly those of the electron bunches. We assume 
a uniform electron distribution over the bunch length. Therefore, there is a 1 cm long 
photon pulse every 0.6 m or one 30 ps photon pulse every 2 ns. 


23.5 We use the uncertainty relation AxAp = Ax-hk > h or Ax > 1/k and 
the characteristic volume of a photon may be set to Von © (Ax)? = (2), The 
average electric field within this volume is from the photon energy hw = mE Von 


or E = k*./2eohc. For a 0.1238eV photon (CO; Laser) the wavelength is 10 1m 
and the average electric field is E = 2.96x 1077 V/m. In the case of a 10 keV photon 
the field is E = 1.93kV/m. 


23.6 The relativistic Doppler effect is w*y (1 + B.n*) = wo and for the classical 
case we set y = 1, n* = —cos¥ and B = v/vo, where vp is the velocity of the 


wave (light or acoustic). The relative Doppler shift is then 7 = 7 cost. 


23.7 The energy loss per turn is from (24.41) Up = 20.32 keV and the total radiation 
power P = 20.32kW. In case of muons, we have the mass ratio m,/me = 206.8 
and the energy loss is reduced by the fourth-power of this ratio to become U,, = 
11.1 eV, which is completely negligible. Besides electrons and positrons there are 
no other particles that would produce competitive synchrotron radiation. 


23.8 The maximum photon flux occurs at a photon energy of about ¢ = 0.286¢, and 
S(0.286) ~ 0.569. To find the 1 % photon energy we use (24.61) to scale the photon 
flux and have 0.777,./x/e* = 0.00569, which is solved by x = 5.795. Appreciable 
radiation exists up to almost six times the critical photon energy. 


23.9 From (24.51) the beam energy is E = [0.4508 ¢.(keV)p(m)]! = 
2.0035 GeV. The magnetic field necessary for a bending radius of p = 1.784m 
would be B = 3.75T, which is beyond conventional magnet saturation. Either 
superconducting magnets must be used to preserve the ring geometry or a new ring 
must be constructed with bending magnets which must be longer by about a factor 
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of 2.5 for realistic magnetic fields, but then the beam energy must be increased to 
keep the critical photon energy. 


Solutions for Chap. 24 


24.1 In the horizontal plane the radiation distribution is uniform and an angle of 
Ay = 0.2 mr will produce a photon beam width of 1 mm at a distance of 5 m. The 
critical photon energy is €. = 563eV and ¢/é. = 0.00022. For the IR radiation 
the vertical opening angle Ag = 11.3mr (aq >> 1/y) and the source length 
L = 0.045 m. The total source height is oy = J/0.117 + 0.1077 = 0.153mm 
and the vertical divergence ojo.7 = 14.9mr. The photon flux for A = 10 \1m and 


S(0.00022) = 0.0805 is we = 1.275 x 10!° photons/s/mr/100 %BW. The photon 
(aia/ov)ay _ 


WT Oicr, rot y! y 


brightness is then B = = 1.162 x 10!* ph/s/mm7/mr?/100 %BW. 


24.2 The probability to emit a photon of energy € in a unit time is 7 (€pn) = ee Sa), 


: _ ey 55hc = = 
We are looking for the case ¢ = o, = 2 V GSime2I.p = 10.9MeV. For e. = 


3nck = 19, 166eV, the ratiox = 2 = },/ eee — 277.54 > 1 and B= 
23, 826 =... The probability becomes with this 71 (epn) ~ 2x 10~°6! We may, safely 
assume that no second photon of this energy will be emitted within a damping time. 
Energy is emitted in very small fractions of the electron energy. 


24.3 From (25.146) we get the number of photons emitted per unit time to be 
Non = evs = = 3.158 x 10°r and per radian ftp, = 0.01063y ~ 7h. 


24.4 The critical photon energy is ¢& = 38.04keV and e/e, = 0.21. The 


universal function is $(0.21) = 0.5625 and the photon flux oe = 3.1185 x 


10!? photons/s/mrad. The vertical opening angle /2709 = 0.251 mrad resulting 
in an effective beam height at the experiment of Y = 3.77mm. A beam size of 10 
jum at 15 m corresponds to an angle of 0.667 |1rad at the source. The total photon 
flux into the required sample cross section is then Non = 5.53 x 10° ph/s, which 
is more than required. For a still higher photon flux one might apply some photon 
focusing. 


24.5 From problem 24.1 L = 0.045 m and the diffraction limited source size and 
divergence are o, = 0.107mm and o, = 14.9mr, respectively. This is to be 
compared with the electron beam parameters (Ob.x5 Ob,y) = (1.1,0.11)mm and 
(iss Ov,y’) = (0.11,0.011) mr. There is a considerable mismatch in the x-plane 
with o,//2 = 0.076mm and o,//2 = 10.5mr while in the vertical plane 
the mismatch is small. In both planes the diffraction limited photon emittance is 
€ph.x,x = 797nm, which is much larger than the electron beam emittances in both 
planes. The 10 1m IR radiation is therefore spatially coherent. 
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24.6 The instantaneous radiation power is given by (24.32) as P, (GeV/s) = 
379.35B°E”. The total energy loss of an electron due to wiggler radiation power 
can be obtained by integrating through the wiggler field for AE(GeV) = 
189.67 PES and the total radiation power for a beam current J is P,(W) = 


632.67 BPE? Ly. 


Solutions for Chap. 25 


25.1 Integration of (25.61) over g results in factors 27 and x for the two terms 


in the nominator, respectively and we have the integrals 27 14 aa g@— 
sin? 0 = 42 : $ Ss . = 
a (1—’) fo aaa — nf _.dd = 4my*. With this the radiation power is Pio = 


2remer Ee which is (24.34). 


25.2 The vertical opening angle is 1/y = 0.085 mr and therefore all radiation will 
be accepted. The spectral photon flux into an opening angle of AY = 10 mr is 
Non = CyEI“28 (w/@.) Ay and with a critical photon energy of é. = 23.94keV 
the spectral photon flux from an ESRF bending magnet is Non = 4.75 x 104. 


€ph (keV) 
s( 23.94 i) 


25.3 We use (25.88) and get with § = 42(1+y767) 


*? for the photon flux 


.  &wlo% 2922 | K3/3(8) 292 KP/4(E) 
at p% the expression “T5q,/ anu = (1 +70’) 2,0) + Ty #0 ~ 
0.1. Solving for 6 gives the ae at which the intensity has dropped to 10%. 

4/3 

: d2W(10 %) y?02,—s4(1/3) @ _ 

For low frequencies aOan hana pes 1+ Teyte? BIT IG/3) D 
2) 292 

and for large arguments a wn | ew peared Sepoiiee) 


d2da ae seo a/1a 202 exp| 2 (14 yer)? = p. 
All expressions have to be evaluated numerically. The angle at which ia ee 
radiation intensity has dropped to 10 % is from (25.97) given by EWOOR) ja to — = 


aan (1 i. 3 or) = p, which can be solved by y = 1.390 for p = 10%. 
Y 


Solutions for Chap. 26 


26.1 The amplitude of the oscillatory motion in an undulator is from (26.6) 


ay = A = 0.581 um. The longitudinal oscillation amplitude is from (26.5) 


ay = rar = 0.053 A. Both amplitudes are very small, yet are responsible for 


the high intensities of radiation. 
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26.2 The focal length of a single pole end is given by (7.75) a = oe = 
2.58 x 10-> m7! and for the whole undulator F = 5 . 2N = 0.00258m7! or 
y, 4 Pp 


fy, = 387.60m. This focal length is very long compared to the focal lengths of 
the ring quadrupole, which are of the order of the distance between quadrupoles. 
Typically, the focal length of any insertion device should be more than about 50m 
to be negligible. The wiggler magnet with K = 5, on the other hand, produces 
a focal length of f,, = 15.50m which is too strong to be ignored and must be 
compensated. The difference comes from the fact that its the deflection angle which 
is responsible for focusing and a «x @?. Focusing occurs only in the nondeflecting 


plane and = =0. 


26.3 This result appears nonphysical, yet it is correct, but requires some interpre- 
tation. If the experimenter uses all radiation into the forward cone the photon flux 
is independent of N,, but as N, becomes smaller the bandwidth increases. That is 
not as experimenters generally use synchrotron radiation. They use spectrometers to 
cut out a very small bandwidth be <K = and in this case the useful band width is 
independent of N, and the photon flux is proportional to Np. 


26.4 To solve this problem, we do not rely on exact calculations, but are satisfied 
with the precision of reading the graph (Fig. 24.5). We also use iterations to get 
the solution we want. The fundamental flux drops below 10% for K < 0.25 and 
we use this value to get 15 keV radiation. From the definition of the fundamental 
photon energy the periodlength is A, = 3.0cm. To generate 4keV radiation we 
need to change K enough to raise the factor (1 + $K*) from a low value of 1.031 
by a factor of 15/4 to a value of 3.87 or to a value of K = 2.4, which corresponds 
to a field of B = 0.857 T. Unfortunately that field requires a gap of 8.1 mm which is 
less than allowed. We have to increase the period length to say A, = 3.0cm, which 
gives a maximum photon energy for K = 0.25 of ep, = 12.9keV. We plan to use 
the 3™ harmonic to reach 15 keV. To reach Eph = 4keV we need K = 2.16 and 
a field of B = 0.661 T which requires an allowable gap of g = 11.7mm. We use 
the 3 harmonic to reach Eph = I[5keV at K = 1.82. With this result we may even 
extend the spectral range on both ends. 


26.5 In the electron system the wavelength of the laser beam is Lorentz contracted 
by a factor of 1/(2y), where the factor of two is due to the fact that the relative 
velocity between both beams is 2c. The wavelength in the laboratory system is 
therefore A = ro] since K < | for the laser field. 


26.6 The maximum transverse oscillation amplitude is 4.57 tm and the transverse 
velocity in units of c is just equal to the maximum deflection angle BL = 0 = 
K/y = 0.38 mr. The transverse relativistic factor y ~ 1 + 7.22- 1078, indeed 
very small, yet enough to start generating relativistic perturbations in the transverse 
particle motion. 
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26.7 The fundamental wavelength is 4 = 2 (1 + 5K? + y?0?) and for # = 0 we 


have the fundamental wavelength 4 = 10.88 A. The natural bandwidth is fe — =_ 
P 


and we look therefore 10% an angle 3 such that the wavelength has increased by no 
= 0.09 and solving for d.we get 3 = 462.6 jrad. 


more than 9 %, or Tare 


26.8 In first approximation, we assume that all the fields are contained within the 
volume between the two rows of poles and no field leaks out. Separating the poles 
by g requires to generate the additional field energy de = Fdg where F is the force 
between poles. Since de > 0 for dg > 0 the force is attractive meaning that the 
poles are attracted. The force is then F = Pa =o tee B? (z)dz = 47,024N 
= 4.70 tons. 


26.9 In the electron rest frame energy conservation requires A@ + mc? = hw! + 
c*p* + mc? where hw and hw’ are the incoming and outgoing photon energies, 
respectively and cp the electron momentum after the scattering process. Solving for 
cp we get cp? = h? (w— a’) + 2hmc? (w — w’). For momentum conservation, 
we hae that hk = hk’ + p with the angle 0? between k and p. From this we get 
2p? = (ho) + (hw) — 2h? ww! cos &. Comparing booth expressions ot cp we 
get —2hww'! + 2hmc? (w — wo!) = —2h wo! cos 3 or 4 A (1—cos?) =4-1= 


@ 
= A .We look for radiation emitted in the forward direction or for } = 180° and 


get for the scattered wavelength A’ = A because tite ~ 4.8 x 10 < A. Note, 
that all quantities are still defined in the electron rest frame. The wavelength of the 
undulator field in the electron system is A = Aj/y, where now L*the laboratory 
system of reference and i scattered radiation in the laboratory system due to the 


mc 2 


Doppler effect is A* = (i + 4}K°), which is the expression for the fundamental 
wavelength of andulator pociaticay 


26.10 The photon flux density in the forward cone from a given undulator and 
storage ring is proportional to (26.61). For the fundamental the maximum value 
is reached for K = 1.1985 with an amplitude of A; (1.1985) = 0.381. An intensity 
of 90 % has an amplitude of 0.343 which corresponds to K-values of K = 0.873 
and K = 1.642. 


26.11 Equation (26.54) is for } = 0 proportional to (-K x) and we look at 
the factor )>, = S°Pr 5 Jen (U) [Je-am—1(V) + Jk—2m-+1(v)] to select harmonics. 
Here u = 2S andv = <C. Furthermore C « # = Oand S x K’B. Therefore 
v = Oand [Jp—2m—1(0) + Je—2m4+1(0)] = 0 for all indices which are non-zero. Only 
for k = 2m + | are the indices equal to zero and [Jo(0) + Jo(0)| 4 0. Finally 


>>, = 0 for all even values of k and }>, 4 0 only for odd values of k. 


26.12 To get elliptically polarized radiation we look at (26.51) and try to find one 
real term together with an imaginary term in the other plane to be non-zero at 
the same time. We also remember that undulator radiation is emitted only at odd 
harmonics k = 2m + 1. From (26.51) we note that the ix-terms are both non-zero 
but the pBo-term has a factor J,—2m(v) for which v = 0 in the forward direction and 
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J—2m(0) = 0 for all non-zero indices. Since k is odd J;-2,(0) = 0 for all odd values 
of k and with it the }Bo-term is zero too. Therefore there is no elliptical polarization. 
Of course the radiation from each pole is elliptically polarized, say, clock-wise but 
that polarization is compensated by the counter-clock-wise radiation from the next 
pole. 


26.13 The best approach to solve this design problem is to use a computer program 
like EXCEL. There are many parameters and the photon wavelength must be 
calculated many times. Specify beam energy, period length and min/max desired 
wavelengths. Calculate fundamental wavelength for Kmin/Kmax. For reasonable 
intensities Kin 2 0.5. Kmax determines the maximum field. At this point we can 
only guess, say Kmax = 6. Calculate min/max wavelength or photon energies for up 
to seventh harmonic. Try to find period length which covers the desired spectrum. 
From the undulator strength parameter calculate the maximum field strength. The 
maximum field in a permanent magnet undulator depends on the gap and period 
length. For a closed undulator assume a minimum gap of +10mm. That means 
to reach, say 1.5T, the period length should be longer than about 5—6cm (see 
Fig. 24.5). Note, do not assume that you must find a perfect solution that covers 
the whole desired spectrum. If, for example, you desire low energy photons from an 
unduator in a high energy storage ring you will not find a solution with a reasonable 
value of K. Sometimes the harmonic spectra just will not close. In this case several 
undulators with different period length and automatically exchangeable can be used. 
Such undulators are also known as “revolver’-type undulators. 


26.14 The fundamental wave length for a weak undulator (K <1) are 
A (800 MeV) = 102A and A (7 GeV) = 1.33 A which are the shortest achievable 
wavelength. For a 10 mm gap the field is from (24.5) B = 1.198 T and the maximum 
value of the strength parameter is K = 5.595. With this, the longest fundamental 
wavelength is A = 1698.5 A for the 800 MeV ring and A = 22.14A for the 7 GeV 
ring. 


26.15 The short wavelength limits are given for a weak undulator (K < 1) 
and are A = 3.13A for Ap = 15mm and A = 15.7A for Ap = 75mm. 
The long wavelength limits are determined by the magnetic fields when the 
undulator gaps are closed to 10mm. The fields are from (24.5) B (Ap = 15mm) = 
0.19T and B (Ap = 75mm) = 1.66T, respectively. The undulator strengths are 
K (Ap = 15mm) = 0.270 and K(Ap) =75mm) = 1.35 and the wavelengths 
A (Ap = 15mm) = 3.24 A and aA (Ap = 15mm) = 30.0 A. The tuning range is 
very small for the 15 mm undulator and about a factor of two for the long period 
undulator. The tuning ranges are so different because the K-value can be varied 
much more for longer period undulators. 
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Solutions for Chap. 27 


27.1 We may solve this problem two ways. First, we use — drift velocity 
Bp = Bp (1 — ‘) and calculate the time 4, = 3 ~ B e(1+2 + ) it takes the 


electron to travel along one period. During that same time the ae travels a 
3) and the difference is ds = sy —Ap = 
y 


: _ dp 
distance s, = ch = 2 ( 


Ap ( - 1) + 4 KB EN, — (1 + $K?), which is just equal to the fundamental 
radiation wave length. We may also integrate the aes length along the sinusoidal 
trajectory and get for one quarter period s, = By os i > JT + @ cos? xdx = 
#;EllipticE(—07) which is the Legendre elliptical integral of the second kind. 
Since the argument will always be very small, we may expand EllipticE(—67) ~~ 
7 0. 39367 for small arguments and get the electron travel time for one period 


te = = 424 F (4 + 0.39367). The path length difference between the photon, A, and 


ru 


8 - 0.393 
electron cf, is ds = Cte —Ap = Ang (1 + “pa —Xp i + = 5K? 


al 


which is again the fundamental wavelength of radiation. 


27.2 In a helical undulator the transverse as well as the longitudinal motion are 
uniform and therefore there are no varying relativistic effects perturbing the uniform 
motion. A Fourier transform of the motion will therefore only exhibit one, the 
fundamental harmonic . 


27.3 The total energy of the photon flux is E, = pN.E into a band width of 


A 1 
aS = Me and the number of coherent x-ray photons of energy ¢, per electron 


IS Nphcoh = oe = 717.5. On the other hand the radiation pulse energy from 


an undulator into the forward cone per electron and band width — = oF is 
Misi = Sma (1+ 5K) A 1 (K) = 0.015. The SASE process accounts for about 


five orders of magnitude in photon flux per electron. 


Appendix A 
Useful Mathematical Formulae 


A.1_ Vector Algebra 


Electric and magnetic fields are vectors which are defined by direction and 
magnitude in space E(x, y, z) and B(x, y,z), where we use a Cartesian coordinate 
system (x, y, z).The distribution of such vectors is called a vector field in contrast to 
a scalar field such as the distribution of temperature T(x, y, z). In component form 
such vectors can be written as 


E=E,x+By+E,z. (A.1) 
E+ B= (E, + By)x + (Ey + By)y + (E, + Bz (A.2) 
EB = E,B, + E,B, + E,B, = |E| |B| cos 0 (A.3) 


where @ is the angle between the vectors, and the 
ExB= (E\B, ~~ E-By, E.B, — E,B:, EY By = E\B,) , (A.4) 
|E x B| = |E| |B| sind. 


The resulting vector is orthogonal to both vectors E and B and the vectors [E, B, 
E x B] form a right handed orthogonal coordinate system. 
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984 A. Useful Mathematical Formulae 
A.l.1 Differential Vector Expressions 


To describe the variation of scalar and vector fields a gradient for scalars if defined 
by 


oT OT =) (A5) 


VT = gradT = {| —, —, — 
. ( dx’ dy dz 
which is a vector. 

For vectors we define two differential expressions. The first is the divergence of 
the vector field: 


OE,  OEy . OE; 


VE = divE = ; 
" a oe We 


(A.6) 


which is a scalar. Geometrically, the divergence of a vector is the outward flux of 
this vector per unit volume. As an example consider a small cube with dimensions 
dx,dy.dz. Put this cube in a uniform vector field and you get zero divergence, because 
the flux into the cube is equal to the flux out. Now, put the cube into a field free area 
and place a positive charge into the cube. The flux of fields is all outwards and the 
divergence is nonzero. 

The divergence can be evaluated by integrating over all volume and we get with 
Gauss’s integral theorem (A.25) 


/ VEdV = fin da, (A.7) 
V 


where 7 is a unit vector normal to the surface and da a surface element. The volume 
integral becomes an integral over the outer surface. 
The second differential expression is the ’curl” of a vector: 


(A.8) 


VxB= (F _ 0By OBy 9B, OBy _ = 


dy dz’ dz dx’ dx dy 


The ’curl” of a vector per unit area is the circulation about the direction of the vector. 


A.1.2 Algebraic Relations 


a(bxc)=b(c xa) =c(axb) (A.9) 
ax (b xc) = b (ac) —c (ab) (A.10) 
(a x b) (ce x d) = (ac) (bd) — (bc) (ad) (A.11) 
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ax (bxc)+bx(exa)+ex (axb) =0 (A.12) 


(a x b) x (cx d) =c [(ax b) d]-—d |(a x b)c] 
(A.13) 


A.1.3 Differential Relations 


V (ay) = gVa + aVo (A.14) 

V x (av) = 9 (V xa) -—ax Vo (A.15) 
V(axb) =b (V xa) —a (V xb) (A.16) 

V x (ax b) = (bV)a— (aV)b + a(Vb) —b (Va) (A.17) 
V (ab) = (bV)a+ (aV)b+ax(V xb) +b)x (V xa) (A.18) 

V x (Vg) = 0 (A.19) 
V(V xa) =0 (A.20) 
Vx (V xa) = V(Va) — Aa (A.21) 


A.l.4 Partial Integration 


Partial integration is defined by 
b b 
i uv'dx = uv|? — i vuldx, or (A.22a) 


b b 
/ udv = uv|? — i vdu (A.22b) 


A.1.5 Trigonometric and Exponential Functions 


e* = cosx +isinx 


a, Ih ix —ix ‘ — 1 [pix _ ,-ix 
cosx = 5 (e* +e") sinx = ; (e*-e") 
cos (a + b) = cosacosb ¥ sinasinb sin(a+ b) = sinacosb + sinbcosa 
_— tana-ttanb _— cotacotb#l 
, tan (a + 2) ~~ 1-Ftanatanb cot G + b) ~~ “cotacot b 
a en 2 a oe 
qq ana = = = 1+ tan’a qq arctana = ja 
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A.1.6 Integral Relations 


A. Useful Mathematical Formulae 


Pe dr = dvi do (A.24) 
S 
[ve dr = fai do Gauss’ theorem (A.25) 
V S 
i (Vxa)tdo = fads Stokes’ theorem (A.26) 


s 


A.1.7 Dirac’s Delta Function 


{ co forx = 0 


=") 6 forge £0 


lal f S(ax)dr= f 8) dy=1 


§ (x) = + * ein 


n=—oCo 


A.1.8 Bessel’s Functions 


Order n and first kind: 


n@= >> 


p=0 


n=0 
n= 1.923422 


f 6 (x) dx = 1 
6 (w) = ny 6 (the 2" d¢ = 1 


os v= 3 6 (x — 2xm) 


derivatives and recursion formulas with J, = J, (x): 


2n 
Jn+1 = —In — In-1 
x 


! n n 
J = Jn-1 — In = —In — Inti = 
x X 


n=—CoO m=>=—CoO (A.27) 
(CU? (4/2)"* 
AG eal (A.28) 
: ae : _ (A.29) 
(A.30) 
1 
5 (In—1 — Jn+1) (A.31) 


A.1 Vector Algebra 


first four roots of Bessel’s functions of the first kind: J, (x;) = 0 


Jn (x1) Jn (x2) Jn (3) Jn (x4) 
2.4048 5.5200 8.6537 = 11.7954 
3.8317 7.0155 = 10.1743 

5.1356 8.4172 11.6198 

6.3801 9.7610 


WNnNr OSs 


A.1.9 Series Expansions 


Fors < 1 
1 1 
| ye a 
e aT Tae So 
1, 1; 
In(Ql—x) = —x-—=x°-—=x-... for -l<x<1l 
2 3 
| , 1 
aa + 5% - 
1, 1, 
ame 1 age 8 
1 1 1 
¥14+6=14 -6-—8 + —8.... 
ms a 23 + 55 
1 
—— = 1-84 8 -8+..... 
1+6 


A.l.10 Fourier Series 
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(A.32) 


(A.33) 


(A.34) 


(A.35) 


(A.36) 


(A.37) 


(A.38) 


A function f (¢) is periodic if f (t) = f (t+ T) = f (t+nT) where n is an integer 
and T the lowest value for which this statement is true. Such a function can be 


expressed with t = + by 


1 [oe 
f(D = 540 + > [an Cos (2ent + Vy) + by sin (2ant + Vn)] 


n=1 


(A.39) 
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or using exponentials 
ioe) 
(A.40) 


f= DP ene™ 


n=—oo 


where c, = c_, are complex with c, = C_, and co = (f (t)) . The coefficients are 


1 
— / f(t) e "dr where t= t/T (A.41) 
0 
Parseval’s Theorem 
CO 1 CO 
i F’(#) dt = — i F’(a) do, (A.42) 
25 2H Js 
where F(t) = = [ F(w)e~'' dw and F(w) = f F(t) e dt. 
Fourier Transform 
For non-periodic functions f (t) with (T — oo) the Fourier transform is 
oo : 
F(a) =| fe dt (A.43) 
—0o 
and 
1 s% 
f= =| F(w)e "da (A.44) 
Qt J 66 
A.1.11 Coordinate Transformations 
Cartesian coordinates 
ds? = dx’ + dy* +. dz 
dV = dxdydz 
dy dw ow 
Vy =| —, —, — A.45 
¥ ( ox dy dz 
da, Oda, 0a, 
Va = x y Z 
a OK dy a dz 
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da, Ody day da, day da, 
Vxaz= = _ a= 
dy dz dz ox’ dx dy 
_ ow ew ow 


x2 " dy? ” Oz? 


Ay 


General Coordinate Transformation 


Transformation into new coordinates (u,v, w) , where x = x(u, v, Ww), 
y = y(u,v,w) and z = z(u,v, w) 


2 
ds U2 + y2 as w2 
ay — und 
U VW 
ow ow _ Ow 
= —, V— ,w— A.46 
ve (uv ou dv Ls ( ) 
va = UVW 0 ay 0 a ‘ 0 ay 
—— duVW | dv UW dwUV 


Vw[i@—-— 2a fo du _ 8 aw 
Vxa=| Ex in V1 UW La aH| 


du V dv U 


_ 0 ( U ow 0 ( V ow 0 ( Wow 
read E (a x) - dv (ay me) + dw (a x) 


where 


2 2 2 
PAGO on 
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ox 2 ae 
i“ . ss 
Oa, + ave. + aN, : 
dy Oz 


ox 
=a,Ww— WwW W— . 
7 ow 7% ow +4: ow 


and 


Cylindrical Coordinates 
Transformation to cylindrical coordinates (r, ¢, ¢) 
(r cosg,r sing, C) 


(x,y) = 
ds? = dr’ + r'dg* + dé? 


dV = rdrdgdé 
ow low %| (A.48) 


ue 


io _ 
ag” 0c dr’ ror 
ley ey 
ae? 


ey low 
a= or r or r? dy? 


Polar Coordinates 
Transformation to polar coordinates (r, ¢, 7) 


(x,y,z) = (r cosg sin6,r sing sin6,r cos 6) 

ds? = dr’ +?’ sin? 6 dy” +r dé? 

dV =r’ sin@ drdg dé 
dy low 1 aw 

Vy = , ; AA 

‘a E- Fan 00 an) 
la_, 1 oO 1 dag 

a= aa a) + 9 rsin@ dg ena ot rsin6 00 
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1 a(sin 6 az) = dag 1 dar sin @ d(rag) 
rsin@ dg 00 |’ rsin@ \ 06 or 2 
1 fa _ Ody 
r (2 (ra) ag ) , 


19 (,a¥ 1 ee 1 Of. ap 
ci ¢ )+siet ma amar 


V xX a= 


Curvilinear Coordinates 
Transformation to curvilinear coordinates of beam dynamics 


ds? = dx? + dy? + (1+ kax + yy)? dz? = dx? + dy? + Ide? 


dV = dxdyhdz 

Vy = My ef wy + ie 

— i kes ¥ ee at eat (A.50) 
ran [ Mee BML) 


“ipa (av). a (av). a (lay 
ware (elt ele l+e (Es) 


Appendix B 


Physical Formulae and Parameters 


B.1_ Physical Constants 


Speed of light in vacuum 


Electric charge unit 
Electron rest energy 
Fine structure constant 
Avogadro’s number 
Molar volume at STP 
Atomic mass unit 
Classical electron radius 
Classical proton radius 
p/e mass ratio 
Planck’s constant 
Planck’s constant 


Compton wavelength 
2 of a leV photon 
Thomson cross-section 
Boltzmann constant 
Stephan-Boltzmann 


hic 
Ac 
hc/e 
OT 

k 

o 


= 2.99792458 x 10° m/s 
1.602176462 x 107! C 


= 0.510998902 MeV 

= 1/137.04 

= 6.0221367x 10 ~—-I/mol 
22.41410 x 1077 m?/mol 

= 931.49432 MeV 


= 2.8179403 x 1075 =m 
= 1.534698x 107'8 =m 
= 1836.2 

= 4.1356692x 107! eVs 
= 6.5821220x 10-1 eVs 


= 197.327053 MeV s 
= 2.42631058x 107'? m 
= 12398.424 A 


= 0.66524616 x 10778 m? 
= 1.3806568x 10° J/K 
= 5.67051 x 10-8 
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994 B_ Physical Formulae and Parameters 


B.2. Relations of Fundamental Parameters 


@ 


fine structure constant a = Treofic 
e 


classical electron radius r, = Gone 
e? = 14,399652 eVA 


B.3 Unit Conversions 


permittivity of vacuum € 9 = 8.85418782 x 107'? C/(Vm) 
permeability of vacuum jp = 1.25663706 x 10-° Vs/(Am) 


See Tables B.1, B.2, and B.3. 


Table B.1 Energy conversion table 


Calories Wavenumber degKelvin 
(cal) (1/cm) (°K) 
Ieal | 1 4.186 2.6127 10!9 | 2.1073 107 3.0319 1073 
1J 0.23889 1 6.2415 10!8 5.0342 107? 7.2429 107 
leV 3.8274 107° 1.6022 107° 1 8065.8 11604 
lcm 4.7453 10~4 1.9864 10-73 1.2398 10-4 1 1.4387 
1°K 3.2984 10774 1.3807 10773 8.6176 10-> 0.69507 1 
Table B.2 Equation conversion factors 
Variable Replace cgs variable By SI variable 
Potential, voltage Vegs 41 € VuKs 
Electric field Ecgs 4 € Ewvxs 


Current, current density 


TegssJegs 


1/./42€o Iuxs. jus 


Charge, charge density q.P 1/4/47€0 dus, PMKs 
Resistance Regs V 42€0 Rus 
Capacitance Cogs 1/./42€ Cuxs 
Inductance Legs V 42€o Luis 
Magnetic induction Begs V 41/0 Bus 
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Table B.3 Numerical conversion factors 
Quantity Label Replace cgs units By SI units 
Voltage U lesu 300 V 
Electric fed fe [tesu—————*4d 10 Wie 
Current 10 ¢= 2.9979 10° A 
Charge q (10c)~! = 3.3356 107!°C 
Resistance R 8.9876 10!! 2 
Capacitance G (1/8.9876) 1071! F 
Inductance L 1 10° Hy 
Magnetic induction B 310-4*T 
Magnetic field H 1 Oersted 1000/4xr = 79.577 A/m 
Force iv ldyn 10-°N 
Energy E lerg 1077J 
B.4. Maxwell’s Equations 
VE = a p, Coulomb’s law 
VB = 0 
VxE= 8 Faraday’s law (B.1) 
VxB=poupvt ae Ampeére’s law , 
F = gE+q{v xB] Lorentz force 
where €, ju are the relative dielectricity and permeability, respectively. 
B.5 Wave and Field Equations 
Definition of potentials 
vector potentialA: B=VxA (B.2) 
: dA 
scalar potential @ : == VQ, (B.3) 
Wave equations in vacuum 
1 0?A 
ha OE 2, Ee (B.4) 
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Ag] ee (B.5) 


Vector and scalar potential in vacuum 


1 vp(x, y, Z) 
A(t) = - dx dyd. B.6 
(t) ax !/ R om y z ( ) 
1 X,Y,Z 
e(t) = —— / PUNO) graves (B.7) 
4m €o R es 


Vector and scalar retarded potentials for a point charge g in vacuum 


1 gq 8 


A(P,t) = — —___ B.8 
et) 4nceo R1+nB |, =) 
1 gq 1 
Pt = ant B.9 
oe) 4neo R1+nB|,., oo 
Radiation field in vacuum 
1 ; 
EQ =; = \R x [@ + BR) x 6] (B.10) 
TCEQ I ret 
cB (1)=[E x n,,, (B.11) 


B.6 Relativistic Relations 


B.6.1_ Lorentz Transformation 


Quantities x* etc. are taken in the particle system £*,while quantities x etc. refer to 
the laboratory system £. The particle system £* is assumed to move at the velocity 
B along the z-axis with respect to the laboratory system L. 

Lorentz transformation of coordinates 


x* 10 0 O 


x 
y* 01 0 O y 

= . B.12 
Zz 00 y —By Z yee 
ct 00-By y ct 


Lorentz transformation of frequencies (relativistic Doppler effect) 


w= o*y (1+ Bn*) (B.13) 
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Lorentz transformation of angles (collimation) 


sin 0* 


B.6.2  Four-Vectors 


Properties of 4-vectors are used in this text to transform physical phenomena from 
one inertial system to another. 
Space-time 4-vector 


S = (x,y, Z, ict) , (B.15) 
World time 
ct = V—S2. (B.16) 


Properties of 4-vectors 

length of a 4-vector is Lorentz invariant 

any product of two 4-vectors is Lorentz invariant 
Lorentz transformation of time. From (B.16) 


cdt = yc? (dt)? — (dx)? — (dy)? — (dz)” 


= ye —(v2+ w+ v2)dt 


= Vc? — v2dt = V1 — p2cdt 
or 


1 
dr = —dr. (B.17) 


Velocity 4-vector 
_ ds ds bes 
v= —=y—=7(4,5,Z ic). (B.18) 


4-acceleration 


~ dv d ds 
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4-acceleration ad = (a dy, Gz, id,) in component form 


a, = y"a, + y*B, (Ba) , (B.20) 
where a is the ordinary acceleration. 
B.6.3 Square of the 4-Acceleration 
a = y® {a — [B x al’ = a”? (B.21) 
in particle system (6 = 0, y = 1) 
a? = a*?. (B.22) 


B.6.4 Miscellaneous 4-Vectors and Lorentz Invariant 
Properties 


B.7 Transformation Matrices in Beam Dynamics 


In this section, we will collect transformation matrices for elements discussed in 
various parts of this book. Generally, we assume the following designations: 


cosine and sine like solution and dispersion 


C(z), S(z), D : ; 
@,5@),P@ function, respectively 


C’(z), S’(z), ..ete. 


derivatives are taken with respect to z 


u(z) u(z) can be either x(z) or y(z) 
£ path (arc) length of element 
5 = Ap/po relative momentum error 
og = V|kole quadrupole phase 
f quadrupole focal length 
O=,/ |k +k? |e deflection angle of synchrotron magnet 
06=k,l = t/po deflection angle of bending magnet 
magnet entrance and exit angles with respect to 
no and Ne 
sector magnet, 79 = Ne = —0/2 < 0 for rect. magnet 
L straight length of bending magnet 
2G full magnet gap aperture 


B.8 General Transformation Matrix 


4-vector: 
space-time 

5 = (r, ict) 
momentum-energy 
cp = (q@p, iE) 
wave number 

ck = (ck, iw) 
velocity 

0 =) (F, ic) 
acceleration 
a@=yat y*B (Ba) 
current divergence 
T= (7508) 


current density 


7 = (iuipo) 
force, 

p = (cp, iE) 
field potential 
A = (A, id) 
4-divergence 

Vv =(-V,i2) 


T : world time 


999 


invariance of 


world time 
2 


2 


F =—-c*t 
mass 
Op? = —A2mc4 
isotropy of space 
Ck = 0 
speed of light 
v=? 
radiation intensity or Poynting vector 
@ =[yaty'B Ba} 
charge conservation 
2 2 
wo-(2) =-(2) =0 
charge density 
P= pC 
inertial system 
op: =0 
Maxwell’s equations 


d’ Alambertian 


B.8 General Transformation Matrix 


u(z) C(z) S(z) D(z) 
u'(z) |} = | C’(z) S'(z) D(z) 
6 0 0 


uo(z) uo(z) 
ub(z) } =M{ wh(2) (B.23) 
6 6 


B.8.1 Symmetric Magnet Arrangement 


Moot = M,M = ( 


/ 7 / 
CS’ + SC 2SCS ) (B.24) 


2CC—s CS’ +- SC’ 


In a symmetric magnet or lattice segment, the diagonal elements of the transforma- 


tion matrix are equal. 
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B.8.2. Inverse Transformation Matrix 


CS 


rm=(%5 


: Ae Ss’ —S 
=> inverse matrix is M; = ; (B.25) 


and the total transformation matrix is Mj. = .M;M = ({ ') 


B.9 Specific Transformation Matrices 


B.9.1_ Drift Space 


Length of drift space is ¢ 


120 
Ma={010 
001 


B.9.2_ Bending Magnets 
Sector Magnet 


With ¢ being the arc length and po the bending radius, the transformation matrices 
are in deflecting plane 


cos? psin@ p(1 —cosé@) 
M5 = — sin@ cosé sin 0 (B.26) 
0 0 1 


and in the non-deflecting plane (including edge focusing) 


1+ 5065 e 9 

2, 

£1405 0], (B.27) 
0 0 1 


Myo = | 2 


where 6¢ = G/p and 2G the gap size between magnet poles. 
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Wedge Magnet 


In deflecting plane 


cos 6 — sin @ tan no psin@ p (1 —cos 6) 
tan je +tan no t 
et cos 0 . sin 0 
Mvp = | 1-tan he tan no sing °° 6 — sin 6 tan ne <Ginetasc6ee) 
0 0 1 
(B.28) 
and in the non-deflecting plane 
_ ty £ 0 
Mwo(|0) = | —4 (te + to) + Steto 1— Ste 0 |, (B.29) 
0 0 1 
where tp. = — tan 0,6 — +30.¢ and do. = ea 
Rectangular Magnet 
In the deflecting plane with np = n. = —0/2 
1 psin@ p(1—cos6) 
Mrp (E10) =] 0 1 2tan(6/2) |, (B.30) 
0 O 1 
and in the non-deflecting plane 
i=, 66 8 
M0 (£|0) = “pte lor o|, (B.31) 
0 0 1 
where = a , tan ($) (1-42), L = 2psin a the straight length and £ the arc 


length of the magnet. 
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Synchrotron Magnet (Sector Type) 


For a focusing synchrotron magnet with K = k + «2 > 0 the transformation matrix 
in the deflecting plane is with (O = /K€,x, = 1/p) 


C 1 wig i-cosO 
cos @ Je sin (2) pK 
My s(€,0) = | -/Ksin@ cos@ oe (B.32) 
0 0 1 


and when K = k + x2 < 0 the transformation matrix for a defocusing synchrotron 
magnet is 


iG sinhO __ 1~coshO 
cosh @ IKI alk ri 
Msy.a (€|0) = | ./|K| sinh © cosh @ ave (B.33) 
0 0) 1 


In the non-deflecting plane the transformation matrices are with (vy = //ké > 0) 


cos W i sinw 0 
Msys0 = | —JSksinw cosy 0 (B.34) 
0 0 1 


and for k < 0 and yw = y/Jk| € 


1 . 
cosh y Tie sinhy 0 
Mosysa = | J|kl sinh coshy 0 (B.35) 
0 


0 1 


Synchrotron Magnet (Rectangular Type) 


For a focusing synchrotron magnet K = ky + k? > O and with O = VKE, 
6 = Ll/p, ky = 1/p, ; = /K tan6/2 


iG a ee sin@ (1—cos @) 
cos @ + PW sin UE ae 
1 1 : C 
Meit= 7 (cos + 57g sin 2) cos @ shai 


+; cos 0 — VK sin @ +g sin @ pVK 
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In a defocusing synchrotron magnet K = ko + k? < 0 and with 


O= V|K|l,0 = €/p, kx = 1/p, ; = \/|K| tan 6/2 


sinh 9 sinh O __ U=cosh @) 
cosh @ + FI vi nak 
sinh O h sinh © 
vnuz| fore ate) sone 
+;cosh@ + VK sinh + —g sinh @ +4 eos 
0) (0) i| 


B.9.3 | Quadrupole 


Focusing Quadrupole (ko >0,9= Jkt) 


cos Y Ti sing 0 
Mor = —VJ/k sing cosp 0 (B.36) 
0 0 1 


Defocusing Quadrupole (k <0,9=V/ lke) 


cosh Tn sinhg 0 
Mav =] \/|klsinhg coshg 0 (B.37) 
0 1 


Quadrupole Doublet 


A quadrupole doublet formed by two quadrupoles of focal length f; and f) and 
separated by the distance d has the transformation matrix 


l=djf, a 0 
Ma=|- <7" 1-660 |. (B.38) 
0 0 1 
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Quadrupole Triplet 
Symmetric quadrupole triplet made of two equal doublets 


1—2d?/f? 2d(1+d/f) 0 
Mr=M:M={ -1/f* 1—2d2/f? 0 |, (B.39) 
0 0 1 


re 
where re = 7 + 5 — AR: 


